Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tatsu-lab/alpaca_eval issues and pull requests

#351 - Add Infinity-Instruct-3M-0613-Mistral-7B to AlpacaEval

Pull Request - State: closed - Opened by cszhengyh 7 days ago - 1 comment

#350 - cannot reindex

Issue - State: open - Opened by Chirobocea 7 days ago - 1 comment

#349 - [BUG] trust repo alpaca_eval

Pull Request - State: closed - Opened by YannDubs 8 days ago

#348 - Add claude-3-5-sonnet-20240620 to AlpacaEval

Pull Request - State: closed - Opened by MarjovanLier 8 days ago - 1 comment

#347 - Add OpenPipe Mixture of Agents model to Alpaca Eval

Pull Request - State: closed - Opened by saum7800 8 days ago

#345 - Add Nanbeige2-16B-Chat to AlpacaEval

Pull Request - State: closed - Opened by yuani114 9 days ago - 1 comment

#344 - Add Storm-7B, Storm-7B (best-of-64) to AlpacaEval

Pull Request - State: closed - Opened by yifan123 11 days ago - 3 comments

#342 - Add Together-MoA, Together-MoA-Lite to AlpacaEval

Pull Request - State: closed - Opened by IsThatYou 17 days ago - 1 comment

#341 - ERROR:root:Error while parsing completion:

Issue - State: closed - Opened by AGTSAAA 19 days ago - 1 comment

#339 - tensor_parallel_size can not work

Issue - State: open - Opened by AGTSAAA 19 days ago

#338 - [BUG] fix bs in VLLM and add chatml

Pull Request - State: closed - Opened by YannDubs 19 days ago

#337 - Why is max_num_seqs allowed here?

Issue - State: closed - Opened by RAY2L 20 days ago - 1 comment

#336 - Preference doesn't match log_probs in `annotations.json`

Issue - State: closed - Opened by YJWon99 21 days ago - 1 comment

#335 - confused about openai API

Issue - State: open - Opened by junkangwu 23 days ago

#334 - Add merlinite-7B-AOT to AlpacaEval

Pull Request - State: closed - Opened by imelnyk 23 days ago - 1 comment

#333 - The code for computing instruction difficulty

Issue - State: open - Opened by calvinh99 25 days ago

#332 - fix model link

Pull Request - State: closed - Opened by chujiezheng 26 days ago

#331 - Add ExPO + `Llama-3-Instruct-8B-SimPO` results

Pull Request - State: closed - Opened by chujiezheng 27 days ago - 1 comment

#330 - [ENH&BUG] improve VLLM

Pull Request - State: closed - Opened by YannDubs 28 days ago

#329 - Trouble with custom model hosted on OpenAI compatible endpoint

Issue - State: closed - Opened by tastycode 29 days ago - 1 comment

#328 - Unexpected low judge preference for some prompts

Issue - State: closed - Opened by geoalgo 29 days ago - 1 comment

#326 - Add REBEL-Llama-3-8B-Instruct to AlpacaEval

Pull Request - State: closed - Opened by ZhaolinGao about 1 month ago - 1 comment

#324 - Add Aligner 2B+GPT-4 Turbo (04/09) Results

Pull Request - State: closed - Opened by AlignInc about 1 month ago - 1 comment

#323 - Add Aligner 2B+GPT-4 Turbo (04/09) to AlpacaEval

Pull Request - State: closed - Opened by AlignInc about 1 month ago

#322 - Add Phi 3 models

Issue - State: closed - Opened by EwoutH about 1 month ago - 3 comments

#321 - [ENH] Use multi threading instead of processing

Pull Request - State: closed - Opened by YannDubs about 1 month ago

#320 - Add Llama-3-Instruct-8B-SimPO to AlpacaEval

Pull Request - State: closed - Opened by xiamengzhou about 1 month ago - 1 comment

#319 - [ENH] vicuna 1.5

Pull Request - State: closed - Opened by YannDubs about 1 month ago

#318 - [CLEAN] move evaluators lb llama3

Pull Request - State: closed - Opened by YannDubs about 1 month ago

#317 - [ENH] add LC SEM

Pull Request - State: closed - Opened by YannDubs about 1 month ago

#316 - Alpaca Evaluation Instruction Difficulty used also for Custom Evaluation Dataset

Issue - State: closed - Opened by fanconic about 1 month ago - 3 comments

#315 - Update README.md

Pull Request - State: closed - Opened by zhuang-li about 1 month ago - 1 comment

#314 - llama3 evaluator

Pull Request - State: closed - Opened by zhuang-li about 1 month ago - 2 comments

#313 - possibility of adding llama3-70b as the evaluator?

Issue - State: closed - Opened by zhuang-li about 1 month ago - 3 comments

#312 - How to use AE1 to evaluate model

Issue - State: closed - Opened by matenglearn about 2 months ago - 2 comments

#311 - [ADD] GPT4-o

Pull Request - State: closed - Opened by YannDubs about 2 months ago

#310 - Overly High Win Rate for Alpaca v2 on mistral 7b orpo

Issue - State: closed - Opened by qingquansong about 2 months ago - 12 comments

#309 - [verified] Yi-large

Pull Request - State: closed - Opened by YannDubs about 2 months ago

#308 - The n_total of n_total result is not 805

Issue - State: closed - Opened by matenglearn about 2 months ago - 6 comments

#307 - "Add Mistral-7B+RAHF-DUAL+LoRA to AlpacaEval"

Pull Request - State: closed - Opened by LiuAmber about 2 months ago - 1 comment

#306 - Add <Mistral-7B+RAHF-DUAL+LoRA> to AlpacaEval

Pull Request - State: closed - Opened by LiuAmber about 2 months ago

#305 - How to change cache path when evaluating multi-models

Issue - State: closed - Opened by bittersweet1999 about 2 months ago - 1 comment

#304 - Add Yi-Large Preview to AlpacaEval

Pull Request - State: closed - Opened by HyperdriveHustle about 2 months ago - 2 comments

#303 - How to get the LC Win Rate in AlpacaEval 1.0 version?

Issue - State: closed - Opened by RZFan525 about 2 months ago - 4 comments

#302 - Fix typo in README.md

Pull Request - State: closed - Opened by tongyx361 about 2 months ago

#301 - What to do if the log prob is not returned?

Issue - State: closed - Opened by e0397123 about 2 months ago - 7 comments

#300 - How to solve the problem of null appearing in the evaluation results

Issue - State: closed - Opened by LiuAmber about 2 months ago - 4 comments

#299 - Add ExPO results to AlpacaEval

Pull Request - State: closed - Opened by chujiezheng about 2 months ago

#298 - Add SPPO-Mistral7B-PairRM to AlpacaEval

Pull Request - State: closed - Opened by Edward-Sun about 2 months ago - 1 comment

#297 - Use verified by default

Pull Request - State: closed - Opened by YannDubs about 2 months ago

#296 - Missing result file in notebook

Issue - State: closed - Opened by geoalgo about 2 months ago - 1 comment

#295 - Missing item in results/llama-2-70b-chat-hf

Issue - State: closed - Opened by chchenhui about 2 months ago - 1 comment

#294 - Add Storm-7B to AlpacaEval

Pull Request - State: closed - Opened by yifan123 2 months ago - 3 comments

#293 - Enable analyzing evaluators/annotators on data without multiple generator models

Pull Request - State: closed - Opened by rdnfn 2 months ago - 1 comment

#292 - [ENH] verifying all the qwens

Pull Request - State: closed - Opened by YannDubs 2 months ago

#291 - add Qwen1.5-110B-Chat self-report results

Pull Request - State: closed - Opened by Lukeming-tsinghua 2 months ago - 1 comment

#290 - Unable to reproduce results

Issue - State: closed - Opened by felipemaiapolo 2 months ago - 1 comment

#289 - Add link for FsfairX-Zephyr-Chat-v0.1

Pull Request - State: closed - Opened by hendrydong 2 months ago

#288 - Add Ghost 7B Alpha to AlpacaEval

Pull Request - State: closed - Opened by lh0x00 2 months ago

#287 - Llama-3-Instruct not using official prompt template?

Issue - State: closed - Opened by ZHZisZZ 2 months ago - 1 comment

#286 - Add the evaluation result for our latest model

Pull Request - State: closed - Opened by hendrydong 2 months ago - 2 comments

#285 - [ENH] llama3

Pull Request - State: closed - Opened by YannDubs 2 months ago

#284 - How instruction_difficulty feature is obtained

Issue - State: closed - Opened by stepyndriyy 2 months ago - 1 comment

#283 - [BUG] revert to GPT4 preview 1106

Pull Request - State: closed - Opened by YannDubs 2 months ago

#282 - Confusion in Model Evaluation Results Due to GPT Updates

Issue - State: closed - Opened by yifan123 2 months ago - 2 comments

#281 - Add support for analyzing evaluators with custom cross-annotations

Pull Request - State: closed - Opened by rdnfn 2 months ago - 1 comment

#280 - Update README.md

Pull Request - State: closed - Opened by Dominic789654 2 months ago

#279 - Add Nanbeige-Plus-Chat-v0.1 to AlpacaEval

Pull Request - State: closed - Opened by yuani114 2 months ago - 2 comments

#278 - [BUG] backward compatibility with AF

Pull Request - State: closed - Opened by YannDubs 3 months ago - 1 comment

#277 - Fix KeyError at line 17; annotations['preference'] -> annotation['preferences']

Pull Request - State: closed - Opened by wjdghks950 3 months ago - 4 comments

#276 - openai_configs.yaml when using Azure only

Issue - State: closed - Opened by Yuancheng-Xu 3 months ago - 7 comments

#275 - [ENH] adding drbx and gpt4 turbo

Pull Request - State: closed - Opened by YannDubs 3 months ago

#274 - Add Nanbeige2-8B-Chat to AlpacaEval

Pull Request - State: closed - Opened by yuani114 3 months ago - 1 comment

#273 - Question about the GPT-4 API

Issue - State: closed - Opened by HypherX 3 months ago - 11 comments

#272 - With unstable GPT-4 API, I encounterd a tricky problem

Issue - State: closed - Opened by njupopsicle 3 months ago - 2 comments

#271 - With unstable GPT-4 API, I encounterd a tricky problem

Issue - State: closed - Opened by njupopsicle 3 months ago - 1 comment

#270 - Question on Using Character-Level Length

Issue - State: closed - Opened by Leymore 3 months ago - 1 comment

#269 - Logistic regression for length-controlled winrate

Issue - State: closed - Opened by normster 3 months ago - 2 comments

#268 - Updating link to a super fast demo!

Pull Request - State: closed - Opened by kyleliang919 3 months ago - 1 comment

#267 - Add Conifer-7B-DPO to AlpacaEval

Pull Request - State: closed - Opened by liulixin29 3 months ago - 1 comment

#266 - "Add Mistral-7B-LoRA-RAHF-DUAL to AlpacaEval"

Pull Request - State: closed - Opened by LiuAmber 3 months ago - 2 comments

#265 - Add <Mistral-7B-LoRA-RAHF-DUAL> to AlpacaEval

Pull Request - State: closed - Opened by LiuAmber 3 months ago - 1 comment

#264 - Add TempNet-LLaMA2-Chat to AlpacaEval

Pull Request - State: closed - Opened by xumao-nju 3 months ago - 2 comments

#262 - Add Ein-70B-v0.1 to AlpacaEval

Pull Request - State: closed - Opened by bin-bi 3 months ago - 1 comment

#261 - Supplement for Aligner

Pull Request - State: closed - Opened by AlignInc 3 months ago

#260 - Latest LC-AlpacaEval update broken?

Issue - State: closed - Opened by rraju1 3 months ago - 4 comments

#259 - Add Aligner-2B+Qwen1.5-72B-Chat & Aligner-2B+Claude3 Opus to AlpacaEval

Pull Request - State: closed - Opened by AlignInc 3 months ago - 5 comments

#258 - Yann/length correction

Pull Request - State: open - Opened by YannDubs 3 months ago

#257 - Add Mistral-ORPO-Beta to AlpacaEval

Pull Request - State: closed - Opened by jiwooya1000 3 months ago - 1 comment

#256 - Add Samba-CoE-v0.2-best-of-16 to AlpacaEval

Pull Request - State: closed - Opened by kyleliang919 4 months ago - 1 comment

#255 - Reproducing numbers for evaluator human-agreement leaderboard.

Issue - State: closed - Opened by Varun221 4 months ago - 1 comment

#253 - Add Samba-CoE-v0.2 to AlpacaEval

Pull Request - State: closed - Opened by kyleliang919 4 months ago - 1 comment