Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / birgermoell/swedish-medical-benchmark issues and pull requests
#16 - Added specialist tests with new format and conversion script
Pull Request -
State: closed - Opened by BirgerMoell 5 months ago
#16 - Added specialist tests with new format and conversion script
Pull Request -
State: closed - Opened by BirgerMoell 5 months ago
#15 - Added encrypted test files
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#15 - Added encrypted test files
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#14 - Emergency medicine
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#14 - Emergency medicine
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#13 - Added GP questions
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#13 - Added GP questions
Pull Request -
State: closed - Opened by BirgerMoell 6 months ago
#12 - Add swedish theoretical doctors exams: clinical cases
Pull Request -
State: closed - Opened by northern-64bit 6 months ago
- 1 comment
Labels: enhancement
#12 - Add swedish theoretical doctors exams: clinical cases
Pull Request -
State: closed - Opened by northern-64bit 6 months ago
- 1 comment
Labels: enhancement
#11 - Fixes the benchmark set up
Pull Request -
State: closed - Opened by northern-64bit 9 months ago
- 1 comment
Labels: enhancement
#10 - Adds LLM runs under results folder
Pull Request -
State: closed - Opened by northern-64bit 9 months ago
Labels: enhancement
#9 - Add the remaining PubMedQA-L-SWE translations
Pull Request -
State: closed - Opened by northern-64bit 9 months ago
- 1 comment
Labels: enhancement
#8 - Adds API accesable LLM:s (like GPT-4) with LiteLLM + fixes
Pull Request -
State: closed - Opened by northern-64bit 10 months ago
- 1 comment
Labels: bug, enhancement
#7 - Refactor code structure + detailed metrics
Pull Request -
State: closed - Opened by northern-64bit 10 months ago
- 1 comment
#6 - Refactor eval code
Pull Request -
State: closed - Opened by LinusJohaansson 10 months ago
#5 - Benchmark against GPT-4
Issue -
State: open - Opened by BirgerMoell 10 months ago
#4 - Keep the benchmark balanced
Issue -
State: open - Opened by BirgerMoell 10 months ago
#3 - Chatbot Arena style human evaluation
Issue -
State: open - Opened by BirgerMoell 10 months ago
#2 - Adding more items to PubMedQA-L-SWE
Pull Request -
State: closed - Opened by northern-64bit 10 months ago
- 1 comment
Labels: enhancement
#1 - Evaluate benchmarks from BioMistral to decide which ones are good to translate use
Issue -
State: open - Opened by BirgerMoell 10 months ago
- 5 comments