Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / birgermoell/swedish-medical-benchmark issues and pull requests

#16 - Added specialist tests with new format and conversion script

Pull Request - State: closed - Opened by BirgerMoell 5 months ago

#16 - Added specialist tests with new format and conversion script

Pull Request - State: closed - Opened by BirgerMoell 5 months ago

#15 - Added encrypted test files

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#15 - Added encrypted test files

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#14 - Emergency medicine

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#14 - Emergency medicine

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#13 - Added GP questions

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#13 - Added GP questions

Pull Request - State: closed - Opened by BirgerMoell 6 months ago

#12 - Add swedish theoretical doctors exams: clinical cases

Pull Request - State: closed - Opened by northern-64bit 6 months ago - 1 comment
Labels: enhancement

#12 - Add swedish theoretical doctors exams: clinical cases

Pull Request - State: closed - Opened by northern-64bit 6 months ago - 1 comment
Labels: enhancement

#11 - Fixes the benchmark set up

Pull Request - State: closed - Opened by northern-64bit 9 months ago - 1 comment
Labels: enhancement

#10 - Adds LLM runs under results folder

Pull Request - State: closed - Opened by northern-64bit 9 months ago
Labels: enhancement

#9 - Add the remaining PubMedQA-L-SWE translations

Pull Request - State: closed - Opened by northern-64bit 9 months ago - 1 comment
Labels: enhancement

#8 - Adds API accesable LLM:s (like GPT-4) with LiteLLM + fixes

Pull Request - State: closed - Opened by northern-64bit 10 months ago - 1 comment
Labels: bug, enhancement

#7 - Refactor code structure + detailed metrics

Pull Request - State: closed - Opened by northern-64bit 10 months ago - 1 comment

#6 - Refactor eval code

Pull Request - State: closed - Opened by LinusJohaansson 10 months ago

#5 - Benchmark against GPT-4

Issue - State: open - Opened by BirgerMoell 10 months ago

#4 - Keep the benchmark balanced

Issue - State: open - Opened by BirgerMoell 10 months ago

#3 - Chatbot Arena style human evaluation

Issue - State: open - Opened by BirgerMoell 10 months ago

#2 - Adding more items to PubMedQA-L-SWE

Pull Request - State: closed - Opened by northern-64bit 10 months ago - 1 comment
Labels: enhancement