Pull requests: openai/evals

Pull requests list

add domain parser eval
#122 opened Mar 15, 2023 by iamsk
5 of 12 tasks
Fix pip install path in mmlu.ipynb
#121 opened Mar 15, 2023 by RangeKing
Add Integer Multiplication Eval
#119 opened Mar 15, 2023 by jacklightChen
12 tasks done
README.zh.md
#118 opened Mar 15, 2023 by aidreamwin
12 tasks
Add eval for parsing and normalizing relative datetimes
#117 opened Mar 15, 2023 by HarrisonJackson
11 of 12 tasks
Fibonacci word selection character count total
#115 opened Mar 15, 2023 by mstooks
11 of 12 tasks
Refactor code to improve readability and error handling
#114 opened Mar 15, 2023 by surajptl
12 tasks done
Add infix to postfix conversion eval
#113 opened Mar 15, 2023 by vinhowe
11 tasks done
Add Born First Eval
#112 opened Mar 15, 2023 by njbbaer
11 of 12 tasks
Improve cryptographic pattern matching with standard playfair cipher
#110 opened Mar 15, 2023 by dash-tobin
12 tasks done
README.jp.md
#109 opened Mar 15, 2023 by 0xkf
12 tasks done
Add Base 64 Decoding Eval
#108 opened Mar 15, 2023 by MrDevel0per
12 tasks done
Add Chinese pronunciation Test
#107 opened Mar 15, 2023 by JacobLinCool
10 of 12 tasks
Add quadratic-from-three-points eval (3% accuracy, 100 samples)
#106 opened Mar 15, 2023 by nottheswimmer
12 tasks done
Sentiment Analysis
#104 opened Mar 15, 2023 by Aaronmanuel
Add Recursive Functions eval
#103 opened Mar 15, 2023 by aaronsmithtv
12 tasks done
Parse email and invoice data
#102 opened Mar 15, 2023 by DavidPatterson-Cole
11 tasks done
Aristotle poems
#101 opened Mar 15, 2023 by Aaronmanuel
12 tasks
go_board_count eval
#100 opened Mar 15, 2023 by SonOfLilit
12 tasks done
Market momentum
#99 opened Mar 15, 2023 by jameslholcombe
12 tasks done
Word Count Eval (40% accuracy, 100+ samples)
#96 opened Mar 15, 2023 by ricky-sb
12 tasks done
Legal Ethics - Model Rules of Professional Conduct - True/False
#95 opened Mar 15, 2023 by avery-bub
12 tasks done
Add modelgraded evaluation for iambic pentameter
#94 opened Mar 15, 2023 by ianonavy
11 of 12 tasks
Add Numeric Sort Eval
#93 opened Mar 15, 2023 by AlbertGozzi
12 tasks done