動画検索
関連広告
検索結果
Evaluating Large Language Models Trained on Code (Codex)
Outline
A language model for code
Evaluation framework
Nuances of pass@k estimation
Evaluation details
Prompting for evaluation
Loss scaling and temperature
Sampling heuristics and BLEU score
Results on the APPS dataset
Code generation examples
Supervised Fine-tuning: Results
Docstring generation
Limitation: sample efficiency
Limitation: degradation with docstring length
Broader Impacts and Hazard Analysis
Misalignment
Misalignment Results
Bias Analysis
Bias probes
Economic Impact Analysis
Security implications
Insecure code generation
Risk Mitigation
Summary
Announcements
Plans for Today
C code
Knocking Off Goalposts
Parsing Code Info
Generating the Code Section
Troubleshooting with Disassembler
Erlang Function Signatures
Getting Ahead of Myself
Empty Export Table
Instruction Format
Pushing Label
func_info
label_count and function_count
export bada:bada/0
Troubleshooting
Jumping to the Second Label
I don't need Elixir
You only need Vector and write() function
Generating Atom List
Generating Atoms Chunk
Collecting Function Labels
Successfully Compiled Module
Unhardcode input/output paths
Outro
求める情報が見つからない場合は、キーワードや指定した条件を変えてみてください。