Large language models (LLMs) demonstrate potential as assistants in functional genomics, offering a new avenue for gene set analysis. In our evaluation of five LLMs, GPT-4 was the top-performing model and generated common functions for gene sets with high specificity, reliable self-assessed confidence and supporting analysis, complementing traditional functional enrichment.