akanyaani/README.md

👋 Welcome to My GitHub Profile!

I'm Abhay Kumar, Principal Research Scientist at Yellow.ai, with over 8 years of experience in Data Science and NLP and a deep passion for training neural networks, particularly language models and large language models (LLMs).

🌟 Highlights

  • Principal Research Scientist at Yellow.ai
  • Co-Author of Komodo LLM: An open-source, language-specific LLM for Indonesian.
  • 8+ years of experience in Data Science and NLP, with a focus on Language Modeling.

💡 Projects

  • Komodo LLM: A foundational large language model tailored for a specific language.
  • GPT2-TF: Implementation of GPT-2 in TensorFlow 2, recognized as the first repository for GPT-2 in TensorFlow 2.
  • miniLLAMA: A straightforward and compact implementation of the LLaMA model, inspired by Andrej Karpathy's minGPT.

Pinned

  1. gpt-2-tensorflow2.0 (Public)

    OpenAI GPT-2 pre-training and sequence prediction implementation in TensorFlow 2.0

    Python · 261 stars · 83 forks

  2. ranknet-tensorflow2.0 (Public)

    Implementation of RankNet to LambdaRank in TensorFlow 2.0

    Python · 39 stars · 7 forks

  3. miniLLAMA (Public)

    A simplified LLAMA implementation for training and inference tasks.

    Python · 27 stars · 1 fork

  4. minGPTF (Public)

    A TensorFlow re-implementation of Karpathy's minGPT (Generative Pretrained Transformer) training

    Python · 6 stars · 1 fork

  5. Illustrated_GPT2_With_Code (Public)

    The GPT-2 Transformer model explained step by step with code.

    Jupyter Notebook · 14 stars · 4 forks

  6. Phrase_Extraction_Bi-LSTM (Public)

    Phrase extraction using a bidirectional LSTM

    Python · 11 stars