bzantium/README.md

[2024.06 - Current] LLM Researcher @Kakao

  • Develop Decoder-based Embedding Models
  • Develop Proprietary LLM on TPUs

[2023.08 - 2024.05] LLM Researcher @Kakaobrain

  • Developed the Korean Language Foundation Model, a.k.a. KoGPT2 (66B)

[2020.03 - 2023.08] Machine Learning Engineer @SK Telecom & EleutherAI

  • Developed Large Language Model for SK Telecom
  • Developed Multimodal AI Service at SKT A.
    • AI Eraser (Object Removal with Image Inpainting)
      • role: project manager; implemented the pipeline algorithm (segmentation postprocessing and inpainting performance enhancement).
  • Developed the polyglot and oslo projects at EleutherAI
    • polyglot: Large Language Models of Well-balanced Competence in Multi-languages
      • role: distributed training of LMs with Megatron-LM; data crawling, preprocessing, and model evaluation. Published the 1.3B, 3.8B, 5.8B, and 12.8B polyglot-ko models.
    • oslo: Open Source for Large-scale Optimization
      • role: implementation of 1D, 2D, and 3D tensor parallelism.

Interest

  • Foundation Model / NLP / Multimodal AI

Pinned

  1. EleutherAI/polyglot Public

    Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

    477 stars, 39 forks

  2. EleutherAI/oslo Public

    OSLO: Open Source for Large-scale Optimization

    Python, 175 stars, 26 forks

  3. lassl/lassl Public

    Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets

    Python, 127 stars, 14 forks

  4. pytorch-admm-pruning Public

    Prune DNN using Alternating Direction Method of Multipliers (ADMM)

    Python, 99 stars, 18 forks