diff --git a/_apprentice_projects/2026/coral-contextual-relevance.md b/_apprentice_projects/2026/coral-contextual-relevance.md
index 47a8a66..13a7a5f 100644
--- a/_apprentice_projects/2026/coral-contextual-relevance.md
+++ b/_apprentice_projects/2026/coral-contextual-relevance.md
@@ -3,6 +3,13 @@ batch: 2026
 order: 2
 title: "CoRaL: Contextual Relevance and Linguistic Enrichment"
 summary: "A multi-dimensional data curation framework to balance quality, relevance, and cultural coverage in low-resource corpora."
+mentors:
+  - name: "Fajri Koto"
+  - name: "M Dehan Al-Kautsar"
+mentees:
+  - name: "Thanh-Nhi Nguyen"
+  - name: "Feliks Victor Parningotan Samosir"
+  - name: "Michael Christlambert Sinanta"
 ---
 
 Low-resource language corpora often suffer from noise, domain imbalance, and linguistic mixing, making naive filtering harmful to both quantity and cultural representation.
diff --git a/_apprentice_projects/2026/knowledge-distillation-vision-text.md b/_apprentice_projects/2026/knowledge-distillation-vision-text.md
index 4cbd6f4..552acd9 100644
--- a/_apprentice_projects/2026/knowledge-distillation-vision-text.md
+++ b/_apprentice_projects/2026/knowledge-distillation-vision-text.md
@@ -3,6 +3,15 @@ batch: 2026
 order: 5
 title: "Knowledge Distillation in Multilingual Vision-Text Model"
 summary: "Distill compact multilingual vision-text embeddings from large multimodal teachers for real-world deployment."
+mentors:
+  - name: "Peerat Limkonchotiwat"
+  - name: "Ekapol Chuangsuwanich"
+  - name: "Pume Tuchinda"
+mentees:
+  - name: "Ashvanth S"
+  - name: "Faiz Assabil Firdaus"
+  - name: "Ilma Aliya Fiddien"
+  - name: "Puja Ahmad Habibi"
 ---
 
 We propose a training framework to distill a small vision-text embedding model from a large multimodal teacher. Existing KD approaches often assume a base-sized teacher and focus on monolingual settings, leaving large teachers and multilingual scenarios underexplored.
diff --git a/_apprentice_projects/2026/multilingual-agentic-underrepresented.md b/_apprentice_projects/2026/multilingual-agentic-underrepresented.md
index 713511c..35401d1 100644
--- a/_apprentice_projects/2026/multilingual-agentic-underrepresented.md
+++ b/_apprentice_projects/2026/multilingual-agentic-underrepresented.md
@@ -1,8 +1,17 @@
 ---
 batch: 2026
 order: 1
-title: "Multilingual Agentic for Underrepresented Regions"
+title: "Multilingual Agentic Benchmarking for Underrepresented Regions"
 summary: "Build an environment and evaluation benchmark for agentic LLMs in low-resource languages and underrepresented regions."
+mentors:
+  - name: "Samuel Cahyawijaya"
+  - name: "Patomporn Payoungkhamdee"
+mentees:
+  - name: "Aulia Adila"
+  - name: "Kittiphat Leesombatwathana"
+  - name: "My (Chiffon) Nguyen"
+  - name: "Saksorn Ruangtanusak"
+  - name: "Vissuta Gunawan Lim"
 ---
 
 In this work, we address the gap in enabling LLMs with agentic capabilities for low-resource languages and underrepresented regions. Most existing environments and evaluation benchmarks (e.g., Taubench) are Anglocentric, leaving a critical void in assessing performance across diverse linguistic contexts.
diff --git a/_apprentice_projects/2026/reasoning-agentic-router.md b/_apprentice_projects/2026/reasoning-agentic-router.md
index 6775b45..53901ed 100644
--- a/_apprentice_projects/2026/reasoning-agentic-router.md
+++ b/_apprentice_projects/2026/reasoning-agentic-router.md
@@ -3,6 +3,13 @@ batch: 2026
 order: 3
 title: "Reasoning Agentic LLM Router"
 summary: "Develop skill-based routing to reduce inference costs while preserving strong generalization."
+mentors:
+  - name: "Genta Indra Winata"
+  - name: "David Anugraha"
+mentees:
+  - name: "Napol Rachatasumrit"
+  - name: "Quyen Le Hoang Tran"
+  - name: "Jaycent Gunawan Ongris"
 ---
 
 Learning to route effectively is crucial for improving the efficiency of LLM inference by leveraging model capabilities. Prior work explores routing strategies, but does not thoroughly examine fine-grained, skill-based routing that can substantially reduce costs while preserving strong generalization.
diff --git a/_apprentice_projects/2026/selective-memory-layer.md b/_apprentice_projects/2026/selective-memory-layer.md
index 076e2f0..a9cd3cd 100644
--- a/_apprentice_projects/2026/selective-memory-layer.md
+++ b/_apprentice_projects/2026/selective-memory-layer.md
@@ -3,6 +3,16 @@ batch: 2026
 order: 4
 title: "Selective Memory Layer Finetuning"
 summary: "Explore memory-layer finetuning strategies to improve continual learning without catastrophic forgetting."
+mentors:
+  - name: "Alham Fikri Aji"
+  - name: "Farid Adilazuarda"
+  - name: "Muhammad Reza Qorib"
+mentees:
+  - name: "Faeyza Rishad Ardi"
+  - name: "Izaaz Inhar"
+  - name: "Phudish Prateepamornkul"
+  - name: "Quang Minh Nguyen"
+  - name: "Tri Vo"
 ---
 
 We tackle continual learning from an architectural perspective. Instead of LoRA, whose parameters grow with the number of tasks or languages, we explore memory layers where the model can store or learn context by injecting key-value information during inference.
@@ -14,5 +24,5 @@ We will test this empirically by systematically finetuning different components
 **Relevant publications:**
 
 - Empirical Study on Updating Key-Value Memories in Transformer Feed-Forward Layers
-- Memory Layers at Scale | Research - AI at Meta
+- [Memory Layers at Scale](https://arxiv.org/abs/2412.09764)
 - Continual Learning via Sparse Memory Finetuning
diff --git a/_config.yml b/_config.yml
index 06f955c..f42e84c 100644
--- a/_config.yml
+++ b/_config.yml
@@ -38,6 +38,11 @@ defaults:
       type: "projects"
     values:
       layout: "project"
+  - scope:
+      path: ""
+      type: "apprentice_projects"
+    values:
+      layout: "apprentice-project"
   - scope:
       path: "assets"
     values:
@@ -59,7 +64,8 @@ collections:
     permalink: /people/:title.html
     sort_by: name
   apprentice_projects:
-    output: false
+    output: true
+    permalink: /apprentice-projects/:path.html
 
 include:
   - _pages
diff --git a/_data/navbar.yml b/_data/navbar.yml
index e17acea..fa0f429 100644
--- a/_data/navbar.yml
+++ b/_data/navbar.yml
@@ -9,7 +9,7 @@
 - name: "Publications"
   href: "/publications"
 - name: "Resources"
-  href: "/resources/"
+  href: "/resources"
 - name: "Blog"
   href: "/posts/"
 
diff --git a/_includes/apprentice-project-tabs.html b/_includes/apprentice-project-tabs.html
new file mode 100644
index 0000000..a207e26
--- /dev/null
+++ b/_includes/apprentice-project-tabs.html
@@ -0,0 +1,57 @@
+{% comment %}
+  Tab navigation for apprentice project batch pages.
+  Each tab is a link to /apprentice-projects/{batch}/.
+  Usage: {% include apprentice-project-tabs.html active_batch=page.batch %}
+{% endcomment %}
+{% assign batches = site.data.apprentice_batches %}
+{% if batches == null or batches.size == 0 %}
+  {% assign batch_years = site.apprentice_projects
+    | map: 'batch'
+    | uniq
+    | sort
+    | reverse
+  %}
+  {% assign batches = '' | split: '' %}
+  {% for year in batch_years %}
+    {% assign batches = batches | push: year %}
+  {% endfor %}
+{% endif %}
+
+{% assign active_str = include.active_batch | append: '' %}
+
diff --git a/_includes/apprenticeship-tabs.html b/_includes/apprenticeship-tabs.html
deleted file mode 100644
index 9507b2f..0000000
--- a/_includes/apprenticeship-tabs.html
+++ /dev/null
@@ -1,120 +0,0 @@
-{% assign batches = site.data.apprentice_batches %}
-{% if batches == null or batches.size == 0 %}
-  {% assign batch_years = site.apprentice_projects
-    | map: 'batch'
-    | uniq
-    | sort
-    | reverse
-  %}
-  {% assign batches = '' | split: '' %}
-  {% for year in batch_years %}
-    {% assign batches = batches | push: year %}
-  {% endfor %}
-{% endif %}
-{% assign default_batch = batches | where: 'current', true | first %}
-{% if default_batch == null %}
-  {% assign default_batch = batches.first %}
-{% endif %}
-{% assign default_batch_key = default_batch.year | default: default_batch %}
-
-
-
-- Projects for this batch will be announced soon.
-
-  {% else %}
-    {% include project-cards.html projects=projects batch_key=batch_key %}
-  {% endif %}
-
-{{ summary_text }}
-