-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathabout.qmd
More file actions
42 lines (36 loc) · 3.54 KB
/
about.qmd
File metadata and controls
42 lines (36 loc) · 3.54 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
title: "About"
image: profile.jpg
# about:
# template: jolla
# links:
# - icon: twitter
# text: Twitter
# href: https://twitter.com
# - icon: linkedin
# text: LinkedIn
# href: https://linkedin.com
# - icon: github
# text: Github
# href: https://github.com=
---
<!--
TODO:
* add twitter/linkedin handles etc.
* what is IDEAS NCBR, link
* add research statement, like https://kindly-cartoon-566.notion.site/Informal-research-statement-Piotr-Mi-o-5008a7f6f9c24e6c88a4ba35971d6196
-->
Blog of LLM-Random research group in [IDEAS NCBR](https://ideas-ncbr.pl/en/), started in mid-2022 by Sebastian Jaszczur. Team members, in the order of joining, listed below. We are happy to collaborate with outside institutions and people.
* Sebastian Jaszczur [{{< fa brands x-twitter >}}](https://twitter.com/S_Jaszczur) [{{< fa brands linkedin >}}](https://pl.linkedin.com/in/sebastian-jaszczur-129866a3) [{{< fa brands github >}}](https://github.com/sebastianjaszczur) [{{< fa brands google-scholar >}}](https://scholar.google.pl/citations?user=KF__0WgAAAAJ)
* Marek Cygan [{{< fa brands x-twitter >}}](https://twitter.com/marek_a_cygan), as advisor
* Jakub Krajewski [{{< fa brands x-twitter >}}](https://twitter.com/kuba_krj) [{{< fa brands github >}}](http://github.com/kuba-krj) [{{< fa brands google-scholar >}}](https://scholar.google.com/citations?user=v5mZs1kAAAAJ)
* Szymon Antoniak [{{< fa brands x-twitter >}}](https://twitter.com/Simontwice2) [{{< fa brands github >}}](https://github.com/Simontwice) (past member)
* Maciej Pióro [{{< fa brands x-twitter >}}](https://twitter.com/maciejpioro) [{{< fa brands linkedin >}}](https://www.linkedin.com/in/maciej-pioro/) [{{< fa brands github >}}](https://github.com/maciek-pioro) [{{< fa brands google-scholar >}}](https://scholar.google.pl/citations?user=KkbtNw0AAAAJ)
* Tomasz Odrzygóźdź (past member)
* Jan Ludziejewski [{{< fa brands x-twitter >}}](https://twitter.com/jahulas) [{{< fa brands linkedin >}}](https://www.linkedin.com/in/jan-ludziejewski-034959223/) [{{< fa brands github >}}](https://github.com/ludziej) [{{< fa brands google-scholar >}}](https://scholar.google.com/citations?user=YihTUGQAAAAJ&hl=pl)
* Michał Krutul [{{< fa brands x-twitter >}}](https://twitter.com/CrewTool) [{{< fa brands linkedin >}}](https://www.linkedin.com/in/crewtool/) [{{< fa brands github >}}](https://github.com/crewtool)
* Kamil Ciebiera [{{< fa brands x-twitter >}}](https://twitter.com/KamilCiebiera) [{{< fa brands linkedin >}}](www.linkedin.com/in/kamil-ciebiera-9439a0252) [{{< fa brands github >}}](https://github.com/ciebson123)
* Krystian Król
Public repositories of the group are available at [on GitHub](https://github.com/llm-random/).
# Research Statement
We are interested in improving the state-of-the-art in Large Language Models, as we believe this to be the most impactful research direction. We want to do this by improving the efficiency of those models during both training and inference. Among the techniques we explore in our work are: Mixture-of-Experts (see [Mixture of Tokens](posts/mixture_of_tokens/)), fast/speculative decoding (blogpost coming soon), and improving parameter efficiency (see [Neuron Recycling](/posts/neuron_recycling/)).