Skip to content

Commit 643cbf9

Browse files
committed
Merge branch 'main' of github.com:VectorInstitute/implementation-catalog
2 parents 1d806ce + 59206b2 commit 643cbf9

File tree

1 file changed

+50
-50
lines changed

1 file changed

+50
-50
lines changed

docs/index.md

Lines changed: 50 additions & 50 deletions
Original file line numberDiff line numberDiff line change
@@ -400,30 +400,30 @@ a.dataset-tag:hover {
400400
<div class="grid cards" markdown>
401401
<div class="card" markdown>
402402
<div class="header">
403-
<h3><a href="https://github.com/VectorInstitute/anomaly-detection" title="Go to Repository">anomaly-detection</a></h3>
404-
<span class="tag year-tag">2023</span>
403+
<h3><a href="https://github.com/VectorInstitute/recommender-systems" title="Go to Repository">recommender-systems</a></h3>
404+
<span class="tag year-tag">2022</span>
405405
<span class="tag type-tag">bootcamp</span>
406406
</div>
407-
<p>A repository with implementation of anomaly detection techniques</p>
407+
<p>A repository with implementations of recommender systems</p>
408408
<div class="tag-container">
409-
<span class="tag" data-tippy="Logistic Regression (Supervised)">Logistic Regression (Supervised)</span> <span class="tag" data-tippy="Random Forest (Supervised)">Random Forest (Supervised)</span> <span class="tag" data-tippy="XGBoost (Supervised)">XGBoost (Supervised)</span> <span class="tag" data-tippy="CatBoost (Supervised)">CatBoost (Supervised)</span> <span class="tag" data-tippy="Light GBM (Supervised)">Light GBM (Supervised)</span> <span class="tag" data-tippy="TabNet (Supervised and Semi-supervised)">TabNet (Supervised and Semi-supervised)</span> <span class="tag" data-tippy="Autoencoder (AE) (Unsupervised)">Autoencoder (AE) (Unsupervised)</span> <span class="tag" data-tippy="Isolation Forest (Unsupervised)">Isolation Forest (Unsupervised)</span>
410-
</div>
411-
<div class="datasets">
412-
<strong>Datasets:</strong> <a href="https://arxiv.org/pdf/2211.13358.pdf" class="dataset-tag" target="_blank">Bank Account Fraud Detection</a> <a href="https://dgraph.xinye.com/dataset" class="dataset-tag" target="_blank">DGraph dataset</a> <a href="https://www.mvtec.com/company/research/datasets/mvtec-ad" class="dataset-tag" target="_blank">MVTec dataset</a> <a href="http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm" class="dataset-tag" target="_blank">UCSD Anomaly Detection Dataset</a> <a href="https://www.kaggle.com/datasets/odins0n/ucf-crime-dataset" class="dataset-tag" target="_blank">UCF Crime Dataset</a>
409+
<span class="tag" data-tippy="Matrix Factorization">Matrix Factorization</span> <span class="tag" data-tippy="Collaborative Filtering">Collaborative Filtering</span> <span class="tag" data-tippy="Content-Based Filtering">Content-Based Filtering</span> <span class="tag" data-tippy="Sequence Aware Recommender Systems">Sequence Aware Recommender Systems</span> <span class="tag" data-tippy="Session-Based Recommender Systems">Session-Based Recommender Systems</span> <span class="tag" data-tippy="Knowledge Graph-Based Recommender Systems">Knowledge Graph-Based Recommender Systems</span>
413410
</div>
414411

412+
415413
</div>
416414
<div class="card" markdown>
417415
<div class="header">
418-
<h3><a href="https://github.com/VectorInstitute/recommender-systems" title="Go to Repository">recommender-systems</a></h3>
419-
<span class="tag year-tag">2022</span>
416+
<h3><a href="https://github.com/VectorInstitute/diffusion_models" title="Go to Repository">diffusion-models</a></h3>
417+
<span class="tag year-tag">2024</span>
420418
<span class="tag type-tag">bootcamp</span>
421419
</div>
422-
<p>A repository with implementations of recommender systems</p>
420+
<p>A repository with demos for various diffusion models for tabular and time series data</p>
423421
<div class="tag-container">
424-
<span class="tag" data-tippy="Matrix Factorization">Matrix Factorization</span> <span class="tag" data-tippy="Collaborative Filtering">Collaborative Filtering</span> <span class="tag" data-tippy="Content-Based Filtering">Content-Based Filtering</span> <span class="tag" data-tippy="Sequence Aware Recommender Systems">Sequence Aware Recommender Systems</span> <span class="tag" data-tippy="Session-Based Recommender Systems">Session-Based Recommender Systems</span> <span class="tag" data-tippy="Knowledge Graph-Based Recommender Systems">Knowledge Graph-Based Recommender Systems</span>
422+
<span class="tag" data-tippy="TabDDPM">TabDDPM</span> <span class="tag" data-tippy="TabSyn">TabSyn</span> <a href="https://arxiv.org/abs/2405.17724" class="tag" target="_blank">ClavaDDPM</a> <span class="tag" data-tippy="CSDI">CSDI</span> <a href="https://arxiv.org/abs/2307.11494" class="tag" target="_blank">TSDiff</a>
423+
</div>
424+
<div class="datasets">
425+
<strong>Datasets:</strong> <a href="https://www.physionet.org/content/challenge-2012/1.0.0/" class="dataset-tag" target="_blank">Physionet Challenge 2012</a> <a href="https://archive.ics.uci.edu/dataset/321/electricityloaddiagrams20112014" class="dataset-tag" target="_blank">Electricity dataset (UCI Machine Learning Repository)</a>
425426
</div>
426-
427427

428428
</div>
429429
<div class="card" markdown>
@@ -441,74 +441,74 @@ a.dataset-tag:hover {
441441
</div>
442442
<div class="card" markdown>
443443
<div class="header">
444-
<h3><a href="https://github.com/VectorInstitute/self-supervised-learning" title="Go to Repository">self-supervised-learning</a></h3>
444+
<h3><a href="https://github.com/VectorInstitute/ai-deployment" title="Go to Repository">ai-deployment</a></h3>
445445
<span class="tag year-tag">2024</span>
446446
<span class="tag type-tag">bootcamp</span>
447447
</div>
448-
<p>A repository with reference implementations of self-supervised learning techniques</p>
448+
<p>A repository with reference implementations for deploying AI models in production environments, focusing on best practices and cloud-native solutions.</p>
449449
<div class="tag-container">
450-
<a href="https://proceedings.mlr.press/v162/qiu22b/qiu22b.pdf" class="tag" target="_blank">Internal Contrastive Learning (ICL) + Latent Outlier Exposure (LOE)</a> <a href="https://arxiv.org/abs/2302.00861" class="tag" target="_blank">SimMTM</a> <a href="https://arxiv.org/abs/2303.15747" class="tag" target="_blank">TabRet</a> <a href="https://arxiv.org/abs/2202.03555" class="tag" target="_blank">Data2Vec</a>
451-
</div>
452-
<div class="datasets">
453-
<strong>Datasets:</strong> <a href="https://cs.stanford.edu/~acoates/stl10/" class="dataset-tag" target="_blank">STL-10</a> <a href="https://archive.ics.uci.edu/dataset/381/beijing+pm2+5+data" class="dataset-tag" target="_blank">Beijing PM 2.5</a>
450+
<a href="https://aws.amazon.com/" class="tag" target="_blank">AWS</a> <a href="https://cloud.google.com/" class="tag" target="_blank">GCP</a>
454451
</div>
455452

453+
456454
</div>
457455
<div class="card" markdown>
458456
<div class="header">
459-
<h3><a href="https://github.com/VectorInstitute/diffusion_models" title="Go to Repository">diffusion-models</a></h3>
460-
<span class="tag year-tag">2024</span>
457+
<h3><a href="https://github.com/VectorInstitute/anomaly-detection" title="Go to Repository">anomaly-detection</a></h3>
458+
<span class="tag year-tag">2023</span>
461459
<span class="tag type-tag">bootcamp</span>
462460
</div>
463-
<p>A repository with demos for various diffusion models for tabular and time series data</p>
461+
<p>A repository with implementation of anomaly detection techniques</p>
464462
<div class="tag-container">
465-
<span class="tag" data-tippy="TabDDPM">TabDDPM</span> <span class="tag" data-tippy="TabSyn">TabSyn</span> <a href="https://arxiv.org/abs/2405.17724" class="tag" target="_blank">ClavaDDPM</a> <span class="tag" data-tippy="CSDI">CSDI</span> <a href="https://arxiv.org/abs/2307.11494" class="tag" target="_blank">TSDiff</a>
463+
<span class="tag" data-tippy="Logistic Regression (Supervised)">Logistic Regression (Supervised)</span> <span class="tag" data-tippy="Random Forest (Supervised)">Random Forest (Supervised)</span> <span class="tag" data-tippy="XGBoost (Supervised)">XGBoost (Supervised)</span> <span class="tag" data-tippy="CatBoost (Supervised)">CatBoost (Supervised)</span> <span class="tag" data-tippy="Light GBM (Supervised)">Light GBM (Supervised)</span> <span class="tag" data-tippy="TabNet (Supervised and Semi-supervised)">TabNet (Supervised and Semi-supervised)</span> <span class="tag" data-tippy="Autoencoder (AE) (Unsupervised)">Autoencoder (AE) (Unsupervised)</span> <span class="tag" data-tippy="Isolation Forest (Unsupervised)">Isolation Forest (Unsupervised)</span>
466464
</div>
467465
<div class="datasets">
468-
<strong>Datasets:</strong> <a href="https://www.physionet.org/content/challenge-2012/1.0.0/" class="dataset-tag" target="_blank">Physionet Challenge 2012</a> <a href="https://archive.ics.uci.edu/dataset/321/electricityloaddiagrams20112014" class="dataset-tag" target="_blank">Electricity dataset (UCI Machine Learning Repository)</a>
466+
<strong>Datasets:</strong> <a href="https://arxiv.org/pdf/2211.13358.pdf" class="dataset-tag" target="_blank">Bank Account Fraud Detection</a> <a href="https://dgraph.xinye.com/dataset" class="dataset-tag" target="_blank">DGraph dataset</a> <a href="https://www.mvtec.com/company/research/datasets/mvtec-ad" class="dataset-tag" target="_blank">MVTec dataset</a> <a href="http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm" class="dataset-tag" target="_blank">UCSD Anomaly Detection Dataset</a> <a href="https://www.kaggle.com/datasets/odins0n/ucf-crime-dataset" class="dataset-tag" target="_blank">UCF Crime Dataset</a>
469467
</div>
470468

471469
</div>
472470
<div class="card" markdown>
473471
<div class="header">
474-
<h3><a href="https://github.com/VectorInstitute/ai-deployment" title="Go to Repository">ai-deployment</a></h3>
472+
<h3><a href="https://github.com/VectorInstitute/self-supervised-learning" title="Go to Repository">self-supervised-learning</a></h3>
475473
<span class="tag year-tag">2024</span>
476474
<span class="tag type-tag">bootcamp</span>
477475
</div>
478-
<p>A repository with reference implementations for deploying AI models in production environments, focusing on best practices and cloud-native solutions.</p>
476+
<p>A repository with reference implementations of self-supervised learning techniques</p>
479477
<div class="tag-container">
480-
<a href="https://aws.amazon.com/" class="tag" target="_blank">AWS</a> <a href="https://cloud.google.com/" class="tag" target="_blank">GCP</a>
478+
<a href="https://proceedings.mlr.press/v162/qiu22b/qiu22b.pdf" class="tag" target="_blank">Internal Contrastive Learning (ICL) + Latent Outlier Exposure (LOE)</a> <a href="https://arxiv.org/abs/2302.00861" class="tag" target="_blank">SimMTM</a> <a href="https://arxiv.org/abs/2303.15747" class="tag" target="_blank">TabRet</a> <a href="https://arxiv.org/abs/2202.03555" class="tag" target="_blank">Data2Vec</a>
479+
</div>
480+
<div class="datasets">
481+
<strong>Datasets:</strong> <a href="https://cs.stanford.edu/~acoates/stl10/" class="dataset-tag" target="_blank">STL-10</a> <a href="https://archive.ics.uci.edu/dataset/381/beijing+pm2+5+data" class="dataset-tag" target="_blank">Beijing PM 2.5</a>
481482
</div>
482-
483483

484484
</div>
485485
<div class="card" markdown>
486486
<div class="header">
487-
<h3><a href="https://github.com/VectorInstitute/retrieval-augmented-generation" title="Go to Repository">retrieval-augmented-generation</a></h3>
487+
<h3><a href="https://github.com/VectorInstitute/finetuning-and-alignment" title="Go to Repository">finetuning-and-alignment</a></h3>
488488
<span class="tag year-tag">2024</span>
489489
<span class="tag type-tag">bootcamp</span>
490490
</div>
491-
<p>A repository reference implementations for retrieval-augmented generation</p>
491+
<p>A repository with implementations advanced fine-tuning techniques and approaches to enhance Large Language Model performance, reduce their computational cost, with a focus on alignment with human values</p>
492492
<div class="tag-container">
493-
<span class="tag" data-tippy="Web Search">Web Search</span> <span class="tag" data-tippy="Document Search">Document Search</span> <span class="tag" data-tippy="SQL Search">SQL Search</span> <span class="tag" data-tippy="Cloud Search">Cloud Search</span> <span class="tag" data-tippy="PubMed QA">PubMed QA</span> <span class="tag" data-tippy="RAG Evaluation">RAG Evaluation</span>
493+
<a href="https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html" class="tag" target="_blank">FSDP</a> <a href="https://docs.pytorch.org/tutorials/intermediate/ddp_tutorial.html" class="tag" target="_blank">DDP</a> <span class="tag" data-tippy="Instruction Tuning">Instruction Tuning</span> <a href="https://github.com/huggingface/peft" class="tag" target="_blank">PEFT</a> <span class="tag" data-tippy="Quantization">Quantization</span> <span class="tag" data-tippy="Supervised Fine-tuning">Supervised Fine-tuning</span>
494494
</div>
495495
<div class="datasets">
496-
<strong>Datasets:</strong> <a href="https://pubmed.ncbi.nlm.nih.gov" class="dataset-tag" target="_blank">PubMed</a> <a href="https://www.kaggle.com/datasets/prakharrathi25/banking-dataset-marketing-targets" class="dataset-tag" target="_blank">Banking Dataset - Marketing Targets</a>
496+
<strong>Datasets:</strong> <a href="https://huggingface.co/datasets/knkarthick/samsum" class="dataset-tag" target="_blank">SAMSum dataset</a> <a href="https://github.com/cardiffnlp/tweeteval" class="dataset-tag" target="_blank">TweetEval</a>
497497
</div>
498498

499499
</div>
500500
<div class="card" markdown>
501501
<div class="header">
502-
<h3><a href="https://github.com/VectorInstitute/finetuning-and-alignment" title="Go to Repository">finetuning-and-alignment</a></h3>
502+
<h3><a href="https://github.com/VectorInstitute/retrieval-augmented-generation" title="Go to Repository">retrieval-augmented-generation</a></h3>
503503
<span class="tag year-tag">2024</span>
504504
<span class="tag type-tag">bootcamp</span>
505505
</div>
506-
<p>A repository with implementations advanced fine-tuning techniques and approaches to enhance Large Language Model performance, reduce their computational cost, with a focus on alignment with human values</p>
506+
<p>A repository reference implementations for retrieval-augmented generation</p>
507507
<div class="tag-container">
508-
<a href="https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html" class="tag" target="_blank">FSDP</a> <a href="https://docs.pytorch.org/tutorials/intermediate/ddp_tutorial.html" class="tag" target="_blank">DDP</a> <span class="tag" data-tippy="Instruction Tuning">Instruction Tuning</span> <a href="https://github.com/huggingface/peft" class="tag" target="_blank">PEFT</a> <span class="tag" data-tippy="Quantization">Quantization</span> <span class="tag" data-tippy="Supervised Fine-tuning">Supervised Fine-tuning</span>
508+
<span class="tag" data-tippy="Web Search">Web Search</span> <span class="tag" data-tippy="Document Search">Document Search</span> <span class="tag" data-tippy="SQL Search">SQL Search</span> <span class="tag" data-tippy="Cloud Search">Cloud Search</span> <span class="tag" data-tippy="PubMed QA">PubMed QA</span> <span class="tag" data-tippy="RAG Evaluation">RAG Evaluation</span>
509509
</div>
510510
<div class="datasets">
511-
<strong>Datasets:</strong> <a href="https://huggingface.co/datasets/knkarthick/samsum" class="dataset-tag" target="_blank">SAMSum dataset</a> <a href="https://github.com/cardiffnlp/tweeteval" class="dataset-tag" target="_blank">TweetEval</a>
511+
<strong>Datasets:</strong> <a href="https://pubmed.ncbi.nlm.nih.gov" class="dataset-tag" target="_blank">PubMed</a> <a href="https://www.kaggle.com/datasets/prakharrathi25/banking-dataset-marketing-targets" class="dataset-tag" target="_blank">Banking Dataset - Marketing Targets</a>
512512
</div>
513513

514514
</div>
@@ -519,6 +519,21 @@ a.dataset-tag:hover {
519519

520520
<div class="grid cards" markdown>
521521
<div class="card" markdown>
522+
<div class="header">
523+
<h3><a href="https://github.com/VectorInstitute/mmlearn" title="Go to Repository">mmlearn</a></h3>
524+
<span class="tag year-tag">2024</span>
525+
<span class="tag type-tag">tool</span>
526+
</div>
527+
<p>A toolkit for research on multimodal representation learning</p>
528+
<div class="tag-container">
529+
<span class="tag" data-tippy="Contrastive Pretraining">Contrastive Pretraining</span> <a href="https://arxiv.org/abs/2301.08243" class="tag" target="_blank">I-JEPA</a>
530+
</div>
531+
<div class="datasets">
532+
<strong>Datasets:</strong> <a href="https://www.image-net.org/" class="dataset-tag" target="_blank">ImageNet</a> <a href="https://www.openslr.org/12/" class="dataset-tag" target="_blank">LibriSpeech</a> <a href="https://rgbd.cs.princeton.edu/" class="dataset-tag" target="_blank">RGB-D</a>
533+
</div>
534+
535+
</div>
536+
<div class="card" markdown>
522537
<div class="header">
523538
<h3><a href="https://github.com/VectorInstitute/fed-rag" title="Go to Repository">fed-rag</a></h3>
524539
<span class="tag year-tag">2025</span>
@@ -546,21 +561,6 @@ a.dataset-tag:hover {
546561
<a href="#" class="bibtex-button" data-bibtex-id="baghbanzadeh2025advancing" title="View Citation">Cite</a>
547562
<a href="https://arxiv.org/abs/2503.14377" class="paper-link" title="View Paper" target="_blank">Paper</a>
548563
</div>
549-
</div>
550-
<div class="card" markdown>
551-
<div class="header">
552-
<h3><a href="https://github.com/VectorInstitute/mmlearn" title="Go to Repository">mmlearn</a></h3>
553-
<span class="tag year-tag">2024</span>
554-
<span class="tag type-tag">tool</span>
555-
</div>
556-
<p>A toolkit for research on multimodal representation learning</p>
557-
<div class="tag-container">
558-
<span class="tag" data-tippy="Contrastive Pretraining">Contrastive Pretraining</span> <a href="https://arxiv.org/abs/2301.08243" class="tag" target="_blank">I-JEPA</a>
559-
</div>
560-
<div class="datasets">
561-
<strong>Datasets:</strong> <a href="https://www.image-net.org/" class="dataset-tag" target="_blank">ImageNet</a> <a href="https://www.openslr.org/12/" class="dataset-tag" target="_blank">LibriSpeech</a> <a href="https://rgbd.cs.princeton.edu/" class="dataset-tag" target="_blank">RGB-D</a>
562-
</div>
563-
564564
</div>
565565
<div class="card" markdown>
566566
<div class="header">

0 commit comments

Comments
 (0)