@@ -400,30 +400,30 @@ a.dataset-tag:hover {
400400 <div class="grid cards" markdown>
401401 <div class="card" markdown>
402402 <div class="header">
403- <h3><a href="https://github.com/VectorInstitute/anomaly-detection " title="Go to Repository">anomaly-detection </a></h3>
404- <span class="tag year-tag">2023 </span>
403+ <h3><a href="https://github.com/VectorInstitute/recommender-systems " title="Go to Repository">recommender-systems </a></h3>
404+ <span class="tag year-tag">2022 </span>
405405 <span class="tag type-tag">bootcamp</span>
406406 </div>
407- <p>A repository with implementation of anomaly detection techniques </p>
407+ <p>A repository with implementations of recommender systems </p>
408408 <div class="tag-container">
409- <span class="tag" data-tippy="Logistic Regression (Supervised)">Logistic Regression (Supervised)</span> <span class="tag" data-tippy="Random Forest (Supervised)">Random Forest (Supervised)</span> <span class="tag" data-tippy="XGBoost (Supervised)">XGBoost (Supervised)</span> <span class="tag" data-tippy="CatBoost (Supervised)">CatBoost (Supervised)</span> <span class="tag" data-tippy="Light GBM (Supervised)">Light GBM (Supervised)</span> <span class="tag" data-tippy="TabNet (Supervised and Semi-supervised)">TabNet (Supervised and Semi-supervised)</span> <span class="tag" data-tippy="Autoencoder (AE) (Unsupervised)">Autoencoder (AE) (Unsupervised)</span> <span class="tag" data-tippy="Isolation Forest (Unsupervised)">Isolation Forest (Unsupervised)</span>
410- </div>
411- <div class="datasets">
412- <strong>Datasets:</strong> <a href="https://arxiv.org/pdf/2211.13358.pdf" class="dataset-tag" target="_blank">Bank Account Fraud Detection</a> <a href="https://dgraph.xinye.com/dataset" class="dataset-tag" target="_blank">DGraph dataset</a> <a href="https://www.mvtec.com/company/research/datasets/mvtec-ad" class="dataset-tag" target="_blank">MVTec dataset</a> <a href="http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm" class="dataset-tag" target="_blank">UCSD Anomaly Detection Dataset</a> <a href="https://www.kaggle.com/datasets/odins0n/ucf-crime-dataset" class="dataset-tag" target="_blank">UCF Crime Dataset</a>
409+ <span class="tag" data-tippy="Matrix Factorization">Matrix Factorization</span> <span class="tag" data-tippy="Collaborative Filtering">Collaborative Filtering</span> <span class="tag" data-tippy="Content-Based Filtering">Content-Based Filtering</span> <span class="tag" data-tippy="Sequence Aware Recommender Systems">Sequence Aware Recommender Systems</span> <span class="tag" data-tippy="Session-Based Recommender Systems">Session-Based Recommender Systems</span> <span class="tag" data-tippy="Knowledge Graph-Based Recommender Systems">Knowledge Graph-Based Recommender Systems</span>
413410 </div>
414411
412+
415413 </div>
416414 <div class="card" markdown>
417415 <div class="header">
418- <h3><a href="https://github.com/VectorInstitute/recommender-systems " title="Go to Repository">recommender-systems </a></h3>
419- <span class="tag year-tag">2022 </span>
416+ <h3><a href="https://github.com/VectorInstitute/diffusion_models " title="Go to Repository">diffusion-models </a></h3>
417+ <span class="tag year-tag">2024 </span>
420418 <span class="tag type-tag">bootcamp</span>
421419 </div>
422- <p>A repository with implementations of recommender systems </p>
420+ <p>A repository with demos for various diffusion models for tabular and time series data </p>
423421 <div class="tag-container">
424- <span class="tag" data-tippy="Matrix Factorization">Matrix Factorization</span> <span class="tag" data-tippy="Collaborative Filtering">Collaborative Filtering</span> <span class="tag" data-tippy="Content-Based Filtering">Content-Based Filtering</span> <span class="tag" data-tippy="Sequence Aware Recommender Systems">Sequence Aware Recommender Systems</span> <span class="tag" data-tippy="Session-Based Recommender Systems">Session-Based Recommender Systems</span> <span class="tag" data-tippy="Knowledge Graph-Based Recommender Systems">Knowledge Graph-Based Recommender Systems</span>
422+ <span class="tag" data-tippy="TabDDPM">TabDDPM</span> <span class="tag" data-tippy="TabSyn">TabSyn</span> <a href="https://arxiv.org/abs/2405.17724" class="tag" target="_blank">ClavaDDPM</a> <span class="tag" data-tippy="CSDI">CSDI</span> <a href="https://arxiv.org/abs/2307.11494" class="tag" target="_blank">TSDiff</a>
423+ </div>
424+ <div class="datasets">
425+ <strong>Datasets:</strong> <a href="https://www.physionet.org/content/challenge-2012/1.0.0/" class="dataset-tag" target="_blank">Physionet Challenge 2012</a> <a href="https://archive.ics.uci.edu/dataset/321/electricityloaddiagrams20112014" class="dataset-tag" target="_blank">Electricity dataset (UCI Machine Learning Repository)</a>
425426 </div>
426-
427427
428428 </div>
429429 <div class="card" markdown>
@@ -441,74 +441,74 @@ a.dataset-tag:hover {
441441 </div>
442442 <div class="card" markdown>
443443 <div class="header">
444- <h3><a href="https://github.com/VectorInstitute/self-supervised-learning " title="Go to Repository">self-supervised-learning </a></h3>
444+ <h3><a href="https://github.com/VectorInstitute/ai-deployment " title="Go to Repository">ai-deployment </a></h3>
445445 <span class="tag year-tag">2024</span>
446446 <span class="tag type-tag">bootcamp</span>
447447 </div>
448- <p>A repository with reference implementations of self-supervised learning techniques </p>
448+ <p>A repository with reference implementations for deploying AI models in production environments, focusing on best practices and cloud-native solutions. </p>
449449 <div class="tag-container">
450- <a href="https://proceedings.mlr.press/v162/qiu22b/qiu22b.pdf" class="tag" target="_blank">Internal Contrastive Learning (ICL) + Latent Outlier Exposure (LOE)</a> <a href="https://arxiv.org/abs/2302.00861" class="tag" target="_blank">SimMTM</a> <a href="https://arxiv.org/abs/2303.15747" class="tag" target="_blank">TabRet</a> <a href="https://arxiv.org/abs/2202.03555" class="tag" target="_blank">Data2Vec</a>
451- </div>
452- <div class="datasets">
453- <strong>Datasets:</strong> <a href="https://cs.stanford.edu/~acoates/stl10/" class="dataset-tag" target="_blank">STL-10</a> <a href="https://archive.ics.uci.edu/dataset/381/beijing+pm2+5+data" class="dataset-tag" target="_blank">Beijing PM 2.5</a>
450+ <a href="https://aws.amazon.com/" class="tag" target="_blank">AWS</a> <a href="https://cloud.google.com/" class="tag" target="_blank">GCP</a>
454451 </div>
455452
453+
456454 </div>
457455 <div class="card" markdown>
458456 <div class="header">
459- <h3><a href="https://github.com/VectorInstitute/diffusion_models " title="Go to Repository">diffusion-models </a></h3>
460- <span class="tag year-tag">2024 </span>
457+ <h3><a href="https://github.com/VectorInstitute/anomaly-detection " title="Go to Repository">anomaly-detection </a></h3>
458+ <span class="tag year-tag">2023 </span>
461459 <span class="tag type-tag">bootcamp</span>
462460 </div>
463- <p>A repository with demos for various diffusion models for tabular and time series data </p>
461+ <p>A repository with implementation of anomaly detection techniques </p>
464462 <div class="tag-container">
465- <span class="tag" data-tippy="TabDDPM">TabDDPM</span> <span class="tag" data-tippy="TabSyn">TabSyn</span> <a href="https://arxiv.org/abs/2405.17724" class="tag" target="_blank">ClavaDDPM</a> <span class="tag" data-tippy="CSDI">CSDI</span> <a href="https://arxiv.org/abs/2307.11494" class="tag" target="_blank">TSDiff</a>
463+ <span class="tag" data-tippy="Logistic Regression (Supervised)">Logistic Regression (Supervised)</span> <span class="tag" data-tippy="Random Forest (Supervised)">Random Forest (Supervised)</span> <span class="tag" data-tippy="XGBoost (Supervised)">XGBoost (Supervised)</span> <span class="tag" data-tippy="CatBoost (Supervised)">CatBoost (Supervised)</span> <span class="tag" data-tippy="Light GBM (Supervised)">Light GBM (Supervised)</span> <span class="tag" data-tippy="TabNet (Supervised and Semi-supervised)">TabNet (Supervised and Semi-supervised)</span> <span class="tag" data-tippy="Autoencoder (AE) (Unsupervised)">Autoencoder (AE) (Unsupervised)</span> <span class="tag" data-tippy="Isolation Forest (Unsupervised)">Isolation Forest (Unsupervised)</span>
466464 </div>
467465 <div class="datasets">
468- <strong>Datasets:</strong> <a href="https://www.physionet.org/content/challenge-2012/1.0.0/" class="dataset-tag" target="_blank">Physionet Challenge 2012</a> <a href="https://archive.ics.uci.edu/dataset/321/electricityloaddiagrams20112014" class="dataset-tag" target="_blank">Electricity dataset (UCI Machine Learning Repository)</a>
466+ <strong>Datasets:</strong> <a href="https://arxiv.org/pdf/2211.13358.pdf" class="dataset-tag" target="_blank">Bank Account Fraud Detection</a> <a href="https://dgraph.xinye.com/dataset" class="dataset-tag" target="_blank">DGraph dataset</a> <a href="https://www.mvtec.com/company/research/datasets/mvtec-ad" class="dataset-tag" target="_blank">MVTec dataset</a> <a href="http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm" class="dataset-tag" target="_blank">UCSD Anomaly Detection Dataset</a> <a href="https://www.kaggle.com/datasets/odins0n/ucf-crime-dataset" class="dataset-tag" target="_blank">UCF Crime Dataset</a>
469467 </div>
470468
471469 </div>
472470 <div class="card" markdown>
473471 <div class="header">
474- <h3><a href="https://github.com/VectorInstitute/ai-deployment " title="Go to Repository">ai-deployment </a></h3>
472+ <h3><a href="https://github.com/VectorInstitute/self-supervised-learning " title="Go to Repository">self-supervised-learning </a></h3>
475473 <span class="tag year-tag">2024</span>
476474 <span class="tag type-tag">bootcamp</span>
477475 </div>
478- <p>A repository with reference implementations for deploying AI models in production environments, focusing on best practices and cloud-native solutions. </p>
476+ <p>A repository with reference implementations of self-supervised learning techniques </p>
479477 <div class="tag-container">
480- <a href="https://aws.amazon.com/" class="tag" target="_blank">AWS</a> <a href="https://cloud.google.com/" class="tag" target="_blank">GCP</a>
478+ <a href="https://proceedings.mlr.press/v162/qiu22b/qiu22b.pdf" class="tag" target="_blank">Internal Contrastive Learning (ICL) + Latent Outlier Exposure (LOE)</a> <a href="https://arxiv.org/abs/2302.00861" class="tag" target="_blank">SimMTM</a> <a href="https://arxiv.org/abs/2303.15747" class="tag" target="_blank">TabRet</a> <a href="https://arxiv.org/abs/2202.03555" class="tag" target="_blank">Data2Vec</a>
479+ </div>
480+ <div class="datasets">
481+ <strong>Datasets:</strong> <a href="https://cs.stanford.edu/~acoates/stl10/" class="dataset-tag" target="_blank">STL-10</a> <a href="https://archive.ics.uci.edu/dataset/381/beijing+pm2+5+data" class="dataset-tag" target="_blank">Beijing PM 2.5</a>
481482 </div>
482-
483483
484484 </div>
485485 <div class="card" markdown>
486486 <div class="header">
487- <h3><a href="https://github.com/VectorInstitute/retrieval-augmented-generation " title="Go to Repository">retrieval-augmented-generation </a></h3>
487+ <h3><a href="https://github.com/VectorInstitute/finetuning-and-alignment " title="Go to Repository">finetuning-and-alignment </a></h3>
488488 <span class="tag year-tag">2024</span>
489489 <span class="tag type-tag">bootcamp</span>
490490 </div>
491- <p>A repository with reference implementations for retrieval-augmented generation </p>
491+ <p>A repository with implementations of advanced fine-tuning techniques and approaches that enhance Large Language Model performance and reduce computational cost, with a focus on alignment with human values </p>
492492 <div class="tag-container">
493- <span class="tag" data-tippy="Web Search">Web Search</span> <span class="tag" data-tippy="Document Search">Document Search</span> <span class="tag" data-tippy="SQL Search">SQL Search</span> <span class="tag" data-tippy="Cloud Search">Cloud Search</span> <span class="tag" data-tippy="PubMed QA">PubMed QA</span> <span class="tag" data-tippy="RAG Evaluation">RAG Evaluation</span>
493+ <a href="https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html" class="tag" target="_blank">FSDP</a> <a href="https://docs.pytorch.org/tutorials/intermediate/ddp_tutorial.html" class="tag" target="_blank">DDP</a> <span class="tag" data-tippy="Instruction Tuning">Instruction Tuning</span> <a href="https://github.com/huggingface/peft" class="tag" target="_blank">PEFT</a> <span class="tag" data-tippy="Quantization">Quantization</span> <span class="tag" data-tippy="Supervised Fine-tuning">Supervised Fine-tuning</span>
494494 </div>
495495 <div class="datasets">
496- <strong>Datasets:</strong> <a href="https://pubmed.ncbi.nlm.nih.gov" class="dataset-tag" target="_blank">PubMed</a> <a href="https://www.kaggle.com/datasets/prakharrathi25/banking-dataset-marketing-targets" class="dataset-tag" target="_blank">Banking Dataset - Marketing Targets</a>
496+ <strong>Datasets:</strong> <a href="https://huggingface.co/datasets/knkarthick/samsum" class="dataset-tag" target="_blank">SAMSum dataset</a> <a href="https://github.com/cardiffnlp/tweeteval" class="dataset-tag" target="_blank">TweetEval</a>
497497 </div>
498498
499499 </div>
500500 <div class="card" markdown>
501501 <div class="header">
502- <h3><a href="https://github.com/VectorInstitute/finetuning-and-alignment " title="Go to Repository">finetuning-and-alignment </a></h3>
502+ <h3><a href="https://github.com/VectorInstitute/retrieval-augmented-generation " title="Go to Repository">retrieval-augmented-generation </a></h3>
503503 <span class="tag year-tag">2024</span>
504504 <span class="tag type-tag">bootcamp</span>
505505 </div>
506- <p>A repository with implementations of advanced fine-tuning techniques and approaches that enhance Large Language Model performance and reduce computational cost, with a focus on alignment with human values </p>
506+ <p>A repository with reference implementations for retrieval-augmented generation </p>
507507 <div class="tag-container">
508- <a href="https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html" class="tag" target="_blank">FSDP</a> <a href="https://docs.pytorch.org/tutorials/intermediate/ddp_tutorial.html" class="tag" target="_blank">DDP</a> <span class="tag" data-tippy="Instruction Tuning">Instruction Tuning</span> <a href="https://github.com/huggingface/peft" class="tag" target="_blank">PEFT</a> <span class="tag" data-tippy="Quantization">Quantization</span> <span class="tag" data-tippy="Supervised Fine-tuning">Supervised Fine-tuning</span>
508+ <span class="tag" data-tippy="Web Search">Web Search</span> <span class="tag" data-tippy="Document Search">Document Search</span> <span class="tag" data-tippy="SQL Search">SQL Search</span> <span class="tag" data-tippy="Cloud Search">Cloud Search</span> <span class="tag" data-tippy="PubMed QA">PubMed QA</span> <span class="tag" data-tippy="RAG Evaluation">RAG Evaluation</span>
509509 </div>
510510 <div class="datasets">
511- <strong>Datasets:</strong> <a href="https://huggingface.co/datasets/knkarthick/samsum" class="dataset-tag" target="_blank">SAMSum dataset</a> <a href="https://github.com/cardiffnlp/tweeteval" class="dataset-tag" target="_blank">TweetEval</a>
511+ <strong>Datasets:</strong> <a href="https://pubmed.ncbi.nlm.nih.gov" class="dataset-tag" target="_blank">PubMed</a> <a href="https://www.kaggle.com/datasets/prakharrathi25/banking-dataset-marketing-targets" class="dataset-tag" target="_blank">Banking Dataset - Marketing Targets</a>
512512 </div>
513513
514514 </div>
@@ -519,6 +519,21 @@ a.dataset-tag:hover {
519519
520520 <div class="grid cards" markdown>
521521 <div class="card" markdown>
522+ <div class="header">
523+ <h3><a href="https://github.com/VectorInstitute/mmlearn" title="Go to Repository">mmlearn</a></h3>
524+ <span class="tag year-tag">2024</span>
525+ <span class="tag type-tag">tool</span>
526+ </div>
527+ <p>A toolkit for research on multimodal representation learning</p>
528+ <div class="tag-container">
529+ <span class="tag" data-tippy="Contrastive Pretraining">Contrastive Pretraining</span> <a href="https://arxiv.org/abs/2301.08243" class="tag" target="_blank">I-JEPA</a>
530+ </div>
531+ <div class="datasets">
532+ <strong>Datasets:</strong> <a href="https://www.image-net.org/" class="dataset-tag" target="_blank">ImageNet</a> <a href="https://www.openslr.org/12/" class="dataset-tag" target="_blank">LibriSpeech</a> <a href="https://rgbd.cs.princeton.edu/" class="dataset-tag" target="_blank">RGB-D</a>
533+ </div>
534+
535+ </div>
536+ <div class="card" markdown>
522537 <div class="header">
523538 <h3><a href="https://github.com/VectorInstitute/fed-rag" title="Go to Repository">fed-rag</a></h3>
524539 <span class="tag year-tag">2025</span>
@@ -546,21 +561,6 @@ a.dataset-tag:hover {
546561 <a href="#" class="bibtex-button" data-bibtex-id="baghbanzadeh2025advancing" title="View Citation">Cite</a>
547562 <a href="https://arxiv.org/abs/2503.14377" class="paper-link" title="View Paper" target="_blank">Paper</a>
548563 </div>
549- </div>
550- <div class="card" markdown>
551- <div class="header">
552- <h3><a href="https://github.com/VectorInstitute/mmlearn" title="Go to Repository">mmlearn</a></h3>
553- <span class="tag year-tag">2024</span>
554- <span class="tag type-tag">tool</span>
555- </div>
556- <p>A toolkit for research on multimodal representation learning</p>
557- <div class="tag-container">
558- <span class="tag" data-tippy="Contrastive Pretraining">Contrastive Pretraining</span> <a href="https://arxiv.org/abs/2301.08243" class="tag" target="_blank">I-JEPA</a>
559- </div>
560- <div class="datasets">
561- <strong>Datasets:</strong> <a href="https://www.image-net.org/" class="dataset-tag" target="_blank">ImageNet</a> <a href="https://www.openslr.org/12/" class="dataset-tag" target="_blank">LibriSpeech</a> <a href="https://rgbd.cs.princeton.edu/" class="dataset-tag" target="_blank">RGB-D</a>
562- </div>
563-
564564 </div>
565565 <div class="card" markdown>
566566 <div class="header">