CS449 Final Project Topics

Final Project Topics
Your final project proposal is due April 18. Before then, you need to find and finalize your group members, pick a topic, and download the dataset you want to work with.
If you already have a group, you should meet with them to discuss project ideas. Then, either as a group or individually, you should fill out this survey with some thoughts about what you’re interested in.
Your project in total will be worth at least 32% of your grade, broken down as follows:
● 8% for the proposal, due April 18
● 8% for the update, due May 11
● 16% for the report, due during finals
If you want to focus more on the project, you can choose to increase its weight by:
● 8% for a literature review, which replaces two of your readings
● 8% for extensions, which replace one homework assignment
If you want to replace a homework with project work, you need to submit a “bonus proposal” describing what you hope to do.
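The weights above add up as follows. This is only a small sketch of the arithmetic: the function and dictionary key names are made up for illustration, and it assumes the two optional components can be combined, which the description above does not state explicitly.

```python
# Project weight as a percentage of the course grade, using the figures listed above.
BASE_COMPONENTS = {"proposal": 8, "update": 8, "report": 16}

# Optional add-ons. The key names are illustrative, not official course terms.
OPTIONAL_COMPONENTS = {"literature_review": 8, "extensions": 8}


def project_weight(literature_review=False, extensions=False):
    """Return the total project weight in percent for the chosen options.

    Assumption: both optional components may be selected together.
    """
    total = sum(BASE_COMPONENTS.values())
    if literature_review:
        total += OPTIONAL_COMPONENTS["literature_review"]
    if extensions:
        total += OPTIONAL_COMPONENTS["extensions"]
    return total


if __name__ == "__main__":
    print(project_weight())                        # 32
    print(project_weight(literature_review=True))  # 40
    print(project_weight(True, True))              # 48
```

Under that assumption, the project can count for anywhere between 32% and 48% of your grade.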
Three general kinds of projects
● Write code to reproduce a landmark research paper
For example, you might write code to implement a Transformer model from scratch as introduced in the paper “Attention Is All You Need.”
● Find a dataset you’re interested in, then figure out what model to use on it.
For example, suppose you’re interested in a dataset of restaurant reviews, and want to predict the numerical rating based on the review text. Maybe you decide you want to compare the performance of an LSTM and a Transformer model.
● Find a model you’re interested in, then apply it to a new dataset.
For example, maybe you want to understand how a GAN works. You might decide to download a pretrained model and explore how its performance changes when it is fine-tuned on a new data source.
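To give a sense of what "from scratch" can mean for the first kind of project, here is a minimal sketch of scaled dot-product attention, the core operation of the Transformer: softmax(QK^T / sqrt(d_k)) V. It uses plain Python lists so that nothing is hidden behind a library. The function names are made up for illustration, and a real project would more likely use a tensor library and add multiple heads, masking, and learned projections.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of row vectors.

    Returns (outputs, weights), where weights[i][j] is how much
    query i attends to key j.
    """
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Each output is the attention-weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


if __name__ == "__main__":
    # Toy inputs: two queries, two keys, two values, all of dimension 2.
    q = [[1.0, 0.0], [0.0, 1.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[2.0, 0.0], [0.0, 4.0]]
    out, w = scaled_dot_product_attention(q, k, v)
    print(w)
    print(out)
```

Each row of the returned weights sums to 1, so every output row is a weighted average of the value vectors.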
Models and concepts you might be interested in
This is by no means an exhaustive list, but we will cover each of these in at least one lecture.
1. Convolutional Neural Networks, e.g.,
2. Recurrent Neural Networks, e.g., LSTMs
3. Transformer Models, e.g., GPT
4. Diffusion Models, e.g., DALL-E
5. Autoencoders
6. Reinforcement Learning
Datasets you might be interested in
This is obviously not an exhaustive list, but it hopefully provides some interesting options and inspiration for other resources.
1. Medical and Genetics Applications
● 10x Genomics: https://www.10xgenomics.com/resources/datasets
● Genotype-Tissue Expression: https://www.gtexportal.org/home/datasets
● Protein Classification: http://scop.mrc-lmb.cam.ac.uk
● Medical image analysis: https://paperswithcode.com/datasets?mod=medical&page=1
2. Natural Language
● Wikipedia: https://en.wikipedia.org/wiki/Wikipedia:Database_download

● Sentiment prediction: http://archive.ics.uci.edu/ml/datasets/Sentiment+Labelled+Sentences
● Reddit: https://huggingface.co/datasets/reddit
● Hate speech on Twitter: https://data.world/thomasrdavidson/hate-speech-and-offensive-language
● Machine Translation: https://huggingface.co/datasets/wmt16
3. Image classification
● Many options: https://huggingface.co/datasets?task_categories=task_categories:image-classification&sort=downloads
● So many options: https://paperswithcode.com/datasets?task=image-classification
4. Finance and economics
● World Bank Open Data: https://data.worldbank.org/
● International Monetary Fund Data: https://www.imf.org/en/Data
● Stock prices from pandas_datareader
5. Other
● Climate Modeling: https://wiki.climatechange.ai/wiki/Climate_Modeling_and_Analysis#Data

● Energy and Emissions: https://wiki.climatechange.ai/wiki/Electricity_Systems#Data
● Lie Detection from Videos: http://iab-rubric.org/index.php/bag-of-lies
  ○ Note: we already have access to this data
If you have something else in mind, there are:
● another 18,789 datasets available at https://huggingface.co/datasets
● another 7,624 datasets available at https://paperswithcode.com/datasets