# CSD501 / Data Science

## Syllabus

{% file src="<https://3148391480-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FoWho7cxjZIbvsuDwIAzB%2Fuploads%2FOZHmPFhpMVRT5K3m3ZOL%2FY3S5-CSD501-SYLLABUS-BTECH-CSE-IT.pdf?alt=media&token=632e14a8-9cee-4084-842e-87a329a769b2>" %}

{% file src="<https://3148391480-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FoWho7cxjZIbvsuDwIAzB%2Fuploads%2FDuDC14EhHwA9bRPfKLsI%2FY3S5-CSD521-SYLLABUS-BTECH-CSE-IT.pdf?alt=media&token=a6497b94-b80b-4722-9de3-1f8092947cdc>" %}

## Resources

<details>

<summary>M1: Introduction</summary>

* Introduction to social media and natural language processing research
* Collecting and extracting social media data using APIs

</details>

<details>

<summary>M2: Language Identification and Naïve Bayes</summary>

* Domain and genre difference
* Language identification
* Supervised learning and classification
* Naïve Bayes algorithm
  * Feature selection (Information Gain)
* Text preprocessing
  * Tokenization
  * Emoticons handling
  * Noisy text normalization

</details>

<details>

<summary>M3: Overview of Paraphrase Research</summary>

* Key resources and corpora
  * WordNet
  * DIRT
  * MRPC (Microsoft Research Paraphrase Corpus)
  * PPDB (Paraphrase Database)
* Regression and classification models
  * Linear regression
    * Cost function
    * Gradient descent
  * Logistic regression
    * Decision boundary

</details>

<details>

<summary>M4: Vector Semantics</summary>

* Unsupervised learning techniques
* Class-based clustering
  * Brown clusters
* Soft clustering
  * Singular Value Decomposition (SVD)
* Neural word embeddings
  * Word2Vec
    * CBOW model
    * Skip-gram model

</details>

<details>

<summary>M5: Deep Learning for NLP</summary>

* Neural network fundamentals
  * Neuron and activation function
  * Non-linearity and learning
* Recurrent Neural Networks (RNNs)
  * Long Short-Term Memory (LSTM) networks
* Advanced applications
  * Neural machine translation
  * Neural conversation generation
  * Sentiment analysis
* Deep architectures and attention
  * Convolutional Neural Networks (CNNs)
  * Attention model

</details>

{% hint style="danger" %}
Content Yet to be Updated for this Page, Submit Resources If you want to collaborate, find form attached on this page below.
{% endhint %}

## Notes

### MidTerm

### EndSem

\[⤓]

## Question Directory

\[⤓]

### Assignment Questions

\[⤓]

### Previous Year Questions

#### Mid-Sem-PYQ

<table><thead><tr><th width="81.90771484375">[⤓]</th><th width="554.568115234375">Content Preview</th></tr></thead><tbody><tr><td><a href="https://drive.google.com/uc?export=download&#x26;id=1j1Uj6mkmPhr5eXvly3IjrblwouwllzoD" class="button primary" data-icon="arrow-down-to-square"></a></td><td><a href="https://drive.google.com/file/d/1j1Uj6mkmPhr5eXvly3IjrblwouwllzoD/view?usp=drive_link">Y3S5-CSD501-TSA-MidTerm-PYQ-OCT25</a></td></tr></tbody></table>

#### End-Sem-PYQ

<table><thead><tr><th width="81.9005126953125">[⤓]</th><th width="547.80322265625">Content Preview</th></tr></thead><tbody><tr><td><a href="https://drive.google.com/uc?export=download&#x26;id=162Yz6FHymLTp3rD71NutBY4Dg-5y-A-G" class="button primary" data-icon="arrow-down-to-square"></a></td><td><a href="https://drive.google.com/file/d/162Yz6FHymLTp3rD71NutBY4Dg-5y-A-G/view?usp=drive_link">Y3S5-CSD501-TSA-EndSem-PYQ-JAN23</a></td></tr><tr><td><a href="https://drive.google.com/uc?export=download&#x26;id=1Sl2_bonpkJN2RTW6PDvUpK2NRUCZVKTX" class="button primary" data-icon="arrow-down-to-square"></a></td><td><a href="https://drive.google.com/file/d/1Sl2_bonpkJN2RTW6PDvUpK2NRUCZVKTX/view?usp=drive_link">Y3S5-CSD501-TSA-EndSem-PYQ-DEC23</a></td></tr><tr><td><a href="https://drive.google.com/uc?export=download&#x26;id=1HY1X2aqWNWHUr32PG9yPWjXFdbqTADDE" class="button primary" data-icon="arrow-down-to-square"></a></td><td><a href="https://drive.google.com/file/d/1HY1X2aqWNWHUr32PG9yPWjXFdbqTADDE/view?usp=drive_link">Y3S5-CSD501-TSA-EndSem-PYQ-DEC24</a></td></tr></tbody></table>

## External Sources

\[⤓]

***

{% embed url="<https://discord.gg/6ywR3zbNfg>" %}

{% embed url="<https://mantavyam.notion.site/18152f7cde8880d699a5f2e65f87374e>" %}

{% embed url="<https://mantavyam.notion.site/17e52f7cde8880e0987fd06d33ef6019>" %}
