Deep Learning – IIT Ropar Week 9 Assignment Answers

Deep Learning - IIT Ropar

Deep Learning – IIT Ropar Week 9 Assignment Answers (Jan-Apr 2026)


1. What is the length of the one-hot vector used to represent each word in this system?

  • 3
  • 8,000
  • 10,000
  • Depends on document length
Answer : b

2. Why are one-hot encoded vectors considered highly sparse?

  • They contain negative values
  • They have repeated values
  • Only one element is non-zero
  • They are normalized vectors
Answer : c

3. How many non-zero entries are present in the one-hot representation of Document A?

  • 3
  • 1
  • 8000
  • Depends on vocabulary
Answer : a

4. Which of the following are limitations of one-hot encoding highlighted by this case study?

  • High dimensionality
  • No notion of semantic similarity
  • Large memory requirement
  • Dense representations
Answer : a, b, c

5. Why does cosine similarity work reasonably well for document comparison in this setup?

  • It is independent of document length
  • It measures overlap in word usage
  • It captures semantic meaning of words
  • Works only for dense vectors
Answer : a, b

6. Why does the team choose a context window size of 1?

  • To reduce vocabulary size
  • To capture only immediate word relationships
  • To remove stopwords automatically
  • To ensure dense representations
Answer : b

7. In the sentence
government announces new education policy,
how many context contributions does the word education make when using an a context window of 1?

  • 0
  • 1
  • 2
  • 3
Answer : c

8. Across the entire corpus, how many times does the word policy appear adjacent to the word education?

  • 1
  • 2
  • 3
  • 4
Answer : b

9. Which of the following statements about co-occurrence matrices are true in this case?

  • Matrix size depends on vocabulary size
  • Larger context windows capture broader relationships
  • Co-occurrence matrices are always dense
  • Word order within the context window is ignored
Answer : a, b, d

10. If the vocabulary size is 6, the total number of entries in the co-occurrence matrix is __________.

Answer : 36

11. Why is truncated SVD applied to the co-occurrence matrix in this system?

  • To make the matrix diagonal
  • To remove stopwords automatically
  • To reduce dimensionality while preserving structure
  • To increase sparsity
Answer : c

12. What property of truncated SVD enables semantically related words to cluster together?

  • Orthogonality of one-hot vectors
  • Preservation of exact co-occurrence counts
  • Capture of latent semantic structure
  • Removal of rare words
Answer : c

13. What is the size of the reduced word representation matrix Ukk ?

  • 8000 × 8000
  • 300 × 300
  • 8000 × 300
  • 300 × 8000
Answer : c

14. Why is truncated SVD suitable for the public feedback system?

  • It requires labeled training data
  • It can suppress less important components by keeping only top-k factors
  • It enables efficient similarity computation
  • It improves scalability of text processing
Answer : b, c, d

15. After truncated SVD with k = 300, each word is represented by a vector of length __________.

Answer : 300