Skip to content

Conversation

lareinahu-2023
Copy link
Contributor

  1. Uses SentenceTransformer('all-MiniLM-L6-v2') to vectorize the comments column.
  2. Saves the outputs to multiple files:
    .pkl: Full Python dictionary with vectors and metadata
    .npy: Raw NumPy matrix for fast loading
    .csv: Full table with all metadata and embedding values per dimension

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant