Could you point me to glove.6B.300d.pickle? #8

aerinkim · 2018-10-24T22:47:54Z

The usual file is glove.6B.300d.txt, but it looks like you did some preprocessing here. I'd appreciate it if you could share how you pickled it.

rjadr · 2019-04-13T06:47:43Z

This is how I generated it:

import pickle
import numpy as np

word_dict = {}  # word -> row index in the embedding matrix
wordvec = []

# GloVe text files are UTF-8; each line is "<word> <v1> ... <v300>".
with open('glove.6B.300d.txt', 'r', encoding='utf-8') as f:
    for idx, line in enumerate(f):
        # Strip only the newline, so a last line without one keeps its final digit.
        word_split = line.rstrip('\n').split(' ')
        word_dict[word_split[0]] = idx
        wordvec.append([float(e) for e in word_split[1:]])

embedding = np.array(wordvec)
pickling = {'embedding': embedding, 'word_dict': word_dict}

with open('glove.6B.300d_pickle', 'wb') as g:
    pickle.dump(pickling, g)
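
For anyone who wants to read the result back, here is a minimal sketch of a lookup against the dict layout written by the script above. The helper name lookup_vector is hypothetical and not part of the repository; it assumes the pickle contains exactly the 'embedding' and 'word_dict' keys shown.

```python
import pickle


def lookup_vector(pickle_path, word):
    """Return the embedding row for `word`, or None if the word is unknown.

    Assumes the pickle holds {'embedding': ndarray, 'word_dict': dict}.
    `lookup_vector` is a hypothetical helper, not part of the repository.
    """
    # Only unpickle files you created yourself: pickle.load can run arbitrary code.
    with open(pickle_path, 'rb') as fh:
        data = pickle.load(fh)

    idx = data['word_dict'].get(word)
    if idx is None:
        return None
    return data['embedding'][idx]
```

Calling lookup_vector('glove.6B.300d_pickle', 'the') should give back a 300-element NumPy array. The function reloads the whole file on every call, so for repeated lookups it is better to load once and keep the dict around.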

ott-fogliata · 2019-07-05T14:14:39Z

@rjadr To use it with the paraphraser code, you need some changes. The version below pickles a (word_to_id, id_to_word, embedding) tuple instead of a dict:

import pickle
import numpy as np

word_to_id = {}
id_to_word = {}
wordvec = []

# GloVe text files are UTF-8; each line is "<word> <v1> ... <v300>".
with open('glove.6B.300d.txt', 'r', encoding='utf-8') as f:
    for idx, line in enumerate(f):
        # Strip only the newline, so a last line without one keeps its final digit.
        word_split = line.rstrip('\n').split(' ')
        word = word_split[0]
        word_to_id[word] = idx
        id_to_word[idx] = word
        wordvec.append([float(e) for e in word_split[1:]])

embedding = np.array(wordvec)

# The pickle holds a tuple: (word_to_id, id_to_word, embedding).
pickling = word_to_id, id_to_word, embedding

with open('glove.6B.300d.pickle', 'wb') as g:
    pickle.dump(pickling, g)
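
Before pointing the paraphraser at the file, it may be worth checking that the three objects agree with each other. The following is only a sketch, assuming the tuple layout written above; check_glove_pickle is a hypothetical name, not something the paraphraser code provides.

```python
import pickle


def check_glove_pickle(pickle_path):
    """Load the (word_to_id, id_to_word, embedding) tuple and verify it is consistent.

    Returns (vocabulary size, vector dimension).
    `check_glove_pickle` is a hypothetical helper, not part of the repository.
    """
    # Only unpickle files you created yourself: pickle.load can run arbitrary code.
    with open(pickle_path, 'rb') as fh:
        word_to_id, id_to_word, embedding = pickle.load(fh)

    # There must be exactly one embedding row per vocabulary entry.
    if not (embedding.shape[0] == len(word_to_id) == len(id_to_word)):
        raise ValueError('vocabulary size and embedding rows do not match')

    # The two dictionaries must be inverses of each other.
    for word, idx in word_to_id.items():
        if id_to_word.get(idx) != word:
            raise ValueError('id_to_word does not invert word_to_id at %r' % word)

    return len(word_to_id), embedding.shape[1]
```

For glove.6B.300d.pickle the second returned value should be 300. A mismatch between the first value and the number of lines in the text file would suggest duplicate words, since a repeated word overwrites its earlier entry in word_to_id.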
