Memri / Twitter Topic Model
Commit e621cede, authored 2 years ago by Eelco van der Wel
smaller min topic size
parent 88918508
Pipeline #12558 passed with stage in 9 minutes and 39 seconds
Showing 3 changed files with 2 additions and 2 deletions
twitter_topic_model/clustering.py  +1 -0
twitter_topic_model/model.py  +0 -1
twitter_topic_model/plugin.py  +1 -1
twitter_topic_model/clustering.py  +1 -0
@@ -10,6 +10,7 @@ def compute_umap(embeddings: np.ndarray) -> np.ndarray:
         min_dist=0.0,
         metric='cosine',
         low_memory=False,
         random_state=42,
     )
     return umap.fit_transform(embeddings)
twitter_topic_model/model.py  +0 -1
 from dataclasses import dataclass
 from typing import Dict, List, Optional, Set, Tuple, Union
 import hdbscan
 import numpy as np
 import torch
 from keybert import KeyBERT
 ...
twitter_topic_model/plugin.py  +1 -1
@@ -18,7 +18,7 @@ from .preprocessing import PreprocessedTweet, preprocess_tweets
 from .schema import Cluster, ClusterEntry, TwitterTopicModel
 from .utils import get_tweets
-MIN_TOPIC_SIZE = 8
+MIN_TOPIC_SIZE = 6
 NUM_TOPIC_DESCRIPTORS = 3
 DESCRIPTION_DIVERSITY = 0.3
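The diff does not show where MIN_TOPIC_SIZE is consumed, so the following is only a rough sketch of what lowering it from 8 to 6 could mean, assuming the constant acts as a lower bound on cluster size (for example as HDBSCAN's min_cluster_size). The helper drop_small_topics and the label data are hypothetical and are not part of the repository. The sketch is a plain size filter, not HDBSCAN itself.

```python
from collections import Counter
from typing import List

NOISE = -1  # label conventionally used for points that belong to no cluster


def drop_small_topics(labels: List[int], min_topic_size: int) -> List[int]:
    """Relabel every cluster with fewer than min_topic_size members as noise.

    Hypothetical helper for illustration only; the real plugin may apply
    the threshold differently (e.g. inside the clustering algorithm).
    """
    sizes = Counter(label for label in labels if label != NOISE)
    return [
        label if label != NOISE and sizes[label] >= min_topic_size else NOISE
        for label in labels
    ]


# Made-up labels: topic 0 has 9 tweets, topic 1 has 7, topic 2 has 4.
labels = [0] * 9 + [1] * 7 + [2] * 4

kept_at_8 = set(drop_small_topics(labels, 8)) - {NOISE}  # old threshold
kept_at_6 = set(drop_small_topics(labels, 6)) - {NOISE}  # new threshold
```

Under this assumption, the 7-tweet topic is discarded at a threshold of 8 but kept at 6, which is the kind of effect the commit message "smaller min topic size" suggests.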