NLP-related
By Jonathan St-Onge
Edited Jul 1, 2023

Traditional text analysis of complaints
// ADD CUSTOM STOPWORDS HERE
customStopwords = [
  'airtransat', 'rouge', 'tel', 'aircanada', 'miami', 'francfort',
  'frankfurt', 'lima', 'air', 'canada', 'flight', 'newark',
  'aviv', 'ons', 'yeah', 'faudra'
]
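To make the role of this list concrete, here is a minimal sketch of stopword filtering in plain JavaScript. The helper `removeStopwords` and the sample sentence are hypothetical, and the tokenizer inside tiny-tfidf may well split text differently; the sketch only shows the idea of dropping airline and destination names before counting terms.

```javascript
// Hypothetical helper (not part of tiny-tfidf): lowercase the text,
// split it on anything that is not a letter, and drop stopwords.
function removeStopwords(text, stopwords) {
  const stop = new Set(stopwords.map((w) => w.toLowerCase()));
  return text
    .toLowerCase()
    .split(/[^\p{L}]+/u) // Unicode-aware, so accented French letters survive
    .filter((token) => token.length > 1 && !stop.has(token));
}

// A few entries taken from the customStopwords list above.
const sampleStopwords = ["airtransat", "rouge", "air", "canada", "flight"];

// Made-up complaint text, for illustration only.
const tokens = removeStopwords(
  "Air Canada Rouge flight delayed, luggage lost",
  sampleStopwords
);
console.log(tokens); // ["delayed", "luggage", "lost"]
```

Without the custom list, brand names such as "air" and "canada" would likely dominate the top terms of every airline and hide the words that describe the complaints themselves.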
matrix = similarity.getDistanceMatrix()
similarity = new tfidf.Similarity(corpus)
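As a rough illustration of what a pairwise distance matrix over documents can look like, the sketch below computes cosine distance (1 minus cosine similarity) between sparse term-weight vectors. It is written in plain JavaScript without tiny-tfidf; the function names and the toy vectors are hypothetical, and the assumption that the library's matrix is based on cosine similarity should be checked against its documentation.

```javascript
// Cosine similarity between two sparse vectors stored as Map(term -> weight).
function cosineSimilarity(a, b) {
  let dot = 0;
  for (const [term, weight] of a) {
    if (b.has(term)) dot += weight * b.get(term);
  }
  const norm = (v) =>
    Math.sqrt([...v.values()].reduce((sum, w) => sum + w * w, 0));
  const denom = norm(a) * norm(b);
  return denom === 0 ? 0 : dot / denom; // empty vectors count as dissimilar
}

// Square matrix of pairwise distances: 0 = same direction, 1 = no shared terms.
function distanceMatrix(vectors) {
  return vectors.map((a) => vectors.map((b) => 1 - cosineSimilarity(a, b)));
}

// Toy term weights for three made-up documents.
const vectors = [
  new Map([["delay", 1], ["luggage", 1]]),
  new Map([["delay", 1], ["luggage", 1]]),
  new Map([["meal", 2]]),
];
const distances = distanceMatrix(vectors);
console.log(distances[0][2]); // 1, since the documents share no terms
```

The first two documents have identical weights, so their distance is zero up to floating-point rounding, while the third shares no terms with them and sits at the maximum distance of 1.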
dat = db.query(`SELECT GROUP_CONCAT(${select2}) as text, airline FROM data WHERE lang = '${select1}' GROUP BY airline`)
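The query above collapses all reviews of one airline, in the selected language, into a single text per airline. A minimal plain-JavaScript sketch of the same grouping is shown here on made-up rows; `groupConcat` and the sample data are hypothetical, and the comma separator is an assumption about the default behavior of `GROUP_CONCAT` in DuckDB.

```javascript
// Made-up rows standing in for the `data` table (hypothetical values).
const rows = [
  { airline: "A", lang: "fr", text: "vol retardé" },
  { airline: "A", lang: "fr", text: "bagage perdu" },
  { airline: "B", lang: "fr", text: "bon service" },
  { airline: "B", lang: "en", text: "late flight" },
];

// Keep rows in one language, then join the chosen column per airline.
function groupConcat(rows, lang, column, separator = ",") {
  const byAirline = new Map();
  for (const row of rows) {
    if (row.lang !== lang) continue; // WHERE lang = ...
    if (!byAirline.has(row.airline)) byAirline.set(row.airline, []);
    byAirline.get(row.airline).push(row[column]); // GROUP BY airline
  }
  return [...byAirline].map(([airline, parts]) => ({
    text: parts.join(separator),
    airline,
  }));
}

const grouped = groupConcat(rows, "fr", "text");
console.log(grouped);
```

Each resulting object has the same shape as a row of `dat`: one `text` field holding every review for that airline and one `airline` field, which is what lets each airline be treated as a single document later on.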
airlines = dat.map(d => d.airline)
complaints = dat.map(d => d.text)
corpus = new tfidf.Corpus(airlines, complaints, toggle, customStopwords, K1, b)
marginwidth = 80
K1 = 2.0
b = 0.75
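`K1` and `b` are the usual names of the two tuning parameters in BM25-style term weighting: `K1` controls how quickly repeated occurrences of a term saturate, and `b` controls how strongly long documents are penalized. The sketch below implements the textbook formula in plain JavaScript. The function names are hypothetical, and whether tiny-tfidf computes exactly this expression is an assumption to verify against the library source.

```javascript
// Textbook BM25-style term weight.
// tf: occurrences of the term in the document
// idf: collection-level weight of the term
// docLength / avgLength: document length relative to the corpus average
function bm25Weight(tf, idf, docLength, avgLength, K1 = 2.0, b = 0.75) {
  const normalizedLength = docLength / avgLength;
  return (idf * tf * (K1 + 1)) / (K1 * (1 - b + b * normalizedLength) + tf);
}

// One common idf variant (an assumption, other definitions exist).
function inverseDocumentFrequency(totalDocs, docsWithTerm) {
  return Math.log(totalDocs + 1) - Math.log(docsWithTerm);
}

// Average-length document, idf fixed at 1 to isolate the effect of tf.
const once = bm25Weight(1, 1, 100, 100);
const twice = bm25Weight(2, 1, 100, 100);
// Same term count in a document twice the average length.
const onceInLongDoc = bm25Weight(1, 1, 200, 100);

console.log(once, twice, onceInLongDoc);
```

Doubling the term frequency raises the weight by less than a factor of two, which is the saturation controlled by `K1`; the same count in a longer document scores lower, which is the length penalty controlled by `b`. Setting `b` to 0 removes the length penalty entirely.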
uniq_lang = db.query(`SELECT DISTINCT(lang) as lang from data`)
uniq_airline = db.query(`SELECT DISTINCT(airline) as airline from data`)
db = DuckDBClient.of({
  data: FileAttachment("tripadvisor_reviews_multi_downsampled_trans.parquet")
})
tfidf = import("tiny-tfidf")