Writing Fake Scotch Reviews

Photo by Adam Jaime on Unsplash

Data

Distribution of points given in each product review. The reviews are generally very positive. The mean score was 87/100. The lowest score was 63/100, and the highest was 97/100.

Model Design

A basic illustration of how the network maps an input sequence of tokens to integers, and how the last word of the sequence (output) is decided. In this illustration, the output is decided deterministically, though stochastic methods may also be used.

Training

Generating Fake Reviews

  1. Decide the length of the fake review (This was randomly sampled uniformly between (76, 95) tokens, which was the IQR of the length of the reviews in the dataset.
  2. Randomly initialize the sequence with n - 1 random tokens.
  3. Feed the input sequence into the model to predict the next token.
  4. Append the last token to the end of the sequence.
  5. Take the last n - 1 tokens in the sequence and repeat steps 3 and 4 until the desired length is reached.

Results

A:

B:

C:

D:

E:

Further considerations

--

--

--

mostly for fun

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Classifying Documents with Quantum-enhanced Transfer Learning

Deep Learning Techniques for Text Classification

In Depth: Parameter tuning for Random Forest

MediaPipe tutorial: Find memes that match your facial expression 😮

A Quick Guide to Gradient Descent and its Variants

AUTOENCODER

Recurrent Neural Networks

Single board computing now a days is the heart of innovative projects by hobbyist and professionals.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Alex Leung

Alex Leung

mostly for fun

More from Medium

2022 Machine Learning Baseball Projections: A Look Back at Things to Come

Training Undergraduate Medical Students in Artificial Intelligence: A Student Perspective

The Machine Learning You did in High School

Proving global warming is not based on data and science