Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added file input and output Akashlogics1 #151

Merged
merged 3 commits into from
Oct 8, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view

Large diffs are not rendered by default.

Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
20 changes: 20 additions & 0 deletions Natural Language Processing/webscrating and nltk score/reabme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
The notebook appears to perform text analysis and sentiment scoring on a given dataset or text. Here's a step-by-step explanation of the processes involved:

1. **Sentiment Analysis and Readability Calculation:**
- The code first analyzes a piece of text to calculate sentiment scores using measures like polarity (how positive/negative the text is) and subjectivity (how opinionated the text is).
- It calculates important text statistics such as the total number of words, sentences, and the Fog Index, which is a measure of text complexity. A higher Fog Index means the text is more difficult to read.

2. **Word and Sentence Metrics:**
- It calculates the average number of words per sentence, an important readability measure. Simpler texts usually have shorter sentences, making them easier to read.
- It also counts the total number of words in the text.

3. **Personal Pronoun Count:**
- The code counts the occurrences of specific personal pronouns such as "I," "we," "my," "ours," and "us." This can be used to analyze the tone or focus of the text (e.g., whether the text is more personal or impersonal).

4. **Average Word Length:**
- The code calculates the average length of the words used in the text. Longer words tend to make the text more complex, while shorter words are easier to understand.

5. **Results Output:**
- After running the calculations, the script prints out the sentiment scores, word and sentence metrics, and readability index. These outputs help in understanding the overall structure and tone of the text.

In summary, the notebook performs an in-depth analysis of a text, focusing on sentiment (polarity and subjectivity), readability (Fog Index, word and sentence metrics), and the use of personal pronouns. This information can be useful for content evaluation, for example, determining how easy a piece of text is to read or how opinionated it is.
148 changes: 148 additions & 0 deletions Natural Language Processing/webscrating and nltk score/result.csv

Large diffs are not rendered by default.

Loading