commit
This commit is contained in:
parent
6edab5ee62
commit
1d3d13094d
|
@ -7,12 +7,12 @@ The csv file *classification_labelled_corrected.csv* contains 1497 labeled news
|
||||||
**Best F1 score results**:
|
**Best F1 score results**:
|
||||||
|
|
||||||
* **Support Vector Machines Classifier (SVM):**
|
* **Support Vector Machines Classifier (SVM):**
|
||||||
F1 score: 0.8944166649330559
|
F1 score: 0.894
|
||||||
Best parameters set found on development set:
|
Best parameters set found on development set:
|
||||||
{'SVC\__C': 0.1, 'SVC\__gamma': 0.01, 'SVC\__kernel': 'linear', 'perc\__percentile': 50}
|
{'SVC\__C': 0.1, 'SVC\__gamma': 0.01, 'SVC\__kernel': 'linear', 'perc\__percentile': 50}
|
||||||
|
|
||||||
* **Naive Bayes Classifier**:
|
* **Naive Bayes Classifier**:
|
||||||
F1 score: 0.8324014738144634 (average)
|
F1 score: 0.832 (average)
|
||||||
Parameters: SelectPercentile(25), own Bag of Words implementation, 10-fold cross validation
|
Parameters: SelectPercentile(25), own Bag of Words implementation, 10-fold cross validation
|
||||||
|
|
||||||
The complete documentation can be found in the latex document in the *thesis* folder.
|
The complete documentation can be found in the latex document in the *thesis* folder.
|
||||||
|
@ -35,7 +35,7 @@ The scripts can be called separately.
|
||||||
You need to enter a valid personal key for *webhose.io* before you call *Requester.py*.
|
You need to enter a valid personal key for *webhose.io* before you call *Requester.py*.
|
||||||
To run *NER.py* you need to change the path to the JAVAHOME environment variable in *find_companies* method.
|
To run *NER.py* you need to change the path to the JAVAHOME environment variable in *find_companies* method.
|
||||||
|
|
||||||
|
---
|
||||||
**Author:** Anne Lorenz / Datavard AG
|
**Author:** Anne Lorenz / Datavard AG
|
||||||
|
|
||||||
**Project Status:** work in progress
|
**Project Status:** work in progress
|
Loading…
Reference in New Issue