tutanota/packages/tutanota-utils
map 5293be6a4a
Implement spam training data sync and add TutanotaModelV98
We sync the spam training data encrypted through our server to make
sure that all clients for a specific user behave the same when
classifying mails. Additionally, this enables the spam classification
in the webApp. We compress the training data vectors
(see clientSpamTrainingDatum) before uploading to our server using
SparseVectorCompressor.ts. When a user has the ClientSpamClassification
enabled, the spam training data sync will happen for every mail
received.

ClientSpamTrainingDatum are not stored in the CacheStorage.
No entityEvents are emitted for this type.
However, we retrieve creations and updates for ClientSpamTrainingData
through the modifiedClientSpamTrainingDataIndex.

We calculate a threshold per classifier based on the dataset ham to spam
ratio, we also subsample our training data to cap the ham to spam ratio
within a certain limit.

Co-authored-by: jomapp <17314077+jomapp@users.noreply.github.com>
Co-authored-by: das <das@tutao.de>
Co-authored-by: abp <abp@tutao.de>
Co-authored-by: Kinan <104761667+kibibytium@users.noreply.github.com>
Co-authored-by: sug <sug@tutao.de>
Co-authored-by: nif <nif@tutao.de>
Co-authored-by: map <mpfau@users.noreply.github.com>
2025-11-18 13:56:19 +01:00
..
lib Implement spam training data sync and add TutanotaModelV98 2025-11-18 13:56:19 +01:00
test Add an option for admins to enforce 2FA 2025-10-22 10:09:21 +02:00
LICENSE.txt Create utils module (#3627) 2021-11-04 14:05:23 +01:00
package.json v314.251117.0 2025-11-17 13:32:33 +01:00
README.md Convert codebase to Typescript, close #3746 close #3594 2022-01-07 16:21:07 +01:00
tsconfig.json Run prettier on the whole project 2022-12-27 16:08:23 +01:00

tutanota-utils

This is a collection of common utils we use across multiple projects/modules internally. As creating this module really is just an intermediate step towards re-organising some of the dependency structure of our software, it is most likely going to change a lot in the future and might even disappear altogether. For these reasons we strongly discourage anyone outside of our organisation to directly depend on this module.