
One of the main processes in cookit is extracting recipe information from raw HTML. I know it isn't the most elegant solution, but it is the only universal one. But to the point. Every web page goes through a process involving HTML parsing, stemming, parsing, and n-gram token matching. The result is then saved to SQL Server and, after transformation, to Solr. That means a lot of string manipulation, math calculations and, from time to time, garbage collection, mostly gen 0. In the most pessimistic case this process has to be r...
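
To make the pipeline above a bit more concrete, here is a minimal sketch of its first stages (HTML parsing, stemming, n-gram token matching) in Python. It is not cookit's code: the names `html_to_text`, `toy_stem`, `ngrams` and `match_phrases` are made up for illustration, the "stemmer" only strips a trailing "s", and the list of known phrases is an assumption standing in for whatever dictionary the real process uses.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html):
    """Return the visible text of an HTML fragment as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def toy_stem(word):
    """Hypothetical stand-in for a real stemmer: drops a trailing 's'."""
    return word[:-1] if len(word) > 3 and word.endswith("s") else word


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens only, and stem each one."""
    return [toy_stem(w) for w in re.findall(r"[a-z]+", text.lower())]


def ngrams(tokens, n):
    """Return all runs of n consecutive tokens as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def match_phrases(html, phrases, max_n=3):
    """Find which known phrases occur in the page as stemmed n-grams."""
    # Stem the known phrases the same way as the page text, so both sides
    # of the comparison use the same token form.
    known = {tuple(tokenize(p)) for p in phrases}
    tokens = tokenize(html_to_text(html))
    found = set()
    for n in range(1, max_n + 1):
        for gram in ngrams(tokens, n):
            if gram in known:
                found.add(" ".join(gram))
    return sorted(found)


if __name__ == "__main__":
    page = "<ul><li>2 red onions</li><li>olive oil</li></ul>"
    print(match_phrases(page, ["red onion", "olive oil", "garlic"]))
```

Even in this toy form every stage allocates new strings, lists and tuples for each page, which is the kind of short-lived garbage that gen 0 collections clean up in a .NET process.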

[EN] Local optimizations don't add up – IndexOutOfRange

Sztuka programowania | 3085 days, 8 hours, 34 minutes ago | maklipsa | 60
