Why would I use the hoop to search strings in multiple large files instead of using unix tools outside the box , Grep and cat?
Thanks Ashish
Hadoop is designed to find many files and results Other tools mentioned in the coalition will not do this (without too much work), especially since Hadop does this (in multiple servers), in parallel , it would mean that "lots of work" done in some language Should support that parallelism. Python can be used for ad hoc prototyping, but it is unlikely that split, grep and cat like Bash are comparable (unless you greatly reduce the requirements).
No comments:
Post a Comment