We currently have a large file of tab-separated values: over 100 million lines, about 1.5 GB in size. Does anyone know of a fast way to sort this file by one of its fields? We already tried Hive, but that was too slow. Would Python be able to do this?
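To make the question concrete, here is a rough sketch of the kind of Python approach we have in mind: an external merge sort that sorts the file in chunks that fit in memory, writes each sorted chunk to a temporary file, and then merges the chunks with `heapq.merge`. The function name `sort_tsv`, its parameters, and the chunk size are all our own assumptions for illustration. It compares the chosen field as text, so a numeric column would need a different key. It has not been benchmarked against a file of this size.

```python
import heapq
import os
import tempfile
from itertools import islice


def sort_tsv(in_path, out_path, field, chunk_lines=1_000_000):
    """Sort a tab-separated file by one column (0-based index `field`).

    Hypothetical helper: sorts chunks in memory, then merges them.
    The field is compared as a string, not as a number.
    """

    def key(line):
        # Strip the line ending, then pick the requested column.
        return line.rstrip("\n").split("\t")[field]

    chunk_paths = []
    try:
        # Pass 1: read the input in chunks, sort each, spill to a temp file.
        with open(in_path, encoding="utf-8") as src:
            while True:
                chunk = list(islice(src, chunk_lines))
                if not chunk:
                    break
                # Make sure the final line of the file ends with a newline,
                # otherwise it would be glued to the next line after merging.
                if not chunk[-1].endswith("\n"):
                    chunk[-1] += "\n"
                chunk.sort(key=key)
                fd, path = tempfile.mkstemp(suffix=".tsv")
                with os.fdopen(fd, "w", encoding="utf-8") as tmp:
                    tmp.writelines(chunk)
                chunk_paths.append(path)

        # Pass 2: lazily merge the sorted chunk files into the output.
        files = [open(p, encoding="utf-8") for p in chunk_paths]
        try:
            with open(out_path, "w", encoding="utf-8") as dst:
                dst.writelines(heapq.merge(*files, key=key))
        finally:
            for f in files:
                f.close()
    finally:
        # Remove the temporary chunk files whether or not the sort succeeded.
        for p in chunk_paths:
            os.remove(p)
```

A call such as `sort_tsv("data.tsv", "data.sorted.tsv", field=2)` would sort by the third column (the file names here are placeholders). With the default chunk size, a 100-million-line file would produce about 100 temporary files, which together need roughly as much free disk space as the input.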