" COMPARING MACHINE LEARNING AND HAND-CRAFTED APPROACHES FOR INFORMATION EXTRACTION FROM HTML DOCUMENTS.POMPES PAR TRANSITIONS MULTIPLES "