FILE : wsj_002.txt -a--- (NP (CD 61) (NNS years) ) -the-a- (NP (DT the) (NN board) ) -a--- (PP-CLR (IN as) -a--- (NP (DT a) (JJ nonexecutive) (NN director) ) ) ( (S -a--- (NP (NN chairman) ) ---the- (NP (DT the) (NNP Dutch) (VBG publishing (NN group) ))))) ( (S -a--- (NP (CD 55) (NNS years) ) -a--- (CC and ) -a--- (NP (JJ former) (NN chairman) ) -a--- (NP (NNP Consolidated) (NNP Gold) (NNP Fields) (NNP PLC) )))) -a--- (VP (VBD was) -a--- (VP (VBN named) (S -a--- (NP (DT a) (JJ nonexecutive) (NN director) ) -a--- (NP (DT this) (JJ British) (JJ industrial) (NN conglomerate) )))))) ( (S -a--- (NP (NN asbestos) ) ) ) -a--- (VP (VB make ) -a--- (NP (NNP Kent) (NN cigarette) (NNS filters) ) ) ) ) ) ) ) -a--- (VP (VBZ has ) -a--- (VP (VBN caused) -a--- ( NP ( DT a ) ( 3J high ) ( NN percentage ) -a--- (NP (NN cancer) (NNS deaths) ) ) -a--- ( PP-LOC ( IN among) -a--- (NP (DT a) (NN group) ) -a--- ( QP ( RBR more ) ( IN tilan ) ( CD 3 0 ) -a--- (NNS years ) -a--- ( IN ago) ) ) ) ) ) ) ) ) ) ) ) -a--- (NP -SBJ (NNS researchers ) (S (-NONE- *T*-1.) ) ) ) ( (S -a--- (NP (DT The) (NN asbestos) (NN f. ber) -a--- (ADJP-PRD (RB unusually) (JJ resilient) ) (S ---the- (NP (DT the) (NNS lungs) ) ) ) ) -a--- (VP (VBG causing) -a--- (WHNP-1 (WDT that ) (S -a--- (NP (NNS decades ) -a--- (JJ later) ) ) ) ) ) ) ) ) ) ) -a--- (NP-SBJ (NNS researchers ) -a--- (VP (VBD said) (S (-NONE- *T*-2) ) ) ) ( (S -a--- (NP (NNP Lorillard) (NNP Inc. ) ---the- (NP (DT the) (NN unit) ) -a--- ( ADJP ( JJ New ) ( JJ York-based ) -a--- (WHNP-2 (WDT that ) (S -a--- (VP (VBZ makes) -a--- (NP (NNP Kent) (NNS cigarettes) ) ) ) ) ) -a--- (NP (PRP$ its) (NN Micronite) (NN ci~arette) (NNS filters) ) ) In the file wsj_002.txt Nb of sentences : 11 nb of string (a) : 45 nb of words by itself ( a ) : 4 nb of string beginning a word (a...) : 6 nb of string finishing a word (...a) : 0 nb of string inside a word (...a...) : 34 In the file wsj_002.txt Nb of sentences : 11 nb of string (the) : 4 nb of words by itself ( the ) : 4 nb of string beginning a word (the...) : 0 nb of string finishing a word (...the) : 0 nb of string inside a word (...the...) : 0