How to split strings from streams using Java

Question

I have a huge .txt file whose format looks like this:

29 clueweb12-1500wb-39-00001
19 clueweb12-1500wb-39-00002
20 clueweb12-1500wb-39-00003

I need to read that file line by line and separate each line into two parts. The first part is the score (29, 19, 20) and the second part is the docId (e.g. clueweb12-1500wb-39-00001). I read the txt file line by line using a stream, but how can I put these two parts into separate Strings?

Stream<String> lines = Files.lines(Paths.get("path-to-file"));
lines.forEach(s -> s.split(" "));

Actually, I want to put these parts into a Map<Integer, List<String>>; to do this I need the two parts separately.
– user7780446, Commented Mar 28, 2017 at 14:37

Andrii Abramov · Accepted Answer · 2017-03-28 17:55:03Z

2

To make the code clearer, you could use a simple forEach loop:

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Stream;

/**
 * Takes a stream of lines and groups the second part of each line
 * by the integer at the start of that line.
 */
public Map<Integer, List<String>> split(Stream<String> a) {

    Map<Integer, List<String>> result = new HashMap<>();

    a.forEach(s -> {
        String[] pair = s.split(" ");

        Integer key = Integer.valueOf(pair[0]);
        String value = pair[1];

        // as 4castle suggested - create the list only when the key is missing
        result.computeIfAbsent(key, k -> new ArrayList<>());

        result.get(key).add(value);
    });

    return result;
}

Or you can map your input directly in the stream processing:

a.map(s -> s.split(" "))
 .forEach(pair -> {
     Integer key = Integer.valueOf(pair[0]);
     String value = pair[1];

     result.putIfAbsent(key, new ArrayList<>());    
     result.get(key).add(value);
 });
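The fragment above relies on `a` and `result` being declared elsewhere. As a minimal sketch of how the pieces might fit together (not the answerer's exact code), the program below groups the three sample lines from the question. The class name `ScoreSplitDemo` and the method name `groupByScore` are hypothetical, and the sketch uses `computeIfAbsent` as suggested in the comments. It assumes a sequential stream, because writing into a plain `HashMap` from `forEach` is not safe on a parallel stream.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Stream;

class ScoreSplitDemo {

    // Hypothetical helper: groups docIds by the score at the start of each line.
    static Map<Integer, List<String>> groupByScore(Stream<String> lines) {
        Map<Integer, List<String>> result = new HashMap<>();
        lines.map(line -> line.split(" ", 2))   // at most two parts: score, docId
             .forEach(pair -> result
                 // create the list only the first time a score is seen
                 .computeIfAbsent(Integer.valueOf(pair[0]), k -> new ArrayList<>())
                 .add(pair[1]));
        return result;
    }

    public static void main(String[] args) {
        // Sample data taken from the question
        Stream<String> sample = Stream.of(
            "29 clueweb12-1500wb-39-00001",
            "19 clueweb12-1500wb-39-00002",
            "20 clueweb12-1500wb-39-00003");
        System.out.println(groupByScore(sample));
    }
}
```

`computeIfAbsent` returns the list stored under the key, so the lookup and the `add` can be chained in one statement.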

edited Mar 28, 2017 at 17:55

answered Mar 28, 2017 at 14:56

Andrii Abramov

2 Comments

castletheperson Over a year ago

In order to avoid creating a new ArrayList<>() on the iterations where it's not needed, you can use result.computeIfAbsent(key, k -> new ArrayList<>())

Andrii Abramov Over a year ago

@4castle yes, absolutely right! I will edit the answer.

castletheperson · Accepted Answer · 2017-03-28 15:01:45Z

1

Use Collectors.groupingBy with a downstream collector which gets the second part of the split line before collecting to a list.

Map<Integer, List<String>> table =
    Files.lines(Paths.get("path-to-file"))
         .map(line -> line.split(" ", 2))
         .collect(Collectors.groupingBy(
             parts -> Integer.valueOf(parts[0]),
             Collectors.mapping(parts -> parts[1], Collectors.toList())
         ));
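One detail worth adding: the stream returned by `Files.lines` keeps the file open until it is closed, so wrapping it in try-with-resources is the usual pattern. The following is a rough, self-contained sketch of the same `groupingBy` approach. The class name `GroupScoresFromFile`, the method name `readScores`, and the temporary file are assumptions made for illustration only.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class GroupScoresFromFile {

    // Hypothetical helper: reads the file lazily and closes it when done.
    static Map<Integer, List<String>> readScores(Path file) throws IOException {
        try (Stream<String> lines = Files.lines(file)) {
            return lines
                .map(line -> line.split(" ", 2))   // limit 2: score, then the rest
                .collect(Collectors.groupingBy(
                    parts -> Integer.valueOf(parts[0]),
                    Collectors.mapping(parts -> parts[1], Collectors.toList())));
        }
    }

    public static void main(String[] args) throws IOException {
        // Write the sample lines from the question to a temporary file
        Path file = Files.createTempFile("scores", ".txt");
        Files.write(file, Arrays.asList(
            "29 clueweb12-1500wb-39-00001",
            "19 clueweb12-1500wb-39-00002",
            "20 clueweb12-1500wb-39-00003"));
        System.out.println(readScores(file));
        Files.delete(file);
    }
}
```

The limit of 2 in `split(" ", 2)` means everything after the first space ends up in `parts[1]`, even if it contains further spaces.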

answered Mar 28, 2017 at 15:01

castletheperson

Anonymous · Accepted Answer · 2017-03-28 15:05:49Z

1

The Java streams way, I believe, is:

    Map<Integer, List<String>> parts = lines.map(s -> s.split(" "))
            .collect(Collectors.groupingBy(splitLine -> Integer.valueOf(splitLine[0]),
                    Collectors.mapping(splitLine -> splitLine[1], Collectors.toList())));

This gives you the following map:

{19=[clueweb12-1500wb-39-00002], 20=[clueweb12-1500wb-39-00003], 29=[clueweb12-1500wb-39-00001]}

Its toString method doesn’t give you the most readable output, but I believe it’s the map you asked for. For now there is only one string in each list, but if multiple lines have the same score, there will be more.
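To illustrate the point about several lines sharing a score, here is a small sketch that adds one extra sample line with a repeated score and prints one entry per line. Collecting into a `TreeMap` (via the three-argument `groupingBy`) is a choice made here so the scores come out sorted; it is not part of the answer above. The class name `SortedScoreTable`, the method name `group`, and the fourth sample line are hypothetical.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class SortedScoreTable {

    // Hypothetical helper: same grouping as above, but into a sorted TreeMap.
    static Map<Integer, List<String>> group(Stream<String> lines) {
        return lines.map(s -> s.split(" ", 2))
            .collect(Collectors.groupingBy(
                splitLine -> Integer.valueOf(splitLine[0]),
                TreeMap::new,   // keeps the keys in ascending order
                Collectors.mapping(splitLine -> splitLine[1], Collectors.toList())));
    }

    public static void main(String[] args) {
        Map<Integer, List<String>> parts = group(Stream.of(
            "29 clueweb12-1500wb-39-00001",
            "19 clueweb12-1500wb-39-00002",
            "20 clueweb12-1500wb-39-00003",
            "29 clueweb12-1500wb-39-00004"));   // made-up line with a repeated score

        // Print one score per line instead of relying on Map.toString()
        parts.forEach((score, docIds) -> System.out.println(score + " -> " + docIds));
    }
}
```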

edited Mar 28, 2017 at 15:05

answered Mar 28, 2017 at 15:00

Anonymous

rohan · Accepted Answer · 2017-03-28 14:58:27Z

You can load the data into a HashMap like this: read the file line by line, split each line with String.split, and store the two parts as a key-value pair.

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.HashMap;

public static HashMap<Integer, String> readFile(String fileName) throws IOException {
    // try-with-resources closes the reader even if reading fails
    try (BufferedReader br = new BufferedReader(new FileReader(fileName))) {
        HashMap<Integer, String> fileData = new HashMap<>();
        String line = br.readLine();

        while (line != null) {
            // lineData[0] is the score, lineData[1] is the docId
            String[] lineData = line.split(" ");
            System.out.println(lineData[0] + " " + lineData[1]);
            fileData.put(Integer.valueOf(lineData[0]), lineData[1]);
            line = br.readLine();
        }
        return fileData;
    }
}
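One caveat with a `HashMap<Integer, String>`: `put` replaces the previous value, so if two lines share a score only the last docId survives. If the `Map<Integer, List<String>>` mentioned in the question's comment is what is wanted, a variant along these lines might work. This is only a sketch; the class name `ReaderScoreLoader` and the method name `load` are hypothetical, and it takes a `Reader` so it can be tried without a file on disk.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ReaderScoreLoader {

    // Hypothetical helper: keeps every docId for a score instead of overwriting.
    static Map<Integer, List<String>> load(Reader source) throws IOException {
        Map<Integer, List<String>> fileData = new HashMap<>();
        try (BufferedReader br = new BufferedReader(source)) {
            String line;
            while ((line = br.readLine()) != null) {
                String[] lineData = line.split(" ", 2);
                fileData.computeIfAbsent(Integer.valueOf(lineData[0]), k -> new ArrayList<>())
                        .add(lineData[1]);
            }
        }
        return fileData;
    }

    public static void main(String[] args) throws IOException {
        // In-memory stand-in for the file; pass a FileReader for a real file
        String sample = "29 clueweb12-1500wb-39-00001\n"
                      + "19 clueweb12-1500wb-39-00002\n"
                      + "20 clueweb12-1500wb-39-00003\n";
        System.out.println(load(new StringReader(sample)));
    }
}
```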
