When using google protocol buffers to transfer String character,got messy code

Question

In debug view:

Here is the code which encodes into messy string...

((S2CEnterCollection)objS2c).toByteString().toStringUtf8();

Output:

    ���"default(
    ���"default(
    ���"default(
    ���"default(
    ���"default(
    ����"default(
    ����"default(
    �����"default(

Here is the code which has the right string:

((S2CEnterCollection)objS2c).toString()

The original string was:

    cardList {
      cardId: 100001
      liked: 100
      number: 10
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100002
      liked: 123
      number: 10
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100003
      liked: 543
      number: 10
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100004
      liked: 766
      number: 10
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100005
      liked: 78
      number: 10
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100006
      liked: 89
      number: 123
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100007
      liked: 199
      number: 567
      finder: "default"
      rank: 1
    }
    cardList {
      cardId: 100008
      liked: 90909
      number: 232
      finder: "default"
      rank: 1
    }

So, does anyone know how it works?

Hi Ryan, you might try adding in the code you used that generated this. Also, what character encoding are you using? — jmort253
hi, @jmort253, i was using utf-8 encoding which is the default. And I tried to code like : new String(((S2CEnterCollection)objS2c).toString().getBytes(Charset.forName("utf-8"))); which worked well and gave the expected result.. But as u can see,this way didn't include any protocol buffers framework and it seemed like i got straight and back of data transferring which actually just return the string of the object as ((S2CEnterCollection)objS2c).toString()... — Ryan Zhu
sorry for a mistake,actually i didn't use Chinese, just english... but still got messy data... — Ryan Zhu
Then you should edit your question title to edit that out. On Stack Overflow, nothing you post is immutable. — jmort253

Marc Gravell Marc Gravell · Accepted Answer · 2013-01-07T08:11:09

protobuf data is binary, and is not encoded text. You cannot run it through an encoding like UTF-8 and expect to get a string (or expect it to still be valid). The only way to convert protobuf data to a string is to run it through a base-N encode for some N, typically 64 (because it is well-supported on most platforms).

When using google protocol buffers to transfer String character,got messy code

3 Answers