Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix marshaling of any types to JSON when contained object has custom marshaling #361

Merged
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Next Next commit
fix marshaling of any types to JSON when contained object has custom …
…marshaling/unmarshaling
  • Loading branch information
jhump committed May 26, 2017
commit 03a02321e7c9e401cb002dd325f187bf45c728ff
70 changes: 57 additions & 13 deletions jsonpb/jsonpb.go
Original file line number Diff line number Diff line change
Expand Up @@ -77,15 +77,26 @@ type Marshaler struct {
// JSONPBMarshaler is implemented by protobuf messages that customize the
// way they are marshaled to JSON. Messages that implement this should
// also implement JSONPBUnmarshaler so that the custom format can be
// parsed.
// parsed. If this method produces JSON other than an object (for example,
// produces an encoded JSON string), the type must also implement another
// method:
// XXX_CustomJSON() bool
// If this method is implemented and returns true when invoked, then the
// message is allowed to marshal itself to JSON types other than objects.
type JSONPBMarshaler interface {
MarshalJSONPB(*Marshaler) ([]byte, error)
}

// JSONPBUnmarshaler is implemented by protobuf messages that customize
// the way they are unmarshaled from JSON. Messages that implement this
// should also implement JSONPBMarshaler so that the custom format can be
// produced.
// produced. If this method expects to unmarshal itself from JSON other
// than an object (for example, unmarshals from an encoded string), the
// type must also implement another method:
// XXX_CustomJSON() bool
// If this method is implemented and returns true when invoked, then the
// message is allowed to unmarshal itself from JSON types other than
// objects.
type JSONPBUnmarshaler interface {
UnmarshalJSONPB(*Unmarshaler, []byte) error
}
Expand Down Expand Up @@ -116,13 +127,43 @@ type wkt interface {
XXX_WellKnownType() string
}

// customJSON can be implemented by messages that have custom non-object JSON
// representations. If the message implements this method and the method
// returns true, the JSON representation will not be an object (for example,
// the message marshals itself as a string).
type customJSON interface {
XXX_CustomJSON() bool
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suspect there will be opinions on what this method should be named, and possibly even whether this should even be supported (custom message type that can marshal itself into custom JSON).

The main case I can think of is a dynamic message that happens to have a well-known type as a nested field. In that case, depending on implementation, it may be possible that the well-known type is actually represented via a dynamic message vs. the generated message type. Perhaps there are others. Possibly worth pointing out that the doc for Any just talks about messages with custom JSON and doesn't specifically restrict this to only well-known types.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, the doc doesn't explain in good detail about what it means for a "value that has special JSON mapping" in order to serialize into the form {"@type": xxx, "value": yyy}. I'd like to take it that "special JSON mapping" refers to WKTs and that is already handled in marshalAny. I wish that all Any marshals into that form.

I'd like to think that a dynamic message should marshal into JSON object. This would keep it simpler. This means that if it did not marshal into a JSON object, we can error out for now. And if it did, then it would be easier to just add in "@type" key-value.

I've posted this question to [email protected]. Not sure if we'll hear back soon. But let me know what you think of my suggestion above for keeping the assumptions simpler for now.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did see your question a few moments ago.

Sure, I can cut this stuff out for now. It's not terribly complicated to add back later if/when a more concrete need arises.

}

// marshalObject writes a struct to the Writer.
func (m *Marshaler) marshalObject(out *errWriter, v proto.Message, indent, typeURL string) error {
if jsm, ok := v.(JSONPBMarshaler); ok {
b, err := jsm.MarshalJSONPB(m)
if err != nil {
return err
}
if typeURL != "" {
// we are marshaling this object to an Any type
var js interface{}
if err = json.Unmarshal(b, &js); err != nil {
return fmt.Errorf("Type %v produced invalid JSON: %v", reflect.TypeOf(v), err)
}
m, ok := js.(map[string]interface{});
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Use map[string]*json.RawMessage instead?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I cut out the code that supports non-object JSON types, then, yes, I can unmarshal into that type (which will certainly be more efficient to parse and then to re-marshal below).

The current formulation is because I didn't know apriori whether the JSON produced will be an object or something else. So I have to supply an interface{} to json.Unmarshal, in which case the type of the value will be map[string]interface{} if it was an object/hash.

if ok {
m["@type"] = typeURL
} else {
if cj, ok := v.(customJSON); !ok || !cj.XXX_CustomJSON() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As mentioned above, I'm not sure if this is necessary. I'd suggest error'ing here for now. But if you think this is something needed, then I think you can simply take the RawMessage and set it to "value", right?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As mentioned above, I'm not sure if this is necessary. I'd suggest error'ing here for now.

Okay, I'll tweak this branch to remove the customJson stuff and push changes.

But if you think this is something needed, then I think you can simply take the RawMessage and set it to "value", right?

I think that's what the code is doing below. But perhaps I'm not following this remark. (Though it will be moot after removing the customJson stuff.)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree. Remark is moot when going with the assumption that a dynamic message should produce a JSON object. And we should go with that.

Sorry for lack of explanation on my end. My previous comment was if it did not produce a JSON object, I'd assume that it should still produce a valid JSON value and hence you can simply set the bytes output from MarshalJSONPB to "value".

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, I see what you meant. So, were I to have kept this code, basically change this:

m["value"] = js

to this:

m["value"] = (*json.RawMessage)(&b)

Anyhow, removed all of this code; but, now that I understand the suggestion, yes, that would have been a better way to do it! :)

return fmt.Errorf("Type %v illegally produced JSON that is not an object", reflect.TypeOf(v))
}
m = map[string]interface{}{}
m["@type"] = typeURL
m["value"] = js
}
if b, err = json.Marshal(m); err != nil {
return err
}
}

out.write(string(b))
return out.err
}
Expand Down Expand Up @@ -441,9 +482,6 @@ func (m *Marshaler) marshalValue(out *errWriter, prop *proto.Properties, v refle

// Handle well-known types.
// Most are handled up in marshalObject (because 99% are messages).
type wkt interface {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for these cleanups.

XXX_WellKnownType() string
}
if wkt, ok := v.Interface().(wkt); ok {
switch wkt.XXX_WellKnownType() {
case "NullValue":
Expand Down Expand Up @@ -611,9 +649,6 @@ func (u *Unmarshaler) unmarshalValue(target reflect.Value, inputValue json.RawMe
}

// Handle well-known types.
type wkt interface {
XXX_WellKnownType() string
}
if w, ok := target.Addr().Interface().(wkt); ok {
switch w.XXX_WellKnownType() {
case "DoubleValue", "FloatValue", "Int64Value", "UInt64Value",
Expand All @@ -633,13 +668,13 @@ func (u *Unmarshaler) unmarshalValue(target reflect.Value, inputValue json.RawMe
}

val, ok := jsonFields["@type"]
if !ok {
if !ok || val == nil {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for catching this. My mistake, I missed this one when I changed jsonFields to use map[string]*RawMessage.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I had actually made the same change locally in my branch, in order for my tests to pass. After I rebased to latest master and reconciled with your fix, I figured I'd leave this change in :)

return errors.New("Any JSON doesn't have '@type'")
}

var turl string
if err := json.Unmarshal([]byte(*val), &turl); err != nil {
return fmt.Errorf("can't unmarshal Any's '@type': %q", val)
return fmt.Errorf("can't unmarshal Any's '@type': %q", *val)
}
target.Field(0).SetString(turl)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just realized that we should also check if turl is empty.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's probably okay without such a check. What will happen is that this method will end up returning an error like unknown message type "", which seems reasonable that it's saying the empty string is not valid as the type URL.


Expand All @@ -653,14 +688,23 @@ func (u *Unmarshaler) unmarshalValue(target reflect.Value, inputValue json.RawMe
}

m := reflect.New(mt.Elem()).Interface().(proto.Message)
hasCustomJson := false
if _, ok := m.(wkt); ok {
hasCustomJson = true
} else if _, ok := m.(JSONPBUnmarshaler); ok {
// custom JSON unmarshallers may expect non-object JSON type
if cj, ok := m.(customJSON); ok && cj.XXX_CustomJSON() {
hasCustomJson = true
}
}
if hasCustomJson {
val, ok := jsonFields["value"]
if !ok {
return errors.New("Any JSON doesn't have 'value'")
}

if err := u.unmarshalValue(reflect.ValueOf(m).Elem(), *val, nil); err != nil {
return fmt.Errorf("can't unmarshal Any's WKT: %v", err)
return fmt.Errorf("can't unmarshal Any nested proto %v: %v", reflect.TypeOf(m), err)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor. Use %T instead of reflect.TypeOf.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

}
} else {
delete(jsonFields, "@type")
Expand All @@ -670,13 +714,13 @@ func (u *Unmarshaler) unmarshalValue(target reflect.Value, inputValue json.RawMe
}

if err = u.unmarshalValue(reflect.ValueOf(m).Elem(), nestedProto, nil); err != nil {
return fmt.Errorf("can't unmarshal nested Any proto: %v", err)
return fmt.Errorf("can't unmarshal Any nested proto %v: %v", reflect.TypeOf(m), err)
}
}

b, err := proto.Marshal(m)
if err != nil {
return fmt.Errorf("can't marshal proto into Any.Value: %v", err)
return fmt.Errorf("can't marshal proto %v into Any.Value: %v", reflect.TypeOf(m), err)
}
target.Field(1).SetBytes(b)

Expand Down
144 changes: 140 additions & 4 deletions jsonpb/jsonpb_test.go
Original file line number Diff line number Diff line change
Expand Up @@ -34,6 +34,7 @@ package jsonpb
import (
"bytes"
"encoding/json"
"fmt"
"io"
"reflect"
"strings"
Expand All @@ -43,6 +44,7 @@ import (

pb "github.com/golang/protobuf/jsonpb/jsonpb_test_proto"
proto3pb "github.com/golang/protobuf/proto/proto3_proto"
"github.com/golang/protobuf/ptypes"
anypb "github.com/golang/protobuf/ptypes/any"
durpb "github.com/golang/protobuf/ptypes/duration"
stpb "github.com/golang/protobuf/ptypes/struct"
Expand Down Expand Up @@ -454,6 +456,55 @@ func TestMarshalingWithJSONPBMarshaler(t *testing.T) {
}
}

func TestMarshalingToAnyWithJSONPBMarshaler(t *testing.T) {
msg := dynamicMessage{rawJson: `{ "foo": "bar", "baz": [0, 1, 2, 3] }`}
a, err := ptypes.MarshalAny(&msg)
if err != nil {
t.Errorf("an unexpected error occurred when marshalling to Any: %v", err)
}
str, err := new(Marshaler).MarshalToString(a)
if err != nil {
t.Errorf("an unexpected error occurred when marshalling Any to JSON: %v", err)
}
// after custom marshaling, it's round-tripped through JSON decoding/encoding already,
// so the keys are sorted, whitespace is compacted, and "@type" key has been added
expected := `{"@type":"type.googleapis.com/` + dynamicMessageName +`","baz":[0,1,2,3],"foo":"bar"}`
if str != expected {
t.Errorf("marshalling JSON produced incorrect output: got %s, wanted %s", str, expected)
}
}

func TestMarshalingToAnyWithJSONPBMarshalerCustomJSON(t *testing.T) {
// first try will fail because we're producing invalid JSON object
var msg proto.Message = &dynamicMessage{rawJson: `"fubar"`}
a, err := ptypes.MarshalAny(msg)
if err != nil {
t.Errorf("an unexpected error occurred when marshalling to Any: %v", err)
}
str, err := new(Marshaler).MarshalToString(a)
if err == nil || !strings.Contains(err.Error(), "illegally produced JSON that is not an object") {
t.Errorf(`expected error message "illegally produced JSON that is not an object" but instead got %v`, err)
}

// now we try, but with custom marshaling that allows non-object JSON type
msg = &dynamicMessageCustomJson{dynamicMessage{rawJson: `"fubar"`}}
a, err = ptypes.MarshalAny(msg)
if err != nil {
t.Errorf("an unexpected error occurred when marshalling to Any: %v", err)
}
str, err = new(Marshaler).MarshalToString(a)
if err != nil {
t.Errorf("an unexpected error occurred when marshalling Any to JSON: %v", err)
}

// after custom marshaling, it's round-tripped through JSON decoding/encoding already,
// so the keys are sorted, whitespace is compacted, and "@type" key has been added
expected := `{"@type":"type.googleapis.com/` + dynamicMessageCustomJsonName +`","value":"fubar"}`
if str != expected {
t.Errorf("marshalling JSON produced incorrect output: got %s, wanted %s", str, expected)
}
}

var unmarshalingTests = []struct {
desc string
unmarshaler Unmarshaler
Expand Down Expand Up @@ -654,15 +705,62 @@ func TestUnmarshalingBadInput(t *testing.T) {
func TestUnmarshalWithJSONPBUnmarshaler(t *testing.T) {
rawJson := `{ "foo": "bar", "baz": [0, 1, 2, 3] }`
var msg dynamicMessage
err := Unmarshal(strings.NewReader(rawJson), &msg)
if err != nil {
if err := Unmarshal(strings.NewReader(rawJson), &msg); err != nil {
t.Errorf("an unexpected error occurred when parsing into JSONPBUnmarshaler: %v", err)
}
if msg.rawJson != rawJson {
t.Errorf("message contents not set correctly after unmarshalling JSON: got %s, wanted %s", msg.rawJson, rawJson)
}
}

func TestUnmarshalAnyWithJSONPBUnmarshaler(t *testing.T) {
rawJson := `{ "@type": "blah.com/` + dynamicMessageName + `", "foo": "bar", "baz": [0, 1, 2, 3] }`
var a anypb.Any
if err := Unmarshal(strings.NewReader(rawJson), &a); err != nil {
t.Errorf("an unexpected error occurred when parsing into JSONPBUnmarshaler: %v", err)
}
var msg ptypes.DynamicAny
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about constructing an expected Any and compared directly using proto.Equal?

d := &dynamicMessage{rawJson: {"baz":[0,1,2,3],"foo":"bar"}}
b, err := proto.Marshal(d)
if err != nil {
...
}
want := &anypb.Any{
TypeURL: "blah.com/+ dynamicMessageName +",
Value: b,
}

if proto.Equal(got, want) ...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

if err := ptypes.UnmarshalAny(&a, &msg); err != nil {
t.Errorf("an unexpected error occurred when parsing from Any type: %v", err)
}
dm := msg.Message.(*dynamicMessage)
// when it gets to the custom unmarshal, it's round-tripped through JSON decoding/encoding
// already, so the keys are sorted, whitespace is compacted, and "@type" key has been removed
expected := `{"baz":[0,1,2,3],"foo":"bar"}`
if dm.rawJson != expected {
t.Errorf("message contents not set correctly after unmarshalling JSON: got %s, wanted %s", dm.rawJson, expected)
}
}

func TestUnmarshalAnyWithJSONPBUnmarshalerCustomJSON(t *testing.T) {
// with custom JSON, we unmarshal contents at "value" key instead of whole JSON object
rawJson := `{ "@type": "blah.com/` + dynamicMessageCustomJsonName + `", "value": "fubar"}`
var a anypb.Any
if err := Unmarshal(strings.NewReader(rawJson), &a); err != nil {
t.Errorf("an unexpected error occurred when parsing into JSONPBUnmarshaler: %v", err)
}
var msg ptypes.DynamicAny
if err := ptypes.UnmarshalAny(&a, &msg); err != nil {
t.Errorf("an unexpected error occurred when parsing from Any type: %v", err)
}
dm := msg.Message.(*dynamicMessageCustomJson)
// when it gets to the custom unmarshal, it's round-tripped through JSON decoding/encoding
// already, so the keys are sorted, whitespace is compacted, and "@type" key has been removed
if dm.rawJson != `"fubar"` {
t.Errorf("message contents not set correctly after unmarshalling JSON: got %s, wanted %s", dm.rawJson, `"fubar"`)
}
}

const (
dynamicMessageName = "google.protobuf.jsonpb.testing.dynamicMessage"
dynamicMessageCustomJsonName = "google.protobuf.jsonpb.testing.dynamicMessageCustomJson"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not used. Remove?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oops. Didn't mean to leave that in there. Now gone.

)
func init() {
// we register the custom types below so that we can use them in Any types
proto.RegisterType((*dynamicMessage)(nil), dynamicMessageName)
proto.RegisterType((*dynamicMessageCustomJson)(nil), dynamicMessageCustomJsonName)
}

// dynamicMessage implements protobuf.Message but is not a normal generated message type.
// It provides implementations of JSONPBMarshaler and JSONPBUnmarshaler for JSON support.
type dynamicMessage struct {
Expand All @@ -684,7 +782,45 @@ func (m *dynamicMessage) MarshalJSONPB(jm *Marshaler) ([]byte, error) {
return []byte(m.rawJson), nil
}

func (m *dynamicMessage) UnmarshalJSONPB(jum *Unmarshaler, json []byte) error {
m.rawJson = string(json)
func (m *dynamicMessage) UnmarshalJSONPB(jum *Unmarshaler, js []byte) error {
m.rawJson = string(js)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Optional. Perhaps change rawJson's type to []byte instead of string, then there'll be fewer type casting around.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I looked at doing this, but it would actually result in more casts, not fewer (6 casts instead of just 2).

return nil
}

func (m *dynamicMessage) Marshal() ([]byte, error) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this necessary? If field is exported, I think it'll marshal and unmarshal correctly. I'm ok with this, just curious though.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're right. If I add a properly formatted struct tag to the field, this is no longer necessary. Done.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah yes, it's the protobuf tag instead.

var b proto.Buffer
// tag #1, wire type length-delimited
if err := b.EncodeVarint(uint64((1 << 3) | proto.WireBytes)); err != nil {
return nil, err
}
if err := b.EncodeRawBytes([]byte(m.rawJson)); err != nil {
return nil, err
}
return b.Bytes(), nil
}

func (m *dynamicMessage) Unmarshal(b []byte) error {
// really dumb unmarshalling, just for test
buf := proto.NewBuffer(b)
if tagAndWire, err := buf.DecodeVarint(); err != nil {
return err
} else if tagAndWire != uint64((1 << 3) | proto.WireBytes) {
return fmt.Errorf("Unknown tag/wire type: %x", tagAndWire)
}
if bb, err := buf.DecodeRawBytes(true); err != nil {
return err
} else {
m.rawJson = string(bb)
}
return nil
}

// dynamicMessageCustomJson is the same as dynamicMessage except it also provides
// XXX_CustomJSON method, for marshaling to non-object JSON types.
type dynamicMessageCustomJson struct {
dynamicMessage
}

func (m *dynamicMessageCustomJson) XXX_CustomJSON() bool {
return true
}