region features can't match text features while calculating similarity

Hi, thanks for your contributive work.
I'm trying some demos based on your pretrained checkpoints. And I followed your settings in modeling.meta_arch.PretrainFastRCNN which I believe is your base model for pretraing.
According to your codes in self.get_region_features and self.region_concept_matching, we can get image features and text features, respectively. But the dimension of text features is fixed to 1024 while of image features is 2048, because of your design of ModifiedResNet. 
def _shared_roi_transform(self, features, boxes, backbone_res5):
        x = self.pooler(features, boxes)
        return backbone_res5(x)
In this function above, features are 1024-d and x would be 1024-d also, backbone_res5 is backbone.layer4, and output will be 2048-d.
I didn't find any transformation before you calculated the similarity of concepts and regions in region_concept_matching.
So, could you please help me about this issue, thanks again.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

region features can't match text features while calculating similarity #95

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

region features can't match text features while calculating similarity #95

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions