CuGenDBv2

Moc04g01820 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g01820
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Serine/arginine repetitive matrix protein 1-like
Genome location: chr4:1151627..1152496
RNA-Seq Expression: Moc04g01820
Synteny: Moc04g01820
Gene Ontology terms: NA
InterPro domains: NA
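
The genome location listed above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the `chr:start..end` string uses 1-based, inclusive coordinates (an assumption; the page does not state its convention). The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:1151627..1152496")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 870 bp on chr4.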


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572349.1 hypothetical protein SDJN03_29077, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.0e-88; % identity: 68.98)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SR L  G+P + D+        +PG   T VGN+ G GTPEEETVKEVLSETPIA+PC+  +QQT   NK+ E+KVKS  S MDG  +K EEG  
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
          ++SVSE SQVTEWCSNMSESVSMATTISEQREGDEASSKQSRE+GR+ KPKIRRKRPY+GDPS+RREQR+KC TK  AE+L EKKSRVT RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF
        SR+ARTRKLN   EQ+SGV+H RRSRSPATRT R  +K  GNMKSSA   +KVT QAG+Q EA T EKRDEG  +K  DG+  QPPNESIENPLVSLECF
Subjt:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_022147830.1 uncharacterized protein LOC111016674 [Momordica charantia] (e value: 4.5e-128; % identity: 88.58)
Query:  MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE
        MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE
Subjt:  MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE

Query:  TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTKSAEVLPEKKSRVTRRYTQGTESRKARTRKLN
        TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGR                                +LPEKKSRVTRRYTQGTESRKARTRKLN
Subjt:  TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTKSAEVLPEKKSRVTRRYTQGTESRKARTRKLN

Query:  EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
        EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL

XP_022953000.1 uncharacterized protein LOC111455516 [Cucurbita moschata] (e value: 6.0e-88; % identity: 68.98)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SR L  G+P + D+        +PG   T VGN+ G GTPEEETVKEVLSETPIA+PC+  +QQT   NK+ E+KVKS  S MDG  +K EEG  
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
          ++SVSE SQVTEWCSNMSESVSMATTISEQREGDEASSKQSRE+GR+ KPKIRRKRPY+GDPS+RREQR+KC TK  AE+L EKKSRVT RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF
        SR+ARTRKLN   EQ+SGV+H RRSRSPATRT R  +K  GNMKSSA   +KVT QAG+Q EA T EKRDEG  +K  DG+  QPPNESIENPLVSLECF
Subjt:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_022968880.1 uncharacterized protein LOC111468062 [Cucurbita maxima] (e value: 1.7e-87; % identity: 68.65)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SR L  G+P + D+        +PG   T VGN+ G GTPEEETVKEVLSETPIA+PC+  +QQT   NK+ E+KVKS  S MDG  +K EEG  
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
          ++SVSE SQVTEWCSNMSESVSMATTISEQREGDEASSKQSRE+GR+ KPKIRRKRPY+GDPS+RR+QR+KC TK  AE+L EKKSRVT RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDG-TIQPPNESIENPLVSLECF
        SR+ARTRKLN   EQ+SGVSH RRSRSPAT+T R  +K  GNMKSSA   +K T QAG+Q EA T EKRDEG  +K  DG  IQPPNESIENPLVSLECF
Subjt:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDG-TIQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

XP_038887224.1 uncharacterized protein LOC120077414 [Benincasa hispida] (e value: 5.4e-89; % identity: 70.3)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SRAL  G+P + D+        +PG FDT VGN+  NGTPEEETVKEVLSETPIA+PC+  VQQT   NK+ E+KVK+  S MDG  SK EE + 
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
           VSVSETSQVTEWCSN+SES+SMATTISEQREGDEASSK SREIGR+TKPKIRRKRPY+GD S+RREQR+KC TK  AE+LPEKKSRV  RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECF
        SR+ARTRKLN    EQQSGVSH RRSRSPATRT R  +K  GNMKSS    MK+T QAGDQ E  T EK  EG  EKP D  IQPPNESIENPLVSLECF
Subjt:  SRKARTRKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K701 Uncharacterized protein (e value: 1.1e-84; % identity: 68.98)
Query:  MGCCFSRALHGGRPRVPDEN--------PGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SRAL  G+P   D+N        PG FDT V NV  N TPEEETVKEVLSETPIA+PCS  V QT  + K PE +VK+  S MDG   K EE + 
Subjt:  MGCCFSRALHGGRPRVPDEN--------PGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
           VSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSK SR+IGR+ KPKIRRKRP +G+ S+RREQR+KC TK  AE+LPEKKSRV  RY+ G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECF
        SR+ARTRKLN    EQQSGVSH RRSRSPATRT +  +K  GNMKSS    MK+T Q GDQQE  T E RDEG  EKP DG+IQPPNESIENPLVSLECF
Subjt:  SRKARTRKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

A0A1S3C0P3 uncharacterized protein LOC103495529 isoform X2 (e value: 1.4e-82; % identity: 69.7)
Query:  RALHGGRPRVPDEN--------PGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSV
        RAL  G+P   D+N        PG FDT V NV  N TPEEETVKEVLSETPIA+PCS  V+QT  + K PE KVK+  S MDG   K EE +    VSV
Subjt:  RALHGGRPRVPDEN--------PGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSV

Query:  SETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TESRKART
        SETSQVTEWCSNMSESVSMATTISEQREGDEASSK SREIGR+ KPKIRRKRP +GD S+RREQR+KC TK  AE+LPEKKSRV  RY+ G TESR+ART
Subjt:  SETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TESRKART

Query:  RKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
        RKLN    EQQSGVSH RRSRSPA RT +  +K  GNMKSS    MK+T QAGDQQE  T E RDEG  EKP DG+IQPPNESIENPLVSLECFIFL
Subjt:  RKLN----EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL

A0A6J1D181 uncharacterized protein LOC111016674 (e value: 2.2e-128; % identity: 88.58)
Query:  MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE
        MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE
Subjt:  MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSE

Query:  TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTKSAEVLPEKKSRVTRRYTQGTESRKARTRKLN
        TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGR                                +LPEKKSRVTRRYTQGTESRKARTRKLN
Subjt:  TSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTKSAEVLPEKKSRVTRRYTQGTESRKARTRKLN

Query:  EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
        EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
Subjt:  EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL

A0A6J1GNG7 uncharacterized protein LOC111455516 (e value: 2.9e-88; % identity: 68.98)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SR L  G+P + D+        +PG   T VGN+ G GTPEEETVKEVLSETPIA+PC+  +QQT   NK+ E+KVKS  S MDG  +K EEG  
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
          ++SVSE SQVTEWCSNMSESVSMATTISEQREGDEASSKQSRE+GR+ KPKIRRKRPY+GDPS+RREQR+KC TK  AE+L EKKSRVT RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF
        SR+ARTRKLN   EQ+SGV+H RRSRSPATRT R  +K  GNMKSSA   +KVT QAG+Q EA T EKRDEG  +K  DG+  QPPNESIENPLVSLECF
Subjt:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGT-IQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

A0A6J1HYF2 uncharacterized protein LOC111468062 (e value: 8.4e-88; % identity: 68.65)
Query:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV
        MGCC SR L  G+P + D+        +PG   T VGN+ G GTPEEETVKEVLSETPIA+PC+  +QQT   NK+ E+KVKS  S MDG  +K EEG  
Subjt:  MGCCFSRALHGGRPRVPDE--------NPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVV

Query:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE
          ++SVSE SQVTEWCSNMSESVSMATTISEQREGDEASSKQSRE+GR+ KPKIRRKRPY+GDPS+RR+QR+KC TK  AE+L EKKSRVT RYT G TE
Subjt:  SVSVSVSETSQVTEWCSNMSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTK-SAEVLPEKKSRVTRRYTQG-TE

Query:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDG-TIQPPNESIENPLVSLECF
        SR+ARTRKLN   EQ+SGVSH RRSRSPAT+T R  +K  GNMKSSA   +K T QAG+Q EA T EKRDEG  +K  DG  IQPPNESIENPLVSLECF
Subjt:  SRKARTRKLN---EQQSGVSHVRRSRSPATRTARGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDG-TIQPPNESIENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTTGTTGCTTTAGCAGAGCCTTGCACGGAGGTCGGCCTCGTGTCCCGGACGAAAATCCGGGACCGTTCGACACCCATGTAGGAAACGTCTCCGGCAACGGGACCCC
GGAAGAAGAAACTGTTAAAGAGGTCCTCTCAGAGACGCCCATTGCCAGGCCATGCAGCGTACAGGTACAACAAACAGAGCAAAACAACAAAACTCCTGAGATGAAAGTAA
AATCCTTGCCAAGTCCAATGGATGGTTTGCCCAGCAAAGCGGAGGAAGGGGTAGTTTCAGTTTCAGTTTCAGTTTCAGAAACATCTCAGGTTACAGAATGGTGTAGCAAT
ATGAGCGAGAGCGTTTCAATGGCCACCACCATTTCGGAGCAGAGAGAAGGCGACGAAGCATCGAGCAAACAGAGTAGAGAAATTGGTCGGAGTACGAAACCAAAGATTCG
CAGAAAGCGTCCATACGCCGGCGACCCGTCGTGGCGGAGAGAACAGAGAGAGAAATGTCCGACCAAGAGTGCTGAAGTTTTACCGGAGAAGAAATCTCGTGTCACTCGCA
GGTACACGCAGGGGACGGAATCAAGAAAGGCGAGGACCAGGAAGCTTAACGAGCAACAATCCGGAGTCAGCCATGTCCGGCGTTCGAGGTCGCCGGCTACTCGAACAGCC
AGAGGAATGAGTAAGTTCCAGGGGAACATGAAAAGCAGTGCCATGACTCCCATGAAAGTGACTGCACAAGCTGGGGACCAGCAAGAGGCAGCGACCGCCGAGAAAAGAGA
CGAAGGAACGGCGGAGAAGCCGCCGGACGGTACAATTCAGCCCCCAAATGAATCAATAGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
mRNA sequence
ATGGGTTGTTGCTTTAGCAGAGCCTTGCACGGAGGTCGGCCTCGTGTCCCGGACGAAAATCCGGGACCGTTCGACACCCATGTAGGAAACGTCTCCGGCAACGGGACCCC
GGAAGAAGAAACTGTTAAAGAGGTCCTCTCAGAGACGCCCATTGCCAGGCCATGCAGCGTACAGGTACAACAAACAGAGCAAAACAACAAAACTCCTGAGATGAAAGTAA
AATCCTTGCCAAGTCCAATGGATGGTTTGCCCAGCAAAGCGGAGGAAGGGGTAGTTTCAGTTTCAGTTTCAGTTTCAGAAACATCTCAGGTTACAGAATGGTGTAGCAAT
ATGAGCGAGAGCGTTTCAATGGCCACCACCATTTCGGAGCAGAGAGAAGGCGACGAAGCATCGAGCAAACAGAGTAGAGAAATTGGTCGGAGTACGAAACCAAAGATTCG
CAGAAAGCGTCCATACGCCGGCGACCCGTCGTGGCGGAGAGAACAGAGAGAGAAATGTCCGACCAAGAGTGCTGAAGTTTTACCGGAGAAGAAATCTCGTGTCACTCGCA
GGTACACGCAGGGGACGGAATCAAGAAAGGCGAGGACCAGGAAGCTTAACGAGCAACAATCCGGAGTCAGCCATGTCCGGCGTTCGAGGTCGCCGGCTACTCGAACAGCC
AGAGGAATGAGTAAGTTCCAGGGGAACATGAAAAGCAGTGCCATGACTCCCATGAAAGTGACTGCACAAGCTGGGGACCAGCAAGAGGCAGCGACCGCCGAGAAAAGAGA
CGAAGGAACGGCGGAGAAGCCGCCGGACGGTACAATTCAGCCCCCAAATGAATCAATAGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
Protein sequence
MGCCFSRALHGGRPRVPDENPGPFDTHVGNVSGNGTPEEETVKEVLSETPIARPCSVQVQQTEQNNKTPEMKVKSLPSPMDGLPSKAEEGVVSVSVSVSETSQVTEWCSN
MSESVSMATTISEQREGDEASSKQSREIGRSTKPKIRRKRPYAGDPSWRREQREKCPTKSAEVLPEKKSRVTRRYTQGTESRKARTRKLNEQQSGVSHVRRSRSPATRTA
RGMSKFQGNMKSSAMTPMKVTAQAGDQQEAATAEKRDEGTAEKPPDGTIQPPNESIENPLVSLECFIFL
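
The first and last codons of the CDS sequence can be checked against the protein sequence by translating them. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (an assumption; the page does not name a translation table). The helper `translate` is hypothetical, and the two test fragments are copied from the start and end of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 21 nt of the CDS sequence listed above.
print(translate("ATGGGTTGTTGCTTTAGC"))
print(translate("GAATGTTTCATCTTTCTGTAG"))
```

The two fragments translate to `MGCCFS` and `ECFIFL`, which match the start and end of the protein sequence listed above.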