CuGenDBv2

Moc04g31730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g31730
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BEST Arabidopsis thaliana protein match is: embryo defective 2170.
Genome location: chr4:23844391..23845473
RNA-Seq Expression: Moc04g31730
Synteny: Moc04g31730
Gene Ontology terms: NA
InterPro domains: NA
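
The genome location above uses a `chromosome:start..end` form. A minimal sketch of splitting it into its parts follows. The function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'chr:start..end' into (chromosome, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chromosome, start, end, end - start + 1


chromosome, start, end, span = parse_location("chr4:23844391..23845473")
print(chromosome, start, end, span)
```

Under that coordinate assumption, the locus spans 1083 bp on chr4.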


Homology
GenBank top hits | e value | % identity | Alignment
KAG7032834.1 hypothetical protein SDJN02_06884, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.4e-80 | % identity: 65.56
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +     D++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       S+E + DS SETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNGSSRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

XP_004144253.1 uncharacterized protein LOC101219576 [Cucumis sativus] | e value: 7.3e-85 | % identity: 67.86
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQF-HRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   KLP      PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQF HR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQF-HRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  + D +  +RDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTS     SRN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE

XP_008441584.1 PREDICTED: uncharacterized protein LOC103485667 [Cucumis melo] | e value: 1.3e-86 | % identity: 68.21
Query:  MGDPRPKLPRNPI-RSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   K P  PI  +PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFHR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPI-RSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  +   +  SRDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTS     SRN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE

XP_022989800.1 uncharacterized protein LOC111486878 [Cucurbita maxima] | e value: 4.2e-80 | % identity: 65.56
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +     D++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       S+E + DSTSETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNG SRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

XP_038885358.1 uncharacterized protein LOC120075766 [Benincasa hispida] | e value: 1.6e-87 | % identity: 68.33
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM
        MGD R K PR    +PDS   +  +   ED I+FR      A+D++SQSESG  SPTLWGSNSR+SPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM

Query:  PEASYELSLKDLVEYH-----RVANPDTSIES--RDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGS
        PE+SYELSLKDLVE+H     R  + D +  S  RDDS+SETSFRRDSSK  +ETR LVTRSRSVDSGGFYLKMF PLPFGQ+SAKKK ++  DSGL+G 
Subjt:  PEASYELSLKDLVEYH-----RVANPDTSIES--RDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGS

Query:  SRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE
        SRVSPKPP V        ++DWWRKRS++A GE+EGS+SGGSM SSGSSNSTS     SRN+ESQGSCWFCISP+R+KD E
Subjt:  SRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KCW4 Uncharacterized protein | e value: 3.5e-85 | % identity: 67.86
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQF-HRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   KLP      PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQF HR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQF-HRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  + D +  +RDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTS     SRN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE

A0A1S3B3A7 uncharacterized protein LOC103485667 | e value: 6.5e-87 | % identity: 68.21
Query:  MGDPRPKLPRNPI-RSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   K P  PI  +PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFHR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPI-RSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  +   +  SRDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTS     SRN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTS-----SRNAESQGSCWFCISPIRTKDPE

A0A6J1FJY2 uncharacterized protein LOC111444710 | e value: 6.5e-79 | % identity: 71.43
Query:  AEDGIVFR-----AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYH----RVANPDTS
        AED I+FR     A D++SQSESG CSPTLWGSNSR++ QFHRPRNRSLSPTSR QAIARGQQELMEMVRNMPE+SYELSLKDLVE+H       + D S
Subjt:  AEDGIVFR-----AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYH----RVANPDTS

Query:  IESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAA
        + SRDDS SETSFRRD+SK  +ETRALVTRSRSVDSGGFYLKMF PLPFGQ+SAKKKR++  DSGLN SSRVSPKPP V        DR+WWRKRS    
Subjt:  IESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAA

Query:  GESEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE
          +EGSVSGG   SS S++S  SRN+ESQG CWFCISP+R+KDPE
Subjt:  GESEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE

A0A6J1HEJ4 uncharacterized protein LOC111462165 | e value: 4.5e-80 | % identity: 65.56
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +     D++SQSESG CSPTLWGS+SR +PQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       SIE + DS SETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P G+IS KKK +V +DS LNGSSRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

A0A6J1JND4 uncharacterized protein LOC111486878 | e value: 2.0e-80 | % identity: 65.56
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +     D++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       S+E + DSTSETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNG SRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G21390.1 embryo defective 2170 | e value: 4.2e-22 | % identity: 38.33
Query:  IVFRAVDEESQSESGACSPTLW-GSNSRSSPQFHRPRNR-SLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSET
        + F ++  +  S+SG CSPTLW  S  +S P FHRP +  SLSP S+ QAIARGQ+ELMEMV  MPE+ YELSLKDLVE     N +   +  D+     
Subjt:  IVFRAVDEESQSESGACSPTLW-GSNSRSSPQFHRPRNR-SLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSET

Query:  SFRRDSSKKT-AETRALVTRSRSVDSGGFYLKMFFPLPFGQI--SAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVS
        + +    +KT ++ R    RS   ++ GF LK+ F +  G +  + KKK+    D     + +VSP+P  + + + K  D++WW + S     ES    S
Subjt:  SFRRDSSKKT-AETRALVTRSRSVDSGGFYLKMFFPLPFGQI--SAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVS

Query:  GGSMTSSGSSNSTSSRNA--ESQGSCW
        G    SS S+NS  SR++  + + SC+
Subjt:  GGSMTSSGSSNSTSSRNA--ESQGSCW

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1) | e value: 2.6e-19 | % identity: 35.51
Query:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT
        +  S+SG CSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPE+ YELSLKDLVE +           + +       R+  S K 
Subjt:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT

Query:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWWR------KRSSLAAGESEGSVSGGSM
         +      R+  V++ GF LK+ FP+  G  + + KKK +  +DS +    S +S   P + D S K  D+DWW+      +RS           S  S 
Subjt:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWWR------KRSSLAAGESEGSVSGGSM

Query:  TSSGSSNSTSSRNA
         SS  SNS  SRN+
Subjt:  TSSGSSNSTSSRNA

AT1G76980.2 FUNCTIONS IN: molecular_function unknown | e value: 1.5e-19 | % identity: 35.11
Query:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT
        +  S+SG CSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPE+ YELSLKDLVE +           + +       R+  S K 
Subjt:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT

Query:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWW-------RKRSSLAAGESEGS--VSG
         +      R+  V++ GF LK+ FP+  G  + + KKK +  +DS +    S +S   P + D S K  D+DWW       R+  S+ +  + GS   SG
Subjt:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWW-------RKRSSLAAGESEGS--VSG

Query:  GSMTSSGSSNSTSSRNAESQGSCWF
        GS + S S  S +S   E++G   F
Subjt:  GSMTSSGSSNSTSSRNAESQGSCWF
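
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, percent identity for one block can be recomputed from the query line and that middle line alone. The function name `midline_identity` is hypothetical, and the input is only the final fragment of the AT1G76980.1 alignment, so the result is not expected to equal the whole-alignment figure reported for that hit.

```python
def midline_identity(query, midline):
    """Return (identical_columns, alignment_length, percent_identity).

    Letters in the midline mark identical residues; '+' and spaces do not
    count. Alignment length is the query line length, gaps included.
    """
    # Trailing spaces of the midline are often lost when text is copied.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(ch.isalpha() for ch in midline)
    length = len(query)
    return identical, length, 100.0 * identical / length


# Final fragment of the AT1G76980.1 alignment, label prefix removed.
query = "TSSGSSNSTSSRNA"
midline = " SS  SNS  SRN+"
identical, length, percent = midline_identity(query, midline)
print(identical, length, round(percent, 2))
```

Applying the same count across every block of a hit, then dividing total identical columns by total alignment length, should approximate the percent identity listed for that hit.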


Sequences
CDS sequence
ATGGGCGATCCTAGGCCGAAACTCCCAAGAAATCCTATCCGTAGCCCAGATTCTGTCGCGCCTCGAGAGCAGGAGTTTGCGGCCGAAGACGGCATCGTTTTCAGGGCCGT
GGATGAAGAGTCGCAATCCGAATCAGGAGCCTGTTCCCCCACGCTCTGGGGCTCCAATTCTCGGAGCAGCCCCCAATTTCACCGCCCGCGTAATCGGAGCCTCTCCCCAA
CTTCGCGGATCCAAGCCATAGCCCGCGGTCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGGCCTCCTACGAGCTCTCTCTCAAAGATCTCGTCGAGTACCAC
CGCGTCGCTAATCCCGATACCTCTATTGAAAGCAGAGACGATTCCACCTCCGAAACTTCCTTCAGAAGAGACTCCAGCAAGAAGACGGCTGAAACCAGAGCGCTGGTCAC
CCGGAGTAGAAGCGTCGACAGCGGCGGATTTTACCTCAAAATGTTCTTCCCGCTGCCCTTCGGCCAGATTTCGGCCAAAAAGAAGAGGAGTGTTGGGAACGATTCGGGGT
TGAATGGCAGTTCGAGAGTGTCTCCTAAGCCGCCGCTGGTGAGGGACGGATCTGGAAAGGGCGCGGACAGAGACTGGTGGAGGAAGAGATCGTCGCTGGCCGCCGGCGAG
AGCGAGGGCAGCGTCTCCGGCGGAAGCATGACGAGCAGCGGCAGTAGCAACAGCACGAGCAGCAGGAATGCAGAATCTCAAGGGAGTTGCTGGTTTTGTATCAGTCCAAT
CAGAACTAAAGATCCAGAGTAA
mRNA sequence
ATGGGCGATCCTAGGCCGAAACTCCCAAGAAATCCTATCCGTAGCCCAGATTCTGTCGCGCCTCGAGAGCAGGAGTTTGCGGCCGAAGACGGCATCGTTTTCAGGGCCGT
GGATGAAGAGTCGCAATCCGAATCAGGAGCCTGTTCCCCCACGCTCTGGGGCTCCAATTCTCGGAGCAGCCCCCAATTTCACCGCCCGCGTAATCGGAGCCTCTCCCCAA
CTTCGCGGATCCAAGCCATAGCCCGCGGTCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGGCCTCCTACGAGCTCTCTCTCAAAGATCTCGTCGAGTACCAC
CGCGTCGCTAATCCCGATACCTCTATTGAAAGCAGAGACGATTCCACCTCCGAAACTTCCTTCAGAAGAGACTCCAGCAAGAAGACGGCTGAAACCAGAGCGCTGGTCAC
CCGGAGTAGAAGCGTCGACAGCGGCGGATTTTACCTCAAAATGTTCTTCCCGCTGCCCTTCGGCCAGATTTCGGCCAAAAAGAAGAGGAGTGTTGGGAACGATTCGGGGT
TGAATGGCAGTTCGAGAGTGTCTCCTAAGCCGCCGCTGGTGAGGGACGGATCTGGAAAGGGCGCGGACAGAGACTGGTGGAGGAAGAGATCGTCGCTGGCCGCCGGCGAG
AGCGAGGGCAGCGTCTCCGGCGGAAGCATGACGAGCAGCGGCAGTAGCAACAGCACGAGCAGCAGGAATGCAGAATCTCAAGGGAGTTGCTGGTTTTGTATCAGTCCAAT
CAGAACTAAAGATCCAGAGTAA
Protein sequence
MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYH
RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGE
SEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE
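
The protein sequence above should correspond to a direct translation of the CDS. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1). The function name `translate` is hypothetical, and only the first 30 bases and the final 21 in-frame bases of the CDS are used here, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGGCGATCCTAGGCCGAAACTCCCAAGA"))
# Final 21 in-frame bases of the CDS, ending in the TAA stop codon.
print(translate("AGAACTAAAGATCCAGAGTAA"))
```

Passing the full CDS with its line breaks intact works the same way, since whitespace is stripped before translation.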