; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1306 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1306
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationMC04:21081290..21082761
RNA-Seq ExpressionMC04g1306
SyntenyMC04g1306
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032834.1 hypothetical protein SDJN02_06884, partial [Cucurbita argyrosperma subsp. argyrosperma]1.03e-10165.68Show/hide
Query:  AMGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM
        AMGD   K PRNPI +            AED I+FR +     D++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNM
Subjt:  AMGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAV-----DEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM

Query:  PEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKP
        PE+SYELSLKDLVE+H       S+E + DS SETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNGSSRVSPKP
Subjt:  PEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKP

Query:  PLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
        P V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  PLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

XP_004144253.1 uncharacterized protein LOC101219576 [Cucumis sativus]3.88e-10767.86Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFH-RPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   KLP      PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFH R RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFH-RPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  + D +  +RDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTSS     RN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE

XP_008441584.1 PREDICTED: uncharacterized protein LOC103485667 [Cucumis melo]2.35e-10968.21Show/hide
Query:  MGDPRPKLPRNPIR-SPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   K P  PI  +PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFHR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIR-SPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  +   +  SRDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTSS     RN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE

XP_022989800.1 uncharacterized protein LOC111486878 [Cucurbita maxima]2.46e-10165.56Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +D     ++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       S+E + DSTSETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNG SRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

XP_038885358.1 uncharacterized protein LOC120075766 [Benincasa hispida]1.48e-11068.33Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM
        MGD R K PR    +PDS   +  +   ED I+FR      A+D++SQSESG  SPTLWGSNSR+SPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNM

Query:  PEASYELSLKDLVEYH-----RVANPDTSIES--RDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGS
        PE+SYELSLKDLVE+H     R  + D +  S  RDDS+SETSFRRDSSK  +ETR LVTRSRSVDSGGFYLKMF PLPFGQ+SAKKK ++  DSGL+G 
Subjt:  PEASYELSLKDLVEYH-----RVANPDTSIES--RDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGS

Query:  SRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE
        SRVSPKPP V        ++DWWRKRS++A GE+EGS+SGGSM SSGSSNSTSS     RN+ESQGSCWFCISP+R+KD E
Subjt:  SRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE

TrEMBL top hitse value%identityAlignment
A0A0A0KCW4 Uncharacterized protein1.88e-10767.86Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFH-RPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   KLP      PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFH R RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFH-RPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  + D +  +RDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTSS     RN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE

A0A1S3B3A7 uncharacterized protein LOC1034856671.14e-10968.21Show/hide
Query:  MGDPRPKLPRNPIR-SPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN
        MGD   K P  PI  +PDS   +  E  +ED I+FR      A+D++SQSESG  SPTLW SNSR++PQFHR RNRSLSPTSR QAIARGQQELMEMVRN
Subjt:  MGDPRPKLPRNPIR-SPDSVAPREQEFAAEDGIVFR------AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRN

Query:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS
        MPE+SYELSLKDLVE+H     R  +   +  SRDDS+SETSFRRD SK   ETRALVTRSRSVDSGGFYLKMFFPLPFGQ+SAKKK ++  DSGL+GSS
Subjt:  MPEASYELSLKDLVEYH-----RVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSS

Query:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE
        RVSPKPP V        D+DWWRKRSS++ GE++GS+SGGSMTSSGSSNSTSS     RN+ESQGSCWFCISP+R+KD E
Subjt:  RVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSS-----RNAESQGSCWFCISPIRTKDPE

A0A6J1FJY2 uncharacterized protein LOC1114447104.73e-10071.43Show/hide
Query:  AEDGIVFR-----AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYH----RVANPDTS
        AED I+FR     A D++SQSESG CSPTLWGSNSR++ QFHRPRNRSLSPTSR QAIARGQQELMEMVRNMPE+SYELSLKDLVE+H       + D S
Subjt:  AEDGIVFR-----AVDEESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYH----RVANPDTS

Query:  IESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAA
        + SRDDS SETSFRRD+SK  +ETRALVTRSRSVDSGGFYLKMF PLPFGQ+SAKKKR++  DSGLN SSRVSPKPP V        DR+WWRKRS+   
Subjt:  IESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAA

Query:  GESEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE
           EGSVSGGS   S S++S  SRN+ESQG CWFCISP+R+KDPE
Subjt:  GESEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE

A0A6J1HEJ4 uncharacterized protein LOC1114621653.40e-10165.56Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +D     ++SQSESG CSPTLWGS+SR +PQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       SIE + DS SETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P G+IS KKK +V +DS LNGSSRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

A0A6J1JND4 uncharacterized protein LOC1114868781.19e-10165.56Show/hide
Query:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP
        MGD   K PRNPI +            AED I+FR +D     ++SQSESG CSPTLWGS+SR SPQFHR RNR+LSPTSR QAIARGQQELMEMVRNMP
Subjt:  MGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVD-----EESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMP

Query:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP
        E+SYELSLKDLVE+H       S+E + DSTSETSF RD  KK +ETRALVTRSRSV+SGGFYLKMFFP+P GQIS KKK ++ +DS LNG SRVSPKPP
Subjt:  EASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAKKKRSVGNDSGLNGSSRVSPKPP

Query:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE
         V        DRDWWRKRSS  +GE+ GSVSG S ++S S++S  ++SRN+ES+GSCWFCISPIR+K+PE
Subjt:  LVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNS--TSSRNAESQGSCWFCISPIRTKDPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21705.0e-2238.33Show/hide
Query:  IVFRAVDEESQSESGACSPTLW-GSNSRSSPQFHRPRNR-SLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSET
        + F ++  +  S+SG CSPTLW  S  +S P FHRP +  SLSP S+ QAIARGQ+ELMEMV  MPE+ YELSLKDLVE     N +   +  D+     
Subjt:  IVFRAVDEESQSESGACSPTLW-GSNSRSSPQFHRPRNR-SLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSET

Query:  SFRRDSSKKT-AETRALVTRSRSVDSGGFYLKMFFPLPFGQI--SAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVS
        + +    +KT ++ R    RS   ++ GF LK+ F +  G +  + KKK+    D     + +VSP+P  + + + K  D++WW + S     ES    S
Subjt:  SFRRDSSKKT-AETRALVTRSRSVDSGGFYLKMFFPLPFGQI--SAKKKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVS

Query:  GGSMTSSGSSNSTSSRNA--ESQGSCW
        G    SS S+NS  SR++  + + SC+
Subjt:  GGSMTSSGSSNSTSSRNA--ESQGSCW

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)3.0e-1935.51Show/hide
Query:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT
        +  S+SG CSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPE+ YELSLKDLVE +           + +       R+  S K 
Subjt:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT

Query:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWWR------KRSSLAAGESEGSVSGGSM
         +      R+  V++ GF LK+ FP+  G  + + KKK +  +DS +    S +S   P + D S K  D+DWW+      +RS           S  S 
Subjt:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWWR------KRSSLAAGESEGSVSGGSM

Query:  TSSGSSNSTSSRNA
         SS  SNS  SRN+
Subjt:  TSSGSSNSTSSRNA

AT1G76980.2 FUNCTIONS IN: molecular_function unknown1.8e-1935.11Show/hide
Query:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT
        +  S+SG CSP LW ++   SP       ++LSP ++ Q IARGQ+ELM+MV  MPE+ YELSLKDLVE +           + +       R+  S K 
Subjt:  ESQSESGACSPTLWGSNSRSSPQFHRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKT

Query:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWW-------RKRSSLAAGESEGS--VSG
         +      R+  V++ GF LK+ FP+  G  + + KKK +  +DS +    S +S   P + D S K  D+DWW       R+  S+ +  + GS   SG
Subjt:  AETRALVTRSRSVDSGGFYLKMFFPLPFG--QISAKKKRSVGNDSGLNGS-SRVSPKPPLVRDGSGKGADRDWW-------RKRSSLAAGESEGS--VSG

Query:  GSMTSSGSSNSTSSRNAESQGSCWF
        GS + S S  S +S   E++G   F
Subjt:  GSMTSSGSSNSTSSRNAESQGSCWF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCCCCTCCCTCGCTTCAAACGCACCACACATAAAAACAAACCATTTCAACTCTCTCTCTCGAGTTTCTTCCTCTACATTTCCCCAGAAAATCCAAAACACTGAGAAAAA
AAAAAAAGAAGAAAAAAGGAAAATCAAAGCCATGGGCGATCCTAGGCCGAAACTCCCAAGAAATCCTATCCGTAGCCCAGATTCTGTCGCGCCTCGAGAGCAGGAGTTTG
CGGCCGAAGACGGCATCGTTTTCAGGGCCGTGGATGAAGAGTCGCAATCCGAATCAGGAGCCTGTTCCCCCACGCTCTGGGGCTCCAATTCTCGGAGCAGCCCCCAATTT
CACCGCCCGCGTAATCGGAGCCTCTCCCCAACTTCGCGGATCCAAGCCATAGCCCGCGGTCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGGCCTCCTACGA
GCTCTCTCTCAAAGATCTCGTCGAGTACCACCGCGTCGCTAATCCCGATACCTCTATTGAAAGCAGAGACGATTCCACCTCCGAAACTTCCTTCAGAAGAGACTCCAGCA
AGAAGACGGCTGAAACCAGAGCGCTGGTCACCCGGAGTAGAAGCGTCGACAGCGGCGGATTTTACCTCAAAATGTTCTTCCCGCTGCCCTTCGGCCAGATTTCGGCCAAA
AAGAAGAGGAGTGTTGGGAACGATTCGGGGTTGAATGGCAGTTCGAGAGTGTCTCCTAAGCCGCCGCTGGTGAGGGACGGATCTGGAAAGGGCGCGGACAGAGACTGGTG
GAGGAAGAGATCGTCGCTGGCCGCCGGCGAGAGCGAGGGCAGCGTCTCCGGCGGAAGCATGACGAGCAGCGGCAGTAGCAACAGCACGAGCAGCAGGAATGCAGAATCTC
AAGGGAGTTGCTGGTTTTGTATCAGTCCAATCAGAACTAAAGATCCAGAGTAA
mRNA sequenceShow/hide mRNA sequence
GCTCCCCCTCCCTCGCTTCAAACGCACCACACATAAAAACAAACCATTTCAACTCTCTCTCTCGAGTTTCTTCCTCTACATTTCCCCAGAAAATCCAAAACACTGAGAAA
AAAAAAAAAGAAGAAAAAAGGAAAATCAAAGCCATGGGCGATCCTAGGCCGAAACTCCCAAGAAATCCTATCCGTAGCCCAGATTCTGTCGCGCCTCGAGAGCAGGAGTT
TGCGGCCGAAGACGGCATCGTTTTCAGGGCCGTGGATGAAGAGTCGCAATCCGAATCAGGAGCCTGTTCCCCCACGCTCTGGGGCTCCAATTCTCGGAGCAGCCCCCAAT
TTCACCGCCCGCGTAATCGGAGCCTCTCCCCAACTTCGCGGATCCAAGCCATAGCCCGCGGTCAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAGGCCTCCTAC
GAGCTCTCTCTCAAAGATCTCGTCGAGTACCACCGCGTCGCTAATCCCGATACCTCTATTGAAAGCAGAGACGATTCCACCTCCGAAACTTCCTTCAGAAGAGACTCCAG
CAAGAAGACGGCTGAAACCAGAGCGCTGGTCACCCGGAGTAGAAGCGTCGACAGCGGCGGATTTTACCTCAAAATGTTCTTCCCGCTGCCCTTCGGCCAGATTTCGGCCA
AAAAGAAGAGGAGTGTTGGGAACGATTCGGGGTTGAATGGCAGTTCGAGAGTGTCTCCTAAGCCGCCGCTGGTGAGGGACGGATCTGGAAAGGGCGCGGACAGAGACTGG
TGGAGGAAGAGATCGTCGCTGGCCGCCGGCGAGAGCGAGGGCAGCGTCTCCGGCGGAAGCATGACGAGCAGCGGCAGTAGCAACAGCACGAGCAGCAGGAATGCAGAATC
TCAAGGGAGTTGCTGGTTTTGTATCAGTCCAATCAGAACTAAAGATCCAGAGTAAAAAAAGGCTACAATTACCCAATCTTACTATGTAATAATCTCCTTTGAATAATGCT
TTGGTATAATCCCACTAAATCACAGCTGAATTTTGCAGCTAAGCTTGTCTATATTTTTTTTGTTTTATTACTTTTTCTTTTTGGAAAAAAGGAGGATTCCTTGTCTTGTT
CGTTCCTGATCTCTTTCTTCATTAGTATTTTAATTTCCATGTTATATATATGTATGTATATTGTTAACCAAATAATTTACT
Protein sequenceShow/hide protein sequence
SPSLASNAPHIKTNHFNSLSRVSSSTFPQKIQNTEKKKKEEKRKIKAMGDPRPKLPRNPIRSPDSVAPREQEFAAEDGIVFRAVDEESQSESGACSPTLWGSNSRSSPQF
HRPRNRSLSPTSRIQAIARGQQELMEMVRNMPEASYELSLKDLVEYHRVANPDTSIESRDDSTSETSFRRDSSKKTAETRALVTRSRSVDSGGFYLKMFFPLPFGQISAK
KKRSVGNDSGLNGSSRVSPKPPLVRDGSGKGADRDWWRKRSSLAAGESEGSVSGGSMTSSGSSNSTSSRNAESQGSCWFCISPIRTKDPE