CuGenDBv2

MC04g0237 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0237
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein E6-like
Genome location: MC04:1847747..1848439
RNA-Seq Expression: MC04g0237
Synteny: MC04g0237
Gene Ontology terms: NA
InterPro domains: IPR040290 - Protein E6-like
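
The genome location above is a sequence ID followed by a start..end range. As a minimal Python sketch, the span can be parsed and measured as follows, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state). The helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("MC04:1847747..1848439")
print(seqid, span_length(start, end))  # MC04 693
```

Under that assumption the span is 693 bp, which is three times the 231 residues of the protein sequence listed for this gene.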


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0059120.1 protein E6-like [Cucumis melo var. makuwa] (e-value 6.91e-66, identity 59.02%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
         HL FFFLLLL  SSVQ EARVNKFFSKFIHTD  +    T       SPAPLS PPE SP LAPTP    FF ESQNAYGLYG   D  EN  +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF
        EEIL  +G  +  + KS +P  +F  T D E   +++ Y+       S Y N N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF
Subjt:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF

Query:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +ENG+Y++D NS+  E   SYGSKK   P ++EFDSMEEYE+SE
Subjt:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_008455495.1 PREDICTED: protein E6-like [Cucumis melo] (e-value 4.25e-67, identity 59.43%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
         HL FFFLLLL  SSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPTP    FF ESQNAYGLYG   D  EN  +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF
        EEIL  +G  +  + KS +P  +F  T D E   +++ Y+       S Y N N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF
Subjt:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF

Query:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +ENG+Y++D NS+  E   SYGSKK   P ++EFDSMEEYE+SE
Subjt:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_022136144.1 protein E6-like [Momordica charantia] (e-value 4.34e-158, identity 100%)
Query:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE
        ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE
Subjt:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR
        EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR
Subjt:  EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR

Query:  AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
Subjt:  AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_022969172.1 probable ATP-dependent RNA helicase ddx42 isoform X2 [Cucurbita maxima] (e-value 3.87e-69, identity 54.84%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
        KHLPF FLLL   SSVQIEARVNKFFSKFIH DR+        LPVA SPAP+S PPEISP LAPTP    FF ESQNAYGLYG  +DD+E++ +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE-SYKSGYPKTSFHGTDFESSRRDE------------QYQSSY-------------------GNNGYGNSEYENN---------------
        EEILAEDG D+ ++KSGY +T+ H  +FES +R E            +Y+S+                     NN Y NSEYENN               
Subjt:  EEILAEDGGDE-SYKSGYPKTSFHGTDFESSRRDE------------QYQSSY-------------------GNNGYGNSEYENN---------------

Query:  -GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
         G RNYQY+SN E  G+R+ RYEP E+QGMSDTRF+ENG+YY++ NS  GE  +SYGSKK P     EFDSMEEYEKSE
Subjt:  -GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_038886989.1 protein E6-like [Benincasa hispida] (e-value 1.08e-68, identity 55.82%)
Query:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSI
        ++    L FFF  +LLLSSVQIEARVNKFFSKFI+TDR         +P    PAP+SAPPEISP LAPTP    FF ESQNAYGLYGR +D  EN  +I
Subjt:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSI

Query:  TDVEEEILAEDGGDE---SYKSGYPKTS-----FHGTDFESSR--RDEQYQSSYGNNGYGNSEYE----NNGGRNYQYESNFEDGGFRRSRYEPRERQGM
        TDVEEEILA DG DE   ++K+ YP T+     +   ++E++   R+ +Y++   +N Y NSEYE    NN  RNYQY+SNFED G+RR R+EP  +QGM
Subjt:  TDVEEEILAEDGGDE---SYKSGYPKTS-----FHGTDFESSR--RDEQYQSSYGNNGYGNSEYE----NNGGRNYQYESNFEDGGFRRSRYEPRERQGM

Query:  SDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        SDTRF+ENG+Y++D NS+ GE   SYG+ K P   ++EFDSMEEYE+SE
Subjt:  SDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3C0L5 protein E6-like (e-value 2.06e-67, identity 59.43%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
         HL FFFLLLL  SSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPTP    FF ESQNAYGLYG   D  EN  +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF
        EEIL  +G  +  + KS +P  +F  T D E   +++ Y+       S Y N N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF
Subjt:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF

Query:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +ENG+Y++D NS+  E   SYGSKK   P ++EFDSMEEYE+SE
Subjt:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A5A7UVJ1 Protein E6-like (e-value 3.34e-66, identity 59.02%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
         HL FFFLLLL  SSVQ EARVNKFFSKFIHTD  +    T       SPAPLS PPE SP LAPTP    FF ESQNAYGLYG   D  EN  +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF
        EEIL  +G  +  + KS +P  +F  T D E   +++ Y+       S Y N N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF
Subjt:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF

Query:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +ENG+Y++D NS+  E   SYGSKK   P ++EFDSMEEYE+SE
Subjt:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A5D3DDN1 Protein E6-like (e-value 2.06e-67, identity 59.43%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
         HL FFFLLLL  SSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPTP    FF ESQNAYGLYG   D  EN  +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF
        EEIL  +G  +  + KS +P  +F  T D E   +++ Y+       S Y N N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF
Subjt:  EEILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQ-------SSYGN-NGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRF

Query:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +ENG+Y++D NS+  E   SYGSKK   P ++EFDSMEEYE+SE
Subjt:  VENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A6J1C3G6 protein E6-like (e-value 2.10e-158, identity 100%)
Query:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE
        ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE
Subjt:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR
        EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR
Subjt:  EEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSR

Query:  AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
Subjt:  AGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A6J1HVL6 probable ATP-dependent RNA helicase ddx42 isoform X2 (e-value 1.87e-69, identity 54.84%)
Query:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE
        KHLPF FLLL   SSVQIEARVNKFFSKFIH DR+        LPVA SPAP+S PPEISP LAPTP    FF ESQNAYGLYG  +DD+E++ +ITDVE
Subjt:  KHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTP----FFYESQNAYGLYGRASDDTENNPSITDVE

Query:  EEILAEDGGDE-SYKSGYPKTSFHGTDFESSRRDE------------QYQSSY-------------------GNNGYGNSEYENN---------------
        EEILAEDG D+ ++KSGY +T+ H  +FES +R E            +Y+S+                     NN Y NSEYENN               
Subjt:  EEILAEDGGDE-SYKSGYPKTSFHGTDFESSRRDE------------QYQSSY-------------------GNNGYGNSEYENN---------------

Query:  -GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
         G RNYQY+SN E  G+R+ RYEP E+QGMSDTRF+ENG+YY++ NS  GE  +SYGSKK P     EFDSMEEYEKSE
Subjt:  -GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

SwissProt top hits (e-value, % identity, alignment)
Q01197 Protein E6 (e-value 1.2e-06, identity 30.43%)
Query:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAP-SPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV
        AS+ K      L L  L S+QI AR  ++FSKF   + N+    T         P     P E  P   P     E+QN YGLYG  S  +  + +  + 
Subjt:  ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAP-SPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV

Query:  EEEIL--AEDGGDESYKSGYPKTS-------FHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVEN
         E  +       DE Y S  P++S       ++   +ES+++    ++ +   G+   E +NN   NY Y  N        + Y   E+QGMSDTR++EN
Subjt:  EEEIL--AEDGGDESYKSGYPKTS-------FHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVEN

Query:  GKYYYDANSRAGEGGESYGSKKNPIPNQFE
        GKYYYD             S+ N  PN+F+
Subjt:  GKYYYDANSRAGEGGESYGSKKNPIPNQFE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G03820.1 unknown protein (e-value 5.2e-08, identity 33.19%)
Query:  ALKHLPFFFLLLLLLSSVQIEARVNK-FFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVEE
        ALK + F F+ +       +EAR  K FFSKF H DR +         VA SPAP     + +  L    F   S    G+  +  +   ++ + TD E 
Subjt:  ALKHLPFFFLLLLLLSSVQIEARVNK-FFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVEE

Query:  EILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRA
        E L     DE   +  P+      + E S    + +  Y NN        NN G  Y   +N+ D G  R      E+QGMSDTR +ENGKY+YD   R 
Subjt:  EILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRA

Query:  GEGGESYG---SKKNPIPNQFEFDSMEEYEKS
         E   S G   ++ N   N  EF++MEEY KS
Subjt:  GEGGESYG---SKKNPIPNQFEFDSMEEYEKS

AT1G28400.1 unknown protein (e-value 4.4e-07, identity 27.55%)
Query:  LPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASP--LTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENN-----------
        L FFF  L+LLS+ QI AR + FF KF      D +P    P      +    S   +      PT F  ES N YGLYG  +    NN           
Subjt:  LPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASP--LTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENN-----------

Query:  ------------PSITDVEEEILAEDGGDESYKSGYP-KTSFHGT--------DFESSRRDEQYQSSYGNN----GYGNSEYENNGGR---NYQYESNFE
                    PS+++ EE          +Y+  YP KT  +GT        +  +++ D  ++  + NN     Y   E+ NN      NY+Y+ N +
Subjt:  ------------PSITDVEEEILAEDGGDESYKSGYP-KTSFHGT--------DFESSRRDEQYQSSYGNN----GYGNSEYENNGGR---NYQYESNFE

Query:  DGGFRRSRYEPR--------------------------ERQGMSDTRFVENGKYYYDANSRAGEG
        +  F  +  + +                          ERQGMSDTRF+E G YYYD  +    G
Subjt:  DGGFRRSRYEPR--------------------------ERQGMSDTRFVENGKYYYDANSRAGEG

AT2G33850.1 unknown protein (e-value 2.5e-10, identity 28.08%)
Query:  FFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNP-SITDVEEEILAED
        FFFLL L+L S QI AR +  F KF   D  + +P    +P+  +      P + +P   P     +S+N YGLYG  + D  N   +    E+ +  +D
Subjt:  FFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNP-SITDVEEEILAED

Query:  G--------------GDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYG--------NSEYENNGGRNYQYESNFEDGGFRRSRYEPR--------
                         ++YK  YPKT    T+   + +D  Y  +  +N YG        N  Y+    ++  Y  N    G  +   EP         
Subjt:  G--------------GDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYG--------NSEYENNGGRNYQYESNFEDGGFRRSRYEPR--------

Query:  ERQGMSDTRFVENGKYYYDANSRAGEG-----------GESYGSKKNPIPNQFEFDSMEE
        ERQGMSDTR++ NGKYYYD +     G              Y  KK+   N ++ +   E
Subjt:  ERQGMSDTRFVENGKYYYDANSRAGEG-----------GESYGSKKNPIPNQFEFDSMEE


Sequences
CDS sequence
GCTTCCGCTTTGAAGCATCTCCCCTTCTTCTTCCTCCTCCTCCTCCTCCTCTCCTCTGTCCAAATCGAAGCCAGAGTCAACAAATTCTTCAGCAAATTCATCCACACGGA
TCGCAATGATGCTTCACCACTCACCCCGGCTCTTCCGGTAGCCCCTTCCCCGGCGCCGCTATCTGCTCCGCCTGAAATATCTCCAATCTTGGCGCCGACGCCGTTTTTCT
ACGAATCGCAGAATGCGTACGGTCTCTACGGCCGTGCTTCCGACGATACCGAGAACAACCCGTCGATCACCGACGTGGAGGAGGAGATTCTCGCGGAAGACGGCGGCGAC
GAGAGCTACAAATCTGGCTATCCGAAGACGAGTTTTCACGGTACCGATTTCGAAAGCTCTAGGAGAGACGAGCAGTACCAGAGCAGTTACGGCAACAATGGCTACGGAAA
TTCCGAGTACGAGAACAACGGCGGCAGAAATTACCAGTACGAGAGCAATTTCGAGGACGGTGGATTCAGAAGGAGCCGGTACGAGCCGAGGGAGCGGCAGGGGATGAGCG
ACACCAGATTCGTGGAGAACGGGAAGTACTATTACGATGCGAACTCGAGGGCTGGAGAAGGCGGCGAATCGTACGGGAGCAAGAAGAATCCGATTCCGAACCAGTTCGAG
TTCGATTCAATGGAGGAGTACGAGAAGAGCGAG
mRNA sequence
GCTTCCGCTTTGAAGCATCTCCCCTTCTTCTTCCTCCTCCTCCTCCTCCTCTCCTCTGTCCAAATCGAAGCCAGAGTCAACAAATTCTTCAGCAAATTCATCCACACGGA
TCGCAATGATGCTTCACCACTCACCCCGGCTCTTCCGGTAGCCCCTTCCCCGGCGCCGCTATCTGCTCCGCCTGAAATATCTCCAATCTTGGCGCCGACGCCGTTTTTCT
ACGAATCGCAGAATGCGTACGGTCTCTACGGCCGTGCTTCCGACGATACCGAGAACAACCCGTCGATCACCGACGTGGAGGAGGAGATTCTCGCGGAAGACGGCGGCGAC
GAGAGCTACAAATCTGGCTATCCGAAGACGAGTTTTCACGGTACCGATTTCGAAAGCTCTAGGAGAGACGAGCAGTACCAGAGCAGTTACGGCAACAATGGCTACGGAAA
TTCCGAGTACGAGAACAACGGCGGCAGAAATTACCAGTACGAGAGCAATTTCGAGGACGGTGGATTCAGAAGGAGCCGGTACGAGCCGAGGGAGCGGCAGGGGATGAGCG
ACACCAGATTCGTGGAGAACGGGAAGTACTATTACGATGCGAACTCGAGGGCTGGAGAAGGCGGCGAATCGTACGGGAGCAAGAAGAATCCGATTCCGAACCAGTTCGAG
TTCGATTCAATGGAGGAGTACGAGAAGAGCGAG
Protein sequence
ASALKHLPFFFLLLLLLSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVEEEILAEDGGD
ESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFE
FDSMEEYEKSE
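
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following Python sketch illustrates that check on the first and last codons of the CDS. It assumes the standard nuclear genetic code; the page does not say which translation table was used, and the `translate` helper is hypothetical.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 nt and last 33 nt of the CDS listed on this page.
print(translate("GCTTCCGCTTTGAAGCATCTC"))              # ASALKHL
print(translate("TTCGATTCAATGGAGGAGTACGAGAAGAGCGAG"))  # FDSMEEYEKSE
```

Passing the full CDS, with its line breaks, to `translate` would allow the same comparison against the complete protein sequence.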