; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS022342 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS022342
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein E6-like
Genome locationscaffold47:2665677..2666384
RNA-Seq ExpressionMS022342
SyntenyMS022342
Gene Ontology termsNA
InterPro domainsIPR040290 - Protein E6-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059120.1 protein E6-like [Cucumis melo var. makuwa]5.6e-5258.26Show/hide
Query:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE
        HL FFFLLL LSSVQ EARVNKFFSKFIHTD  +    T       SPAPLS PPE SP LAPT    PFF ESQNAYGLYG   D  EN  +ITDVEEE
Subjt:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE

Query:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
        IL  +G  +  + KS +P  +F  T D E   +++ Y+ + G        +N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF+E
Subjt:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        NG+Y++D NS+  E   SYGSKK     ++EFDSMEEYE+SE
Subjt:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_008455495.1 PREDICTED: protein E6-like [Cucumis melo]6.6e-5358.68Show/hide
Query:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE
        HL FFFLLL LSSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPT    PFF ESQNAYGLYG   D  EN  +ITDVEEE
Subjt:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE

Query:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
        IL  +G  +  + KS +P  +F  T D E   +++ Y+ + G        +N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF+E
Subjt:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        NG+Y++D NS+  E   SYGSKK     ++EFDSMEEYE+SE
Subjt:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

XP_022136144.1 protein E6-like [Momordica charantia]1.6e-12399.15Show/hide
Query:  MASALKHLPFFF-LLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV
        MASALKHLPFFF LLL+LSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV
Subjt:  MASALKHLPFFF-LLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV

Query:  EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS
        EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS
Subjt:  EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS

Query:  RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR
        RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR
Subjt:  RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR

XP_022969172.1 probable ATP-dependent RNA helicase ddx42 isoform X2 [Cucurbita maxima]2.4e-5554.17Show/hide
Query:  SMASAL--KHLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENN
        +MA+++  KHLPF FLL  LSSVQIEARVNKFFSKFIH DR+        LPVA SPAP+S PPEISP LAPT    PFF ESQNAYGLYG  +DD+E++
Subjt:  SMASAL--KHLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENN

Query:  PSITDVEEEILAEDGGD-ESYKSGYPKTSFHGTDFESSRRDE------------QYQSS-------------------YGNNGYGNSEYENN--------
         +ITDVEEEILAEDG D +++KSGY +T+ H  +FES +R E            +Y+S+                     NN Y NSEYENN        
Subjt:  PSITDVEEEILAEDGGD-ESYKSGYPKTSFHGTDFESSRRDE------------QYQSS-------------------YGNNGYGNSEYENN--------

Query:  --------GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEF
                G RNYQY+SN E  G+R+ RYEP E+QGMSDTRF+ENG+YY++ NS  GE  +SYGSKK P     EFDSMEEYEKSE F
Subjt:  --------GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEF

XP_038886989.1 protein E6-like [Benincasa hispida]7.8e-5457.08Show/hide
Query:  FFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEEILA
        FFF +L+LSSVQIEARVNKFFSKFI+TDR         +P    PAP+SAPPEISP LAPT    PFF ESQNAYGLYGR +D  EN  +ITDVEEEILA
Subjt:  FFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEEILA

Query:  EDGGDE---SYKSGYPKT-----SFHGTDFESSR--RDEQYQSSYGNNGYGNSEY----ENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENG
         DG DE   ++K+ YP T     ++   ++E++   R+ +Y++   +N Y NSEY    ENN  RNYQY+SNFED G+RR R+EP  +QGMSDTRF+ENG
Subjt:  EDGGDE---SYKSGYPKT-----SFHGTDFESSR--RDEQYQSSYGNNGYGNSEY----ENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENG

Query:  KYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        +Y++D NS+ GE   SYG+ K P   ++EFDSMEEYE+SE
Subjt:  KYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

TrEMBL top hitse value%identityAlignment
A0A1S3C0L5 protein E6-like3.2e-5358.68Show/hide
Query:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE
        HL FFFLLL LSSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPT    PFF ESQNAYGLYG   D  EN  +ITDVEEE
Subjt:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE

Query:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
        IL  +G  +  + KS +P  +F  T D E   +++ Y+ + G        +N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF+E
Subjt:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        NG+Y++D NS+  E   SYGSKK     ++EFDSMEEYE+SE
Subjt:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A5A7UVJ1 Protein E6-like2.7e-5258.26Show/hide
Query:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE
        HL FFFLLL LSSVQ EARVNKFFSKFIHTD  +    T       SPAPLS PPE SP LAPT    PFF ESQNAYGLYG   D  EN  +ITDVEEE
Subjt:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE

Query:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
        IL  +G  +  + KS +P  +F  T D E   +++ Y+ + G        +N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF+E
Subjt:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        NG+Y++D NS+  E   SYGSKK     ++EFDSMEEYE+SE
Subjt:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A5D3DDN1 Protein E6-like3.2e-5358.68Show/hide
Query:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE
        HL FFFLLL LSSVQ EARVNKFFSKFIHTD  +  P T       SPAPLS PPE SP LAPT    PFF ESQNAYGLYG   D  EN  +ITDVEEE
Subjt:  HLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENNPSITDVEEE

Query:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
        IL  +G  +  + KS +P  +F  T D E   +++ Y+ + G        +N Y NSEYENN   GRNYQY+SNFEDGG+RRSR+EP E+QGMSDTRF+E
Subjt:  ILAEDGGDE--SYKSGYPKTSFHGT-DFESSRRDEQYQSSYG--------NNGYGNSEYENNG--GRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE
        NG+Y++D NS+  E   SYGSKK     ++EFDSMEEYE+SE
Subjt:  NGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSE

A0A6J1C3G6 protein E6-like7.7e-12499.15Show/hide
Query:  MASALKHLPFFF-LLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV
        MASALKHLPFFF LLL+LSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV
Subjt:  MASALKHLPFFF-LLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDV

Query:  EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS
        EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS
Subjt:  EEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANS

Query:  RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR
        RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR
Subjt:  RAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEFHR

A0A6J1HVL6 probable ATP-dependent RNA helicase ddx42 isoform X21.2e-5554.17Show/hide
Query:  SMASAL--KHLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENN
        +MA+++  KHLPF FLL  LSSVQIEARVNKFFSKFIH DR+        LPVA SPAP+S PPEISP LAPT    PFF ESQNAYGLYG  +DD+E++
Subjt:  SMASAL--KHLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPT----PFFYESQNAYGLYGRASDDTENN

Query:  PSITDVEEEILAEDGGD-ESYKSGYPKTSFHGTDFESSRRDE------------QYQSS-------------------YGNNGYGNSEYENN--------
         +ITDVEEEILAEDG D +++KSGY +T+ H  +FES +R E            +Y+S+                     NN Y NSEYENN        
Subjt:  PSITDVEEEILAEDGGD-ESYKSGYPKTSFHGTDFESSRRDE------------QYQSS-------------------YGNNGYGNSEYENN--------

Query:  --------GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEF
                G RNYQY+SN E  G+R+ RYEP E+QGMSDTRF+ENG+YY++ NS  GE  +SYGSKK P     EFDSMEEYEKSE F
Subjt:  --------GGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQFEFDSMEEYEKSEEF

SwissProt top hitse value%identityAlignment
Q01197 Protein E63.1e-0529.53Show/hide
Query:  MASALKHLPFFFLLLI-LSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAP-SPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITD
        MAS+ K      L L  L S+QI AR  ++FSKF   + N+    T         P     P E  P   P     E+QN YGLYG  S  +  + +  +
Subjt:  MASALKHLPFFFLLLI-LSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAP-SPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITD

Query:  VEEEIL--AEDGGDESYKSGYPKTS-------FHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE
          E  +       DE Y S  P++S       ++   +ES+++    ++ +   G+   E +NN   NY Y  N        + Y   E+QGMSDTR++E
Subjt:  VEEEIL--AEDGGDESYKSGYPKTS-------FHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVE

Query:  NGKYYYDANSRAG------EGGESYGSKKNPIPNQFE-----FDSMEEYEKSEE
        NGKYYYD  S         +      S+     N++        + EE+E+SEE
Subjt:  NGKYYYDANSRAG------EGGESYGSKKNPIPNQFE-----FDSMEEYEKSEE

Arabidopsis top hitse value%identityAlignment
AT1G03820.1 unknown protein2.4e-0833.76Show/hide
Query:  MAS-ALKHLPFFFLLLILSSVQIEARVNK-FFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITD
        MAS ALK +  F  +       +EAR  K FFSKF H DR +         VA SPAP     + +  L    F   S    G+  +  +   ++ + TD
Subjt:  MAS-ALKHLPFFFLLLILSSVQIEARVNK-FFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITD

Query:  VEEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDAN
         E E L     DE   +  P+      + E S    + +  Y NN        NN G  Y   +N+ D G  R      E+QGMSDTR +ENGKY+YD  
Subjt:  VEEEILAEDGGDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDAN

Query:  SRAGEGGESYG---SKKNPIPNQFEFDSMEEYEKSEE
         R  E   S G   ++ N   N  EF++MEEY KS E
Subjt:  SRAGEGGESYG---SKKNPIPNQFEFDSMEEYEKSEE

AT1G28400.1 unknown protein4.1e-0827.27Show/hide
Query:  LPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASP--LTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENN------------
        L FFF  L+L S QI AR + FF KF      D +P    P      +    S   +      PT F  ES N YGLYG  +    NN            
Subjt:  LPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASP--LTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENN------------

Query:  -----------PSITDVEEEILAEDGGDESYKSGYP-KTSFHGT--------DFESSRRDEQYQSSYGNN----GYGNSEYENNGGR---NYQYESNFED
                   PS+++ EE          +Y+  YP KT  +GT        +  +++ D  ++  + NN     Y   E+ NN      NY+Y+ N ++
Subjt:  -----------PSITDVEEEILAEDGGDESYKSGYP-KTSFHGT--------DFESSRRDEQYQSSYGNN----GYGNSEYENNGGR---NYQYESNFED

Query:  GGFRRSRYEPR--------------------------ERQGMSDTRFVENGKYYYDANSRAGEG
          F  +  + +                          ERQGMSDTRF+E G YYYD  +    G
Subjt:  GGFRRSRYEPR--------------------------ERQGMSDTRFVENGKYYYDANSRAGEG

AT2G33850.1 unknown protein2.8e-0927.64Show/hide
Query:  MASALKHLPFFFLL-LILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNP-SITD
        MA +     FFFLL L+L S QI AR +  F KF   D  + +P    +P+  +      P + +P   P     +S+N YGLYG  + D  N   +   
Subjt:  MASALKHLPFFFLL-LILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNP-SITD

Query:  VEEEILAEDG--------------GDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYG--------NSEYENNGGRNYQYESNFEDGGFRRSRYEP
         E+ +  +D                 ++YK  YPKT    T+   + +D  Y  +  +N YG        N  Y+    ++  Y  N    G  +   EP
Subjt:  VEEEILAEDG--------------GDESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYG--------NSEYENNGGRNYQYESNFEDGGFRRSRYEP

Query:  R--------ERQGMSDTRFVENGKYYYDANSRAGEG-----------GESYGSKKNPIPNQFEFDSMEEYEKSEE
                 ERQGMSDTR++ NGKYYYD +     G              Y  KK+   N ++ +   E    E+
Subjt:  R--------ERQGMSDTRFVENGKYYYDANSRAGEG-----------GESYGSKKNPIPNQFEFDSMEEYEKSEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCAATGGCTTCCGCTTTGAAGCATCTCCCCTTCTTCTTCCTCCTCCTCATCCTCTCCTCTGTCCAAATCGAAGCCAGAGTCAACAAATTCTTCAGCAAATTCATCCACAC
GGATCGCAATGATGCTTCACCACTCACCCCGGCTCTTCCGGTAGCCCCTTCCCCGGCGCCGCTATCTGCTCCGCCTGAAATATCTCCAATCTTGGCGCCGACGCCGTTTT
TCTACGAATCGCAGAATGCGTACGGTCTCTACGGCCGTGCTTCCGACGATACCGAGAACAACCCGTCGATCACCGACGTGGAGGAGGAGATTCTCGCGGAAGACGGCGGC
GACGAGAGCTACAAATCTGGCTATCCGAAGACGAGTTTTCACGGTACCGATTTCGAAAGCTCTAGGAGAGACGAGCAGTACCAGAGCAGTTACGGCAACAATGGCTACGG
AAATTCCGAGTACGAGAACAACGGCGGCAGAAATTACCAGTACGAGAGCAATTTCGAGGACGGTGGATTCAGAAGGAGCCGGTACGAGCCGAGGGAGCGGCAGGGGATGA
GCGACACCAGATTCGTGGAGAACGGGAAGTACTATTACGATGCGAACTCGAGGGCTGGAGAAGGCGGCGAATCGTACGGGAGCAAGAAGAATCCGATTCCGAACCAGTTC
GAGTTCGATTCAATGGAGGAGTACGAGAAGAGCGAGGAATTTCATCGT
mRNA sequenceShow/hide mRNA sequence
TCAATGGCTTCCGCTTTGAAGCATCTCCCCTTCTTCTTCCTCCTCCTCATCCTCTCCTCTGTCCAAATCGAAGCCAGAGTCAACAAATTCTTCAGCAAATTCATCCACAC
GGATCGCAATGATGCTTCACCACTCACCCCGGCTCTTCCGGTAGCCCCTTCCCCGGCGCCGCTATCTGCTCCGCCTGAAATATCTCCAATCTTGGCGCCGACGCCGTTTT
TCTACGAATCGCAGAATGCGTACGGTCTCTACGGCCGTGCTTCCGACGATACCGAGAACAACCCGTCGATCACCGACGTGGAGGAGGAGATTCTCGCGGAAGACGGCGGC
GACGAGAGCTACAAATCTGGCTATCCGAAGACGAGTTTTCACGGTACCGATTTCGAAAGCTCTAGGAGAGACGAGCAGTACCAGAGCAGTTACGGCAACAATGGCTACGG
AAATTCCGAGTACGAGAACAACGGCGGCAGAAATTACCAGTACGAGAGCAATTTCGAGGACGGTGGATTCAGAAGGAGCCGGTACGAGCCGAGGGAGCGGCAGGGGATGA
GCGACACCAGATTCGTGGAGAACGGGAAGTACTATTACGATGCGAACTCGAGGGCTGGAGAAGGCGGCGAATCGTACGGGAGCAAGAAGAATCCGATTCCGAACCAGTTC
GAGTTCGATTCAATGGAGGAGTACGAGAAGAGCGAGGAATTTCATCGT
Protein sequenceShow/hide protein sequence
SMASALKHLPFFFLLLILSSVQIEARVNKFFSKFIHTDRNDASPLTPALPVAPSPAPLSAPPEISPILAPTPFFYESQNAYGLYGRASDDTENNPSITDVEEEILAEDGG
DESYKSGYPKTSFHGTDFESSRRDEQYQSSYGNNGYGNSEYENNGGRNYQYESNFEDGGFRRSRYEPRERQGMSDTRFVENGKYYYDANSRAGEGGESYGSKKNPIPNQF
EFDSMEEYEKSEEFHR