; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002374 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002374
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationChr11:6159969..6161453
RNA-Seq ExpressionHG10002374
SyntenyHG10002374
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026961 - PGG domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058428.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa]4.7e-8075.46Show/hide
Query:  MSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIM
        M+LYTK  +G+WK+W KK LKYKG+W+EEVQ TMMLVATVIATVTFQAGVN PGGV QQDTSF+Y  F  N T   + LYSSLS +N+  V LPAGT+IM
Subjt:  MSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIM

Query:  SYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL
         YQQ +EYWIYLWINTVSFLASMS++LMIVSRF L+NRICS LLTLA CIAVVSLAIGYL GVKMVNL+SF DY++I+ YD  FP TIMC LGVVGMVGL
Subjt:  SYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL

Query:  WQLTHFLKTLFHIFKS
        WQLTHFLK+LFHIF S
Subjt:  WQLTHFLKTLFHIFKS

XP_004141217.1 uncharacterized protein LOC101204214 [Cucumis sativus]2.0e-8661.69Show/hide
Query:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGR--WKVWSKKLKYKGDWVEEV
        +DD GNTILDLS+MLRRIEMVGYLL IPEV TR T MT  +SSN  K +   S+K+T T++ +R+RR  +SL+T     R  +   SKKL+Y+GDWV EV
Subjt:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGR--WKVWSKKLKYKGDWVEEV

Query:  QETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNG----------TFYDILYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLA
        Q+TMMLVATVIATVTFQ GVNPPGG+ QQDTSFNY  FN++           + YD L ++++  N+  V  PAGT +M YQQ Q YWIYL +NT+SFLA
Subjt:  QETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNG----------TFYDILYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLA

Query:  SMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS
        S+SVILMIV RFPLKNRI SW+L+L MC AVVSLAIGYL GVKM+NL++  DY+  + +D V P T+ CWLGVVGMVGLWQ+ HFLK+LFHIF S
Subjt:  SMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS

XP_008447612.1 PREDICTED: uncharacterized protein LOC103490026 [Cucumis melo]6.4e-8563.67Show/hide
Query:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYT-KNPSGRWKVWSKKLKYKGDWVEEVQ
        MDD GNTILDLSLMLRRIEMVGYLL IPE KTR         +N+ KE  ++SQK+TKTRN K +RR  +SL T K P GRWKVW KKLKY+GDWV+EVQ
Subjt:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYT-KNPSGRWKVWSKKLKYKGDWVEEVQ

Query:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRF---NSNG---TFYDI-LYSSLSGF-NDTNVTLPAGTAIMSYQQQEYW-IYLWINTVSFLASMS
         TMMLVATVIATVTFQ GVNPPGGV QQDT F Y        NG    + D  LY   S   N+T+V  PAGT +M +QQ     +YLW+NTVSFLASMS
Subjt:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRF---NSNG---TFYDI-LYSSLSGF-NDTNVTLPAGTAIMSYQQQEYW-IYLWINTVSFLASMS

Query:  VILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL----KTLFHIF----KS
        VILMIVSRFPLKNRICSWLLTL MCIAVVSLAIGYL GVKMVNL++F +  D S+ D VF LT++CW G+VG+V LW +T  L    KTL H F    K 
Subjt:  VILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL----KTLFHIF----KS

Query:  HSFKLNSTQGS
        HSF +NST+ S
Subjt:  HSFKLNSTQGS

XP_011649355.1 uncharacterized protein LOC101212496 [Cucumis sativus]6.2e-8058.05Show/hide
Query:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPS-GRWKVWSKKLKYKGDWVEEVQ
        MDD GNTILDLSL LRRIEMVGYLL IPE KTR         +N+TKE  ++SQK+TK RN K KRR  +SL TK  S G WKVW KKLKYKGDWV+EVQ
Subjt:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPS-GRWKVWSKKLKYKGDWVEEVQ

Query:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNS------NGTFYDILYSSLSGFNDTNVTLPAGTAIMSYQQQE-YWIYLWINTVSFLASMSVI
         TMMLVATVIATVTFQ GVNPPGGV QQDT F Y  FN       N  + +     L  +++T V   AGT +M  QQ E Y IY+W+NTVSFLASM+VI
Subjt:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNS------NGTFYDILYSSLSGFNDTNVTLPAGTAIMSYQQQE-YWIYLWINTVSFLASMSVI

Query:  LMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL--------WQ----------------
        LMIVSRFPLKNRICSWLL  AMCIAV+SLAIGYL GVKMV+L++F D    + Y  +F LTI+CWLGVVG+V L        W                 
Subjt:  LMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL--------WQ----------------

Query:  --LTHFLKTLFHIF----KSHSFKLNSTQ
          L   +K L H F    KSHSF +NST+
Subjt:  --LTHFLKTLFHIF----KSHSFKLNSTQ

XP_016903124.1 PREDICTED: uncharacterized protein LOC107992044 [Cucumis melo]1.2e-9673.53Show/hide
Query:  MLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATV
        MLRRIEMVGY L IPE+KTR+      ASSNNTKE +V S+K TK RN KRKRR  M+LYTK  +G+WK+W KK LKYKG+W+EEVQ TMMLVATVIATV
Subjt:  MLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATV

Query:  TFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL
        TFQAGVN PGGV QQDTSF+Y  F  N T   + LYSSLS +N+  V LPAGT+IM YQQ +EYWIYLWINTVSFLASMS++LMIVSRF L+NRICS LL
Subjt:  TFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL

Query:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS
        TLA CIAVVSLAIGYL GVKMVNL+SF DY++I+ YD  FP TIMC LGVVGMVGLWQLTHFLK+LFHIF S
Subjt:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS

TrEMBL top hitse value%identityAlignment
A0A0A0LCQ0 ANK_REP_REGION domain-containing protein9.6e-8761.69Show/hide
Query:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGR--WKVWSKKLKYKGDWVEEV
        +DD GNTILDLS+MLRRIEMVGYLL IPEV TR T MT  +SSN  K +   S+K+T T++ +R+RR  +SL+T     R  +   SKKL+Y+GDWV EV
Subjt:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGR--WKVWSKKLKYKGDWVEEV

Query:  QETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNG----------TFYDILYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLA
        Q+TMMLVATVIATVTFQ GVNPPGG+ QQDTSFNY  FN++           + YD L ++++  N+  V  PAGT +M YQQ Q YWIYL +NT+SFLA
Subjt:  QETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNG----------TFYDILYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLA

Query:  SMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS
        S+SVILMIV RFPLKNRI SW+L+L MC AVVSLAIGYL GVKM+NL++  DY+  + +D V P T+ CWLGVVGMVGLWQ+ HFLK+LFHIF S
Subjt:  SMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS

A0A1S3BIS1 uncharacterized protein LOC1034900263.1e-8563.67Show/hide
Query:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYT-KNPSGRWKVWSKKLKYKGDWVEEVQ
        MDD GNTILDLSLMLRRIEMVGYLL IPE KTR         +N+ KE  ++SQK+TKTRN K +RR  +SL T K P GRWKVW KKLKY+GDWV+EVQ
Subjt:  MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYT-KNPSGRWKVWSKKLKYKGDWVEEVQ

Query:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRF---NSNG---TFYDI-LYSSLSGF-NDTNVTLPAGTAIMSYQQQEYW-IYLWINTVSFLASMS
         TMMLVATVIATVTFQ GVNPPGGV QQDT F Y        NG    + D  LY   S   N+T+V  PAGT +M +QQ     +YLW+NTVSFLASMS
Subjt:  ETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRF---NSNG---TFYDI-LYSSLSGF-NDTNVTLPAGTAIMSYQQQEYW-IYLWINTVSFLASMS

Query:  VILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL----KTLFHIF----KS
        VILMIVSRFPLKNRICSWLLTL MCIAVVSLAIGYL GVKMVNL++F +  D S+ D VF LT++CW G+VG+V LW +T  L    KTL H F    K 
Subjt:  VILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL----KTLFHIF----KS

Query:  HSFKLNSTQGS
        HSF +NST+ S
Subjt:  HSFKLNSTQGS

A0A1S4E4G3 uncharacterized protein LOC1079920422.9e-6755.51Show/hide
Query:  MVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVN
        MVGYLL IPE     T +          +I VD QK TK R                      V SKKLKYKGDWV + Q+T+MLVATVIAT+TFQ GVN
Subjt:  MVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVN

Query:  PPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGF--------NDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL
        PPGG+ QQDTSFNY  F  +  +      SL G+        N   +  PAGT +M+YQQ Q YW+YLW+NT+SFLAS+SVILMIV RFPLKNRI SW+L
Subjt:  PPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGF--------NDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL

Query:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS
         LAMCIAVVSLAIGYL GVKMVNL++  +Y+  + YD V P +++CWL VVGMVGLWQ+ HFLK+LF+IF S
Subjt:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS

A0A1S4E4H3 uncharacterized protein LOC1079920446.0e-9773.53Show/hide
Query:  MLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATV
        MLRRIEMVGY L IPE+KTR+      ASSNNTKE +V S+K TK RN KRKRR  M+LYTK  +G+WK+W KK LKYKG+W+EEVQ TMMLVATVIATV
Subjt:  MLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATV

Query:  TFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL
        TFQAGVN PGGV QQDTSF+Y  F  N T   + LYSSLS +N+  V LPAGT+IM YQQ +EYWIYLWINTVSFLASMS++LMIVSRF L+NRICS LL
Subjt:  TFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIMSYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLL

Query:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS
        TLA CIAVVSLAIGYL GVKMVNL+SF DY++I+ YD  FP TIMC LGVVGMVGLWQLTHFLK+LFHIF S
Subjt:  TLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKS

A0A5D3BCC9 Ankyrin repeat-containing protein2.3e-8075.46Show/hide
Query:  MSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIM
        M+LYTK  +G+WK+W KK LKYKG+W+EEVQ TMMLVATVIATVTFQAGVN PGGV QQDTSF+Y  F  N T   + LYSSLS +N+  V LPAGT+IM
Subjt:  MSLYTKNPSGRWKVWSKK-LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDI-LYSSLSGFNDTNVTLPAGTAIM

Query:  SYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL
         YQQ +EYWIYLWINTVSFLASMS++LMIVSRF L+NRICS LLTLA CIAVVSLAIGYL GVKMVNL+SF DY++I+ YD  FP TIMC LGVVGMVGL
Subjt:  SYQQ-QEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGL

Query:  WQLTHFLKTLFHIFKS
        WQLTHFLK+LFHIF S
Subjt:  WQLTHFLKTLFHIFKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G13950.1 unknown protein1.3e-1433.74Show/hide
Query:  SGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTLP-----AGTAIMSYQQQ
        S  W V  K LK +GDW+E+ +  +M+ ATVIA ++FQ  VNPPGGV Q D                        F +   T P     AGTA++ Y+  
Subjt:  SGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTLP-----AGTAIMSYQQQ

Query:  EYWIYLWI---NTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMV
        +   Y+ +   +TVSF  SMS+IL+++S   L+NR+   +L   M +AV+ ++  + F + +V
Subjt:  EYWIYLWI---NTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMV

AT4G13266.1 unknown protein1.0e-1126.98Show/hide
Query:  LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTLPAGTAIMSYQQQEYWIYLWI---NTVS
        L ++GDW+E+ +  +++ ATVIA ++F   VNPPGGV Q +               D      + F         GT+I+ +   +   YL +   N VS
Subjt:  LKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTLPAGTAIMSYQQQEYWIYLWI---NTVS

Query:  FLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL
        F ASM +I +++  F  +NR+   ++ + M +AV+ ++  + F      L+   D       +K+  + +  W+ +  +V L QL  FL
Subjt:  FLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVSLAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFL

AT5G51160.1 Ankyrin repeat family protein4.1e-0525.74Show/hide
Query:  CMSLYTKNPSGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTL-PAGTAIM
        C     K+ S +  V     K   D   E +  +++VA+++AT TFQA + PPGG  Q  +                +  + +  N TN     AG +IM
Subjt:  CMSLYTKNPSGRWKVWSKKLKYKGDWVEEVQETMMLVATVIATVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTL-PAGTAIM

Query:  -SYQQQEYWIYLWINTVSFLASMSVILMIVSRFPLK
         ++    + ++++ NT+ F  S+S++ ++   FPL+
Subjt:  -SYQQQEYWIYLWINTVSFLASMSVILMIVSRFPLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATGCAGGAAATACTATTTTGGATTTGTCTTTGATGTTACGACGAATTGAGATGGTAGGGTATTTACTCAAGATTCCAGAAGTAAAAACTAGATCAACATGCAT
GACAAGTATTGCATCATCAAACAACACAAAAGAAATTACTGTAGACTCCCAAAAGATGACAAAGACAAGAAACCCCAAAAGAAAAAGACGAGGATGTATGTCATTGTACA
CCAAAAATCCATCAGGACGGTGGAAGGTATGGAGTAAGAAACTAAAATACAAAGGAGATTGGGTTGAAGAAGTGCAAGAAACAATGATGCTTGTAGCTACCGTGATTGCA
ACCGTGACTTTTCAAGCCGGAGTGAACCCTCCTGGTGGCGTTTTGCAACAAGACACTTCATTCAATTATTGGAGATTCAACAGTAATGGTACATTTTATGACATATTATA
TAGCTCTTTAAGTGGATTCAATGACACAAATGTTACTTTACCAGCTGGAACTGCAATAATGAGTTACCAACAACAAGAGTACTGGATATACTTGTGGATTAACACAGTAT
CGTTCTTGGCATCTATGAGCGTGATTTTGATGATCGTTAGCCGATTTCCGCTAAAAAATAGGATTTGTAGTTGGCTATTGACTCTAGCCATGTGCATAGCAGTGGTATCC
TTAGCAATTGGGTATTTGTTCGGAGTTAAAATGGTTAACCTCATTTCATTTCGTGATTATCTTGATATCAGTTCATACGACAAGGTGTTTCCTTTAACAATTATGTGTTG
GCTTGGGGTGGTTGGAATGGTTGGCCTATGGCAACTAACTCACTTTCTCAAGACCTTATTCCACATTTTTAAATCTCACAGCTTCAAGCTGAACTCAACACAAGGTTCTT
CCAATCTACAACTTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATGCAGGAAATACTATTTTGGATTTGTCTTTGATGTTACGACGAATTGAGATGGTAGGGTATTTACTCAAGATTCCAGAAGTAAAAACTAGATCAACATGCAT
GACAAGTATTGCATCATCAAACAACACAAAAGAAATTACTGTAGACTCCCAAAAGATGACAAAGACAAGAAACCCCAAAAGAAAAAGACGAGGATGTATGTCATTGTACA
CCAAAAATCCATCAGGACGGTGGAAGGTATGGAGTAAGAAACTAAAATACAAAGGAGATTGGGTTGAAGAAGTGCAAGAAACAATGATGCTTGTAGCTACCGTGATTGCA
ACCGTGACTTTTCAAGCCGGAGTGAACCCTCCTGGTGGCGTTTTGCAACAAGACACTTCATTCAATTATTGGAGATTCAACAGTAATGGTACATTTTATGACATATTATA
TAGCTCTTTAAGTGGATTCAATGACACAAATGTTACTTTACCAGCTGGAACTGCAATAATGAGTTACCAACAACAAGAGTACTGGATATACTTGTGGATTAACACAGTAT
CGTTCTTGGCATCTATGAGCGTGATTTTGATGATCGTTAGCCGATTTCCGCTAAAAAATAGGATTTGTAGTTGGCTATTGACTCTAGCCATGTGCATAGCAGTGGTATCC
TTAGCAATTGGGTATTTGTTCGGAGTTAAAATGGTTAACCTCATTTCATTTCGTGATTATCTTGATATCAGTTCATACGACAAGGTGTTTCCTTTAACAATTATGTGTTG
GCTTGGGGTGGTTGGAATGGTTGGCCTATGGCAACTAACTCACTTTCTCAAGACCTTATTCCACATTTTTAAATCTCACAGCTTCAAGCTGAACTCAACACAAGGTTCTT
CCAATCTACAACTTAGTTAA
Protein sequenceShow/hide protein sequence
MDDAGNTILDLSLMLRRIEMVGYLLKIPEVKTRSTCMTSIASSNNTKEITVDSQKMTKTRNPKRKRRGCMSLYTKNPSGRWKVWSKKLKYKGDWVEEVQETMMLVATVIA
TVTFQAGVNPPGGVLQQDTSFNYWRFNSNGTFYDILYSSLSGFNDTNVTLPAGTAIMSYQQQEYWIYLWINTVSFLASMSVILMIVSRFPLKNRICSWLLTLAMCIAVVS
LAIGYLFGVKMVNLISFRDYLDISSYDKVFPLTIMCWLGVVGMVGLWQLTHFLKTLFHIFKSHSFKLNSTQGSSNLQLS