; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014922 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014922
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF674)
Genome locationChr02:21860917..21862578
RNA-Seq ExpressionHG10014922
SyntenyHG10014922
Gene Ontology termsNA
InterPro domainsIPR007750 - Protein of unknown function DUF674


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo]9.0e-5256.19Show/hide
Query:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---
        +VRLKLLID++ +RVL+GEA+K  IDFLFNLLSLP+G VIRLLKK GMVG L NLYESVE LN++YLQPNQSKD +LKPKV F++ T LL NI   A   
Subjt:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---

Query:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD
                     A+ P A    C N     C  + P               GFVK + TY+VMDDL+VK +S  S  TLL KFNIK+V +LEEKVITLD
Subjt:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD

Query:  VDEGVELLEASLQSKTVLTNAFLKRR
        V++GV+LL ASLQSKTVLT+ FL R+
Subjt:  VDEGVELLEASLQSKTVLTNAFLKRR

XP_008465479.1 PREDICTED: uncharacterized protein LOC103503094 [Cucumis melo]7.5e-9174.9Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        MEQTDV LKLLID KTERVLYGEA+KKFIDFL N+LSLP+G VI LLKK+GMVGCLGNLYESVETLN+SYLQPNQS+D VLKPK+LFNSFTKLL N+ VP
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
        AAA PPA FYC+  +YSSCR                                QP+ G G VKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
Subjt:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE

Query:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV
        KVITLDVDEGVELLEASLQSKTVLTNAFLKRRR HID SDVKLSE + PSV
Subjt:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia]2.3e-5555Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLF--NSFTKLLANIV
        M   +VRLKLLID+K +RVL+GEA+K  IDFLFNLLSLP+G VIRLLKK GMVGCLGNLYESVETLN++YLQPNQSKDI+LKPKV F  +S T LL NI 
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLF--NSFTKLLANIV

Query:  VPAAAKPPATFY-CHNGSYSSC-RGIQPSPGR----------------------------------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNI
        + AAA    TFY C++ ++++C R +   P                                    GFVK + TY+VMDDL+VK +S  S   LL KFN+
Subjt:  VPAAAKPPATFY-CHNGSYSSC-RGIQPSPGR----------------------------------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNI

Query:  KDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR
        K+V +LEEKV+TLDV+EGV+LL+ASL SKTVLT+ F++R+
Subjt:  KDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR

XP_031741245.1 uncharacterized protein LOC105435653 [Cucumis sativus]1.3e-9076Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANI-VV
        MEQTDV LKLLID KTERVLYGEA+KKFIDFL N+LSLP+G VIRLLKKDGMVGCLGNLYESVETLN SYLQPNQS+DIVLKPK++FNSFTKL+ N+ VV
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANI-VV

Query:  PAAAKPPATFYCHNGSY---------------------SSCR-------GIQPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEK
        PAAA PPA FYC  GSY                      +CR       G QP+P RGFVKDLATYIV DDLTVKHISDFSITTLLKKFNIKDVDSLEEK
Subjt:  PAAAKPPATFYCHNGSY---------------------SSCR-------GIQPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEK

Query:  VITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV
        VITLDV+EGVELLEASLQSKTVLTNAFLKRRRSHID +DVKLS  + PSV
Subjt:  VITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV

XP_038891339.1 uncharacterized protein LOC120080784 [Benincasa hispida]5.3e-5249.46Show/hide
Query:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNES-YLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        + T+VRLKLLIDT+   VLYGEA+K FIDFLFNLLSLP+GAVIRLL K  M+GCLGNLYES+ETLNE+ +++P QSK+ +L+PKV  N  TKLL  I+  
Subjt:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNES-YLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAAK-----------------------------------------------PPATFYCHNGSYSS------------------CRGIQPSPGR----GFV
         ++                                                PP++F    GS SS                  C     S G     GFV
Subjt:  AAAK-----------------------------------------------PPATFYCHNGSYSS------------------CRGIQPSPGR----GFV

Query:  KDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDS
        K LATYIVMDDLTVKHISDFSI +L +KFNIKD  +LEEKVITL+VDEGVELL A+LQSK VLT+ FL+R R  ID+
Subjt:  KDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDS

TrEMBL top hitse value%identityAlignment
A0A1S3CGQ2 uncharacterized protein LOC1035002684.3e-5256.19Show/hide
Query:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---
        +VRLKLLID++ +RVL+GEA+K  IDFLFNLLSLP+G VIRLLKK GMVG L NLYESVE LN++YLQPNQSKD +LKPKV F++ T LL NI   A   
Subjt:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---

Query:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD
                     A+ P A    C N     C  + P               GFVK + TY+VMDDL+VK +S  S  TLL KFNIK+V +LEEKVITLD
Subjt:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD

Query:  VDEGVELLEASLQSKTVLTNAFLKRR
        V++GV+LL ASLQSKTVLT+ FL R+
Subjt:  VDEGVELLEASLQSKTVLTNAFLKRR

A0A1S3CPD6 uncharacterized protein LOC1035030943.6e-9174.9Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        MEQTDV LKLLID KTERVLYGEA+KKFIDFL N+LSLP+G VI LLKK+GMVGCLGNLYESVETLN+SYLQPNQS+D VLKPK+LFNSFTKLL N+ VP
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
        AAA PPA FYC+  +YSSCR                                QP+ G G VKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
Subjt:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE

Query:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV
        KVITLDVDEGVELLEASLQSKTVLTNAFLKRRR HID SDVKLSE + PSV
Subjt:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV

A0A5A7U8V2 DUF674 domain-containing protein4.3e-5256.19Show/hide
Query:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---
        +VRLKLLID++ +RVL+GEA+K  IDFLFNLLSLP+G VIRLLKK GMVG L NLYESVE LN++YLQPNQSKD +LKPKV F++ T LL NI   A   
Subjt:  DVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPA---

Query:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD
                     A+ P A    C N     C  + P               GFVK + TY+VMDDL+VK +S  S  TLL KFNIK+V +LEEKVITLD
Subjt:  -------------AAKPPATF-YCHNGSYSSCRGIQPSPGR-----------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLD

Query:  VDEGVELLEASLQSKTVLTNAFLKRR
        V++GV+LL ASLQSKTVLT+ FL R+
Subjt:  VDEGVELLEASLQSKTVLTNAFLKRR

A0A5A7V731 Putative DNA polymerase zeta catalytic subunit3.6e-9174.9Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        MEQTDV LKLLID KTERVLYGEA+KKFIDFL N+LSLP+G VI LLKK+GMVGCLGNLYESVETLN+SYLQPNQS+D VLKPK+LFNSFTKLL N+ VP
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
        AAA PPA FYC+  +YSSCR                                QP+ G G VKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE
Subjt:  AAAKPPATFYCHNGSYSSCRGI------------------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEE

Query:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV
        KVITLDVDEGVELLEASLQSKTVLTNAFLKRRR HID SDVKLSE + PSV
Subjt:  KVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPSV

A0A6J1CBJ8 uncharacterized protein LOC1110100131.1e-5555Show/hide
Query:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLF--NSFTKLLANIV
        M   +VRLKLLID+K +RVL+GEA+K  IDFLFNLLSLP+G VIRLLKK GMVGCLGNLYESVETLN++YLQPNQSKDI+LKPKV F  +S T LL NI 
Subjt:  MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLF--NSFTKLLANIV

Query:  VPAAAKPPATFY-CHNGSYSSC-RGIQPSPGR----------------------------------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNI
        + AAA    TFY C++ ++++C R +   P                                    GFVK + TY+VMDDL+VK +S  S   LL KFN+
Subjt:  VPAAAKPPATFY-CHNGSYSSC-RGIQPSPGR----------------------------------GFVKDLATYIVMDDLTVKHISDFSITTLLKKFNI

Query:  KDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR
        K+V +LEEKV+TLDV+EGV+LL+ASL SKTVLT+ F++R+
Subjt:  KDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09110.1 Protein of unknown function (DUF674)3.2e-1529.26Show/hide
Query:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKK-----DGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTK-LLA
        E+    L+LLID +  RV+  EA K F+D L +LL+LP+G ++RLL+K       +VGCL NLY+SV  ++    +    K  +L P+    S  + L  
Subjt:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKK-----DGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTK-LLA

Query:  NIVVPAAAKPPATFYCHN-GSYSSCRGI----------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKV
        NI    A K    F C N  S  +CR +                      +      F     ++++ DDL V   S   +  +L  F     D L+E +
Subjt:  NIVVPAAAKPPATFYCHN-GSYSSCRGI----------------------QPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKV

Query:  ITLDVDEGVELLEASLQSKTVLTNAFLKR
        I +  +E + LL     S+  LT+ FL++
Subjt:  ITLDVDEGVELLEASLQSKTVLTNAFLKR

AT5G01120.1 Protein of unknown function (DUF674)6.5e-1628.27Show/hide
Query:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKP-------------
        E + + LKLLID +  +V++ EA   F+D LF+  +LP+G ++RLL+     +   +GC  N+Y SV ++   +      K ++L P             
Subjt:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKP-------------

Query:  KVLFNSFTKLLANIVVPAAAKPPATFYCHNGSYSSCRG-------IQPSPGRG-------FVKDLAT-YIVMDDLTVKHISDFSITTLLKKFNIKDVDSL
        K+  +  TK     +   + +    +     S  SC          Q   GRG       FV+   T +I+ DDL V+  S  S   +LK     D D L
Subjt:  KVLFNSFTKLLANIVVPAAAKPPATFYCHNGSYSSCRG-------IQPSPGRG-------FVKDLAT-YIVMDDLTVKHISDFSITTLLKKFNIKDVDSL

Query:  EEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH
         E ++ +++ E   LL     S T LT+ FLK++ SH
Subjt:  EEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH

AT5G01130.1 Protein of unknown function (DUF674)4.5e-1730.63Show/hide
Query:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDG-----MVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLAN
        E+  V L+L ID +  +V+  EA+K F+D LF+LL+LP+G +IRLL++        VGC  NLY SV  +     + +  K I+L P+ + +   K L  
Subjt:  EQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDG-----MVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLAN

Query:  IVVPAAAK-----PPATFY-------CHNGSYSSCRGIQPS--PGRGFVKD--LATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLDVDEG
         + P   K         FY       C  GS  +    +P   P    +++     +I+ DDL V   S   +   LK     D+  L E ++ +  +E 
Subjt:  IVVPAAAK-----PPATFY-------CHNGSYSSCRGIQPS--PGRGFVKD--LATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLDVDEG

Query:  VELLEASLQSKTVLTNAFLKRR
        + LLE    SK  LTN FL ++
Subjt:  VELLEASLQSKTVLTNAFLKRR

AT5G43240.1 Protein of unknown function (DUF674)2.5e-1528.39Show/hide
Query:  VRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        ++LKLLID +  +V++ EA K F+D LF+  +LP+G ++RLL+     +   +GC  N+Y SV ++   +      K ++L P  L +   + L   V  
Subjt:  VRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAA-------------KPPATFYCHNGSYSSC------------RGIQPSPGRG-----FVK-DLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLE
        + A             +   ++   N S  SC            RG   S G G     FV+ D  ++++ DDL V+  S      +LK     D + L+
Subjt:  AAA-------------KPPATFYCHNGSYSSC------------RGIQPSPGRG-----FVK-DLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLE

Query:  EKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH
        EK+  ++++E   LLE    S   LT+ FLK++ S+
Subjt:  EKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH

AT5G43240.3 Protein of unknown function (DUF674)2.5e-1528.39Show/hide
Query:  VRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP
        ++LKLLID +  +V++ EA K F+D LF+  +LP+G ++RLL+     +   +GC  N+Y SV ++   +      K ++L P  L +   + L   V  
Subjt:  VRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLK-----KDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVP

Query:  AAA-------------KPPATFYCHNGSYSSC------------RGIQPSPGRG-----FVK-DLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLE
        + A             +   ++   N S  SC            RG   S G G     FV+ D  ++++ DDL V+  S      +LK     D + L+
Subjt:  AAA-------------KPPATFYCHNGSYSSC------------RGIQPSPGRG-----FVK-DLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLE

Query:  EKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH
        EK+  ++++E   LLE    S   LT+ FLK++ S+
Subjt:  EKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAACCGATGTGAGATTGAAGCTTCTGATAGACACAAAAACAGAACGAGTTCTTTACGGTGAAGCAAACAAGAAGTTCATAGACTTTCTTTTCAATCTACTTTC
CCTCCCAGTTGGGGCTGTAATTAGGCTGTTGAAAAAGGATGGCATGGTGGGGTGTTTGGGAAATCTGTACGAGAGTGTAGAAACCTTGAACGAGTCCTATTTGCAGCCAA
ATCAAAGCAAAGATATAGTCCTAAAACCCAAAGTCTTATTCAATTCTTTCACCAAACTTTTGGCTAATATTGTTGTCCCTGCTGCAGCTAAACCACCTGCTACATTTTAT
TGCCATAATGGATCATATAGCAGCTGCCGTGGAATTCAACCATCTCCTGGTAGGGGATTTGTGAAGGATTTGGCCACTTACATAGTGATGGATGACCTTACTGTCAAGCA
CATTTCTGACTTCTCCATTACAACTCTTTTGAAAAAGTTCAATATCAAGGATGTGGATTCTTTGGAGGAGAAAGTCATCACTTTGGATGTCGATGAGGGTGTGGAATTAC
TAGAGGCTTCTTTGCAGTCAAAGACAGTTCTAACTAACGCGTTTCTGAAAAGACGAAGATCACACATTGACAGTAGTGATGTTAAGCTTTCTGAAGTTGTTAAGCCTTCT
GTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAACCGATGTGAGATTGAAGCTTCTGATAGACACAAAAACAGAACGAGTTCTTTACGGTGAAGCAAACAAGAAGTTCATAGACTTTCTTTTCAATCTACTTTC
CCTCCCAGTTGGGGCTGTAATTAGGCTGTTGAAAAAGGATGGCATGGTGGGGTGTTTGGGAAATCTGTACGAGAGTGTAGAAACCTTGAACGAGTCCTATTTGCAGCCAA
ATCAAAGCAAAGATATAGTCCTAAAACCCAAAGTCTTATTCAATTCTTTCACCAAACTTTTGGCTAATATTGTTGTCCCTGCTGCAGCTAAACCACCTGCTACATTTTAT
TGCCATAATGGATCATATAGCAGCTGCCGTGGAATTCAACCATCTCCTGGTAGGGGATTTGTGAAGGATTTGGCCACTTACATAGTGATGGATGACCTTACTGTCAAGCA
CATTTCTGACTTCTCCATTACAACTCTTTTGAAAAAGTTCAATATCAAGGATGTGGATTCTTTGGAGGAGAAAGTCATCACTTTGGATGTCGATGAGGGTGTGGAATTAC
TAGAGGCTTCTTTGCAGTCAAAGACAGTTCTAACTAACGCGTTTCTGAAAAGACGAAGATCACACATTGACAGTAGTGATGTTAAGCTTTCTGAAGTTGTTAAGCCTTCT
GTTTAA
Protein sequenceShow/hide protein sequence
MEQTDVRLKLLIDTKTERVLYGEANKKFIDFLFNLLSLPVGAVIRLLKKDGMVGCLGNLYESVETLNESYLQPNQSKDIVLKPKVLFNSFTKLLANIVVPAAAKPPATFY
CHNGSYSSCRGIQPSPGRGFVKDLATYIVMDDLTVKHISDFSITTLLKKFNIKDVDSLEEKVITLDVDEGVELLEASLQSKTVLTNAFLKRRRSHIDSSDVKLSEVVKPS
V