; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014294 (gene) of Snake gourd v1 genome

Gene IDTan0014294
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1262)
Genome locationLG04:8143934..8146221
RNA-Seq ExpressionTan0014294
SyntenyTan0014294
Gene Ontology termsGO:0052736 - beta-glucanase activity (molecular function)
InterPro domainsIPR010683 - Protein of unknown function DUF1262


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574010.1 hypothetical protein SDJN03_27897, partial [Cucurbita argyrosperma subsp. sororia]3.5e-17978.41Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTRPLSLYRNSPSS+S APPEGPNSGIL I+DEE AE++WCCGLFK KESVKELPFPQNK+LRLTHAA+AGE EYS+SVRAVLIPVLN P+SSNQYYI
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL
        INSHGTRKGLACTSSKEE+ TS R CYSVPDPPPQLFDPKN YQQFQIS+YIYCGGPNG+I+KSMAPDGVPP+RLSRKG +AYT PL   N+EPTEALGL
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL

Query:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG
        N SLR RLP+    SSDPVVVGKWYCPFIFIREG+V SQMSNS YYEMTL +NW EIFGCE S +G NGVDVDVYVERE  SVAG   A   RR   GDG
Subjt:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG

Query:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT
        F+WFE A           VGLS  IVERVKWEEGR GFGWVEEG EKK+RV+RREE +  GVG+WRRF CYVL+ERFVLKRMDGS+VL WEFRHTHQL T
Subjt:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT

Query:  KWE
        KWE
Subjt:  KWE

KAG7013069.1 hypothetical protein SDJN02_25825, partial [Cucurbita argyrosperma subsp. argyrosperma]6.7e-15469.98Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTRPLSLYRNSPSS+S APPEGPNSGIL I+DEE AE++WCCGLFK KESVKELPFPQNK+LRLTHAA+AGE EYS+S                    
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL
                         E+ TS R CYSVPDPPPQLFDPKN YQQFQIS+YIYCGGPNG+I+KSMAPDGVPP+RLSRKG +AYT PL   N+EPTEALGL
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL

Query:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG
        N SLR RLP+    SSDPVVVGKWYCPFIFIREG+V SQMSNS YYEMTL +NW EIFGCE S +G NGVDVDVYVERE  SVAG   A   RR   GDG
Subjt:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG

Query:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT
        F+WFE A           VGLS  IVERVKWEEGR GFGWVEEG EKK+RV+RREE +  GVG+WRRF CYVLVERFVLKRMDGS+VL WEFRHTHQL T
Subjt:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT

Query:  KWE
        KWE
Subjt:  KWE

XP_022149112.1 uncharacterized protein LOC111017603 [Momordica charantia]4.0e-15169.46Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYY
        MYVTRPLSLYRNSPS +S  PPEGPNSGIL  ED EE AES+W  G+FK K+SVK  P PQN++LRLTHAADAGEYEYSDS+ A+L+PVLNQP+SSNQYY
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYY

Query:  IINSHGTRKGLACTSSKEEDNTS--SRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEA
        +I+S GT KGLACTSSK EDNTS  SRF Y + D P QL DPKNTYQQFQISNYIYCG PNGFISKS+APDGVPPE L RKGW+AY  PL NNN  PTEA
Subjt:  IINSHGTRKGLACTSSKEEDNTS--SRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEA

Query:  LGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNV
        LGL+ +LRARLPD L    +PVVVGKWYCPFIF+R+G V SQMSNS YYEMTL+QNW EIFGC     G   VD DV VERE +S+AG+         N 
Subjt:  LGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNV

Query:  GDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQ
        GDG MWF  +           VGLSL IVERVKWEE R GF + +E  +K V+VKRREEF+    G WRRF CYVLVERFVLKRMDGSLVL WEFRHTHQ
Subjt:  GDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQ

Query:  LRTKWE
        +RTKWE
Subjt:  LRTKWE

XP_022945417.1 uncharacterized protein LOC111449654 [Cucurbita moschata]1.5e-17778.16Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTRPLSLYRNSPSS+S APPEGPNSGIL I+DEE AE++WCCGLFK KESVKELPFPQNK+LRLTHAA+AGE EYS+SVRAVLIPVLN P+SSNQYYI
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL
        INSHGTRKGLACTSSKEE+ TS R CYSV DPPPQLFDPKN YQQFQIS+YIYCGGPNG+I+KSMAPDGVPP+RLSRKG +AYT PL   N+EPTEALGL
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL

Query:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG
        N SLR RLP+    SSDPVVVGKWYCPFIFIREG+V SQMSNS YYEMTL ++W EIFGCE S +G NGVDVDVYVERE  SVAG   A   RR   GDG
Subjt:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG

Query:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT
        F+WFE A           VGLS  IVERVKWEEGR GFGWVEEG EKK+RV+RREE +  GVG+WRRF CYVLVERFVLKRMDGS+VL WEFRHTHQL T
Subjt:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT

Query:  KWE
        KWE
Subjt:  KWE

XP_038892872.1 uncharacterized protein LOC120081783 [Benincasa hispida]1.4e-15168.27Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAI----EDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN
        MYVTRPLSLYR SPSS+S+ PPEGPNSGIL I    ED E   SKW CG+FK KESVK LPFPQNK+LRLTH+ +AGE+EYS+SV AVLIPVLN+P+SSN
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAI----EDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN

Query:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE
        QYYIIN+ G RKGLACT+SKE++ +SS+ CY+VPDPPPQ+FDPKN YQQFQIS+YIYCGG +GF+SKS+APDGVPP RLSR GW+AY  PL NN  EPT+
Subjt:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE

Query:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGE--VGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMA
        ALGLN SLRA LPD LN      SSD VVVGKWYCPFIFIREG   VGSQM+NS+YYE+TL QNWVEIF CE +    +  +VD +VERE VS+AG+  A
Subjt:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGE--VGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMA

Query:  TTRRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVR-VKRREEFEGLGVG-MWRRFRCYVLVERFVLKRMDGSLV
        T     NVGDG +WFE            +VGLSL+IVER+KWE+ R GF WVEE  EKKVR VK +EE +    G  W+RF CYVLVERFV+KRMDGSLV
Subjt:  TTRRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVR-VKRREEFEGLGVG-MWRRFRCYVLVERFVLKRMDGSLV

Query:  LAWEFRHTHQLRTKWE
        L WEFRHTHQ+RTKWE
Subjt:  LAWEFRHTHQLRTKWE

TrEMBL top hitse value%identityAlignment
A0A0A0KT31 Uncharacterized protein1.7e-14266.27Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAE---SKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN
        MYVTRPLSLYR+SP S+S+ PPEGPNSGIL I+D EE AE   S+W CGLFK KESVK  PFPQNK+L+LTH+A+AGE+EYS+SV AV+IPVLNQP+SSN
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAE---SKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN

Query:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE
        QYYIIN+ G RKGLACTSSK ++ +SS+ CY+VPDPPPQLFDPKN YQQFQIS+Y+YCGG +GFI  S+A DGV P RLSR GW+AY  PL N+ +EPT 
Subjt:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE

Query:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGE--VGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGV---DVDVYVEREAVSVAGK
        A GLN  LRARLPD LN      SSDPV VGKWY PFIFIR+G   VGSQM+NS YYE+TL QNWVEIFGCE      NGV   +VDV+VERE VS  G+
Subjt:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGE--VGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGV---DVDVYVEREAVSVAGK

Query:  TMATTRRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVR-VKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGS
        + A++ +  NV DG +WFE            +VGLSL++VER+KWEE R GF WV+EG EKKVR VK R + + +G   W RF CYVLVERFV+KRMDGS
Subjt:  TMATTRRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVR-VKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGS

Query:  LVLAWEFRHTHQLRTKWE
        LVL WEFRHTHQ+ TKWE
Subjt:  LVLAWEFRHTHQLRTKWE

A0A1S3BER1 uncharacterized protein LOC1034892232.7e-12459.02Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTRPLS+ RNSPS++S+APPEGPNSGIL I+D EAAESKW  G+ K  E+V   PFPQNK + L+H    G     + + A+LIPVLNQP+SSNQYYI
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEP-TEALG
        I S+G+ KGLA   SKEE+ +       V D PPQ FDP N YQ+F+ISN +Y G PNGF  KS+A +GV P  ++ K W+AY   L    ++P TEALG
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEP-TEALG

Query:  LNHSLRARLPDQLNS-----SSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGC-EGSSKGKNGVDVDVYVEREAVSVAGKTMATTRR
        L+ SLRARLP    S     SS  VVVGKWYCPFIF+REG+V SQ+ NS YYEM L+QNWVE+FGC   ++ G  GV++DV VE+E VSV G+ +     
Subjt:  LNHSLRARLPDQLNS-----SSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGC-EGSSKGKNGVDVDVYVEREAVSVAGKTMATTRR

Query:  RTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFR
          N+GDG  WF           ++RVGLS+ IVER++WEE R GF WV EGGEK V+V+RREEF+  GVGMWRRF CYVLVERF LKRMDGSLVL+WEFR
Subjt:  RTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFR

Query:  HTHQLRTKWE
        HTHQ+RTKWE
Subjt:  HTHQLRTKWE

A0A6J1D6Y4 uncharacterized protein LOC1110176032.0e-15169.46Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYY
        MYVTRPLSLYRNSPS +S  PPEGPNSGIL  ED EE AES+W  G+FK K+SVK  P PQN++LRLTHAADAGEYEYSDS+ A+L+PVLNQP+SSNQYY
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIED-EEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYY

Query:  IINSHGTRKGLACTSSKEEDNTS--SRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEA
        +I+S GT KGLACTSSK EDNTS  SRF Y + D P QL DPKNTYQQFQISNYIYCG PNGFISKS+APDGVPPE L RKGW+AY  PL NNN  PTEA
Subjt:  IINSHGTRKGLACTSSKEEDNTS--SRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEA

Query:  LGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNV
        LGL+ +LRARLPD L    +PVVVGKWYCPFIF+R+G V SQMSNS YYEMTL+QNW EIFGC     G   VD DV VERE +S+AG+         N 
Subjt:  LGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNV

Query:  GDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQ
        GDG MWF  +           VGLSL IVERVKWEE R GF + +E  +K V+VKRREEF+    G WRRF CYVLVERFVLKRMDGSLVL WEFRHTHQ
Subjt:  GDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQ

Query:  LRTKWE
        +RTKWE
Subjt:  LRTKWE

A0A6J1D7B1 uncharacterized protein LOC1110175842.2e-13159.95Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPP----EGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN
        MYVTRPLS+YRN  ++ +   P    EGPN+G+L IED EAAES+W  GL K K SVK  PFPQNK++ L +  ++GE++++D   A+LIPV+N+P+SSN
Subjt:  MYVTRPLSLYRNSPSSMSLAPP----EGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSN

Query:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE
        +YY+I S G  KGLACTSSKE+D TS   C+ +PD PPQLFDP N YQQFQISNY+ C GP GF++ S+APDGVPP  L R+GW+AYT    N N E T+
Subjt:  QYYIINSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTE

Query:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATT
        ALGL+ +LRA LP  LN      SSDPVVVGKWYCPFIF+R+GEVGSQ+SNS YYEMTL+Q+W EIFGC     G+ GVD DV VE+E + +AG+   T 
Subjt:  ALGLNHSLRARLPDQLN-----SSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATT

Query:  RRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWE
         R   VGDG +WF           +  VGLSL IVERVKWEE R GF + +E  +K V+VKRREE+   GVG W+RF CYVL+ERFVLKRMDGSLVL WE
Subjt:  RRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWE

Query:  FRHTHQLRTKWE
        F+HTHQ+RTKWE
Subjt:  FRHTHQLRTKWE

A0A6J1G0V4 uncharacterized protein LOC1114496547.1e-17878.16Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTRPLSLYRNSPSS+S APPEGPNSGIL I+DEE AE++WCCGLFK KESVKELPFPQNK+LRLTHAA+AGE EYS+SVRAVLIPVLN P+SSNQYYI
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL
        INSHGTRKGLACTSSKEE+ TS R CYSV DPPPQLFDPKN YQQFQIS+YIYCGGPNG+I+KSMAPDGVPP+RLSRKG +AYT PL   N+EPTEALGL
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGL

Query:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG
        N SLR RLP+    SSDPVVVGKWYCPFIFIREG+V SQMSNS YYEMTL ++W EIFGCE S +G NGVDVDVYVERE  SVAG   A   RR   GDG
Subjt:  NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDG

Query:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT
        F+WFE A           VGLS  IVERVKWEEGR GFGWVEEG EKK+RV+RREE +  GVG+WRRF CYVLVERFVLKRMDGS+VL WEFRHTHQL T
Subjt:  FMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRT

Query:  KWE
        KWE
Subjt:  KWE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13470.1 Protein of unknown function (DUF1262)8.6e-7541.34Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTR LS Y+ + S +    PE PNSG+L I+DEE+  +  CC     + ++K LPFPQN  L ++ +        +     V IPVL+QP+SSN+YY+
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG
        I   G   G A  ++KEED     FC+S VP+  PQ  DP + YQQFQI  +        + + S+AP+G PPE L RK W A  S   +   +  +A G
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG

Query:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD
        +   +R+ LP   N+S   VVVGKWY PFIF++EG    Q+ +S YY M L Q W E++ CE +      V VDV VE E V + G+ +    +R +  +
Subjt:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD

Query:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR
        G  WF             R+GL  +++ER+KWEE R  FGW  E   ++  VK+ E F G G   W+ +RCYVLVE F L+R D SLVL +EF+H  +LR
Subjt:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR

Query:  TKWE
        TKWE
Subjt:  TKWE

AT1G13480.1 Protein of unknown function (DUF1262)1.8e-8543.56Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTR LS Y+  PS + L PPEGPNSGI+ I+DEE+  +  CC     +  +K LPFPQN  L   + +  GE++ +     V IPVL+QP+SSN YY+
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG
        +   G   G A  S+ EE+  SS FC+S +PD  PQ  DP + YQQF+I  +        + + S+A DGVPP  L RK W    S   +      +A G
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG

Query:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD
        +N  L  RL  +L S    + +GKWY PFIF+ EG+V  QM+ S +Y +TL+Q W E+F CE        V VDV VE E+V + G+    T  R + GD
Subjt:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD

Query:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR
        G +WF   +         ++GL  ++VER+KWEE R  FGW+ E GE+   +KR E FEG G   W+ +RCYVL+E F L RMDGSLVL +EFRH  +L+
Subjt:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR

Query:  TKWE
        +KW+
Subjt:  TKWE

AT1G13500.1 Protein of unknown function (DUF1262)1.1e-8243Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVT+ LS Y+ +PS ++L P EGPNSG+L I+DEE+  +  CC +      +  LPFPQN   R+      G   Y D V    IPVL+QP SSN YY+
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKA-YTSPLFNNNYEPTEAL
        I   G   G AC S+KE D  S  FC++ +P+  P+  DP +  QQF+I  +        F + S+A DG+PP+ L+RKGW   +++P   +     +A 
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKA-YTSPLFNNNYEPTEAL

Query:  GL-NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKN-GVDVDVYVEREAVSVAGKTMATTRRRTN
        G+ +  LR  LPD  NS    VVVGKWY PF+F++EG+   QM  SMYY MTL+Q + E+F CE     K   V VDV VE E V + G+ +A   +  N
Subjt:  GL-NHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKN-GVDVDVYVEREAVSVAGKTMATTRRRTN

Query:  VGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTH
          DG +WF  +          ++GL  +++ER+KWEE R  FGW+ +G E++  +KR E FEG G   W+ +RCYVLVE F LKR DGSLVL +EF+H  
Subjt:  VGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTH

Query:  QLRTKWE
        +L++KW+
Subjt:  QLRTKWE

AT1G13520.1 Protein of unknown function (DUF1262)8.0e-8141.81Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVTR LS Y+ + S ++ + PEGPNSG+L I+DEE+  +  CC        +K LPFPQN  L +T+    G    S   + + IPVL+QP  SN+YY+
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYE---PTE
        I   G + G A  S+KEED     FC+S VP+  PQ  DP + YQQF++       G   + + S+AP+G+PPE L RK W       ++N+ +     +
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYE---PTE

Query:  ALGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNG--VDVDVYVEREAVSVAGKTMATTRRR
        A G+N +LR++LP+ +N+S   VVVGKWY PFIF++E +   Q+ +S YY MTL+Q W E++ C   +  + G  V VDV VE + V + G+    T  R
Subjt:  ALGLNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNG--VDVDVYVEREAVSVAGKTMATTRRR

Query:  TNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRH
           G GF+WF   +         ++GL  ++VER+KWEE R  FGW+  G  ++  +KR E FEG G   W+ +RC VL+E F LKRMDGSLVL +EF H
Subjt:  TNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRH

Query:  THQLRTKWE
          +L++KW+
Subjt:  THQLRTKWE

AT1G13530.1 Protein of unknown function (DUF1262)4.4e-7941.58Show/hide
Query:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI
        MYVT+ LS Y+ +PS ++  P EGPNSG+L I+DEE+  +  CC        +  LPFPQN  + + +    G+           IPVL+QP SSN YY+
Subjt:  MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYI

Query:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG
        I   G   G AC S+KEED  S   C++ V +  P+L DP + YQQF+I  +        F + S+A DG+PP  L RKGW    S   +          
Subjt:  INSHGTRKGLACTSSKEEDNTSSRFCYS-VPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALG

Query:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD
        ++  LR  LPD   S    VVVGKWY PF+F++EG+   QM  SMYY MTL Q + E+F CE        V VDV VE E V + G+ +    +  N  D
Subjt:  LNHSLRARLPDQLNSSSDPVVVGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGD

Query:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR
        G +WF         + T ++G+  +++ER+KWEE R  FGW ++G E K  +KR E+FEG G   W+ +RCYVLVE F LK+ DGSLVL +EFRH  +L+
Subjt:  GFMWFEGAATTATTTTTTRVGLSLMIVERVKWEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLR

Query:  TKWE
        +KW+
Subjt:  TKWE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGTGACGAGGCCTCTTTCTCTGTACAGAAACTCTCCGTCGTCGATGTCGTTGGCGCCGCCGGAGGGTCCAAATTCGGGGATTTTGGCGATCGAAGACGAAGAAGC
TGCAGAATCGAAATGGTGCTGTGGATTGTTTAAGATGAAGGAGTCTGTGAAGGAACTGCCCTTCCCTCAGAACAAGTTACTGCGGCTTACACATGCAGCTGATGCAGGGG
AGTACGAGTATTCAGATTCAGTACGCGCAGTGCTTATTCCAGTTCTGAATCAGCCCATCTCTTCTAACCAATATTATATCATCAATTCCCATGGCACTCGAAAAGGGCTA
GCATGCACAAGTTCAAAGGAAGAAGATAACACAAGTAGCCGGTTTTGCTACAGTGTTCCCGATCCACCACCGCAACTCTTCGACCCAAAAAACACATATCAACAATTCCA
AATCAGTAATTACATATACTGTGGAGGACCCAATGGTTTCATCTCCAAGTCCATGGCTCCGGACGGGGTTCCACCAGAACGCCTAAGCCGCAAGGGTTGGAAAGCTTACA
CGTCCCCACTTTTCAACAACAACTACGAACCCACTGAAGCACTTGGTCTCAACCACTCCCTACGAGCCCGCCTTCCAGATCAGCTCAACTCATCTTCAGACCCAGTGGTG
GTCGGGAAGTGGTATTGCCCTTTCATTTTCATTCGAGAAGGGGAGGTTGGGTCTCAGATGAGTAACTCAATGTACTACGAAATGACCCTTGAGCAGAATTGGGTTGAGAT
ATTTGGATGTGAAGGCTCTAGCAAAGGTAAGAATGGCGTGGATGTGGATGTTTATGTGGAGAGGGAAGCGGTTTCGGTTGCTGGCAAGACGATGGCGACAACGAGGAGGA
GAACTAATGTTGGTGATGGTTTCATGTGGTTTGAGGGGGCAGCAACGACGGCGACAACGACAACGACGACTAGAGTTGGATTGAGCTTGATGATTGTGGAGAGAGTGAAA
TGGGAGGAGGGGAGAGTGGGATTTGGATGGGTTGAAGAAGGAGGAGAGAAGAAAGTGAGAGTGAAGAGAAGGGAGGAGTTTGAAGGGTTGGGAGTGGGAATGTGGAGGAG
ATTTAGGTGTTATGTGTTGGTTGAGAGGTTTGTGCTTAAGAGAATGGATGGAAGTTTGGTTCTTGCTTGGGAATTTAGGCACACTCATCAACTTAGAACCAAATGGGAGT
GA
mRNA sequenceShow/hide mRNA sequence
CAATAATTAAGATGTACGTGACGAGGCCTCTTTCTCTGTACAGAAACTCTCCGTCGTCGATGTCGTTGGCGCCGCCGGAGGGTCCAAATTCGGGGATTTTGGCGATCGAA
GACGAAGAAGCTGCAGAATCGAAATGGTGCTGTGGATTGTTTAAGATGAAGGAGTCTGTGAAGGAACTGCCCTTCCCTCAGAACAAGTTACTGCGGCTTACACATGCAGC
TGATGCAGGGGAGTACGAGTATTCAGATTCAGTACGCGCAGTGCTTATTCCAGTTCTGAATCAGCCCATCTCTTCTAACCAATATTATATCATCAATTCCCATGGCACTC
GAAAAGGGCTAGCATGCACAAGTTCAAAGGAAGAAGATAACACAAGTAGCCGGTTTTGCTACAGTGTTCCCGATCCACCACCGCAACTCTTCGACCCAAAAAACACATAT
CAACAATTCCAAATCAGTAATTACATATACTGTGGAGGACCCAATGGTTTCATCTCCAAGTCCATGGCTCCGGACGGGGTTCCACCAGAACGCCTAAGCCGCAAGGGTTG
GAAAGCTTACACGTCCCCACTTTTCAACAACAACTACGAACCCACTGAAGCACTTGGTCTCAACCACTCCCTACGAGCCCGCCTTCCAGATCAGCTCAACTCATCTTCAG
ACCCAGTGGTGGTCGGGAAGTGGTATTGCCCTTTCATTTTCATTCGAGAAGGGGAGGTTGGGTCTCAGATGAGTAACTCAATGTACTACGAAATGACCCTTGAGCAGAAT
TGGGTTGAGATATTTGGATGTGAAGGCTCTAGCAAAGGTAAGAATGGCGTGGATGTGGATGTTTATGTGGAGAGGGAAGCGGTTTCGGTTGCTGGCAAGACGATGGCGAC
AACGAGGAGGAGAACTAATGTTGGTGATGGTTTCATGTGGTTTGAGGGGGCAGCAACGACGGCGACAACGACAACGACGACTAGAGTTGGATTGAGCTTGATGATTGTGG
AGAGAGTGAAATGGGAGGAGGGGAGAGTGGGATTTGGATGGGTTGAAGAAGGAGGAGAGAAGAAAGTGAGAGTGAAGAGAAGGGAGGAGTTTGAAGGGTTGGGAGTGGGA
ATGTGGAGGAGATTTAGGTGTTATGTGTTGGTTGAGAGGTTTGTGCTTAAGAGAATGGATGGAAGTTTGGTTCTTGCTTGGGAATTTAGGCACACTCATCAACTTAGAAC
CAAATGGGAGTGATCAAGACAAACATTGGGAGGTTTAGGATGAATTAATGTTATTATATATTGTTGCCTTAGCTAATTTAGCTTTTTTATGTTATTTGTTGCTTTGGTTT
GGGTGAGATGATGGTAATTGCATTGGCATTGGGGTGATTTTTGTTTGCTAAAATAAAAGGCTTGTCTACGGTGTATTGTATTTGTTTATTCATATTGTATTTGATAATA
Protein sequenceShow/hide protein sequence
MYVTRPLSLYRNSPSSMSLAPPEGPNSGILAIEDEEAAESKWCCGLFKMKESVKELPFPQNKLLRLTHAADAGEYEYSDSVRAVLIPVLNQPISSNQYYIINSHGTRKGL
ACTSSKEEDNTSSRFCYSVPDPPPQLFDPKNTYQQFQISNYIYCGGPNGFISKSMAPDGVPPERLSRKGWKAYTSPLFNNNYEPTEALGLNHSLRARLPDQLNSSSDPVV
VGKWYCPFIFIREGEVGSQMSNSMYYEMTLEQNWVEIFGCEGSSKGKNGVDVDVYVEREAVSVAGKTMATTRRRTNVGDGFMWFEGAATTATTTTTTRVGLSLMIVERVK
WEEGRVGFGWVEEGGEKKVRVKRREEFEGLGVGMWRRFRCYVLVERFVLKRMDGSLVLAWEFRHTHQLRTKWE