; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027182 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027182
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationchr10:45647298..45653058
RNA-Seq ExpressionLag0027182
SyntenyLag0027182
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145771.1 uncharacterized protein LOC111015146 [Momordica charantia]5.9e-22481.7Show/hide
Query:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQN-YRCLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL
        ++  KVLVLRP D  +PFIRA+ VISAVASFSLFFYLTFSDQN   C+GC+G+LR+SNHRK++A   D GE + TN+SH+VFGIGGSV TWNERRHYCEL
Subjt:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQN-YRCLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL

Query:  WW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG
        WW KNVTRGFVWLEEKP FPWPE+SPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKETFELGLENVRWFVMGDDDTVFFTENLVELL +YDHNQMYYIG 
Subjt:  WW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG

Query:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
        NSESVEQD++HSYTMAYGG GFA+SYPLA  LV ILDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
Subjt:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV

Query:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFL
        QTIFPAMSQPDSLK+L++AY+TDPSRALQHTFCYD AR WSVSVSWGY+VQLYPWLAT K+LETPFLTFQTWK+  NEPF F+TRPVSSDPC+RPIL+FL
Subjt:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFL

Query:  DAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        DAA+R     + GRRTVTRYR++VEE  KECER+DY PAL VR+F+V+APEFDRRLWRQ
Subjt:  DAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

XP_022935260.1 uncharacterized protein LOC111442198 [Cucurbita moschata]7.0e-22580.91Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R  LVIS VASFSLF YLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPEFPWP+SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET++LGL+NVRWFVMGDDDTVFFTENLV+LL KYDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQD +HSY MAYGG GFA+SYPLAA LV ILDGCINRYADMYGSDQKIQGCIS+IGVPLTKELGFHQVDIRG+ YG+LAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        Y++ IFPAM++PDS+KKLH AY+TDP RALQH+FCYD AR WSVSVSWGYSVQLYPWLAT KEL+TPFLTFQTWK+  NE FTF+TRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FLD AERFGG      RT+TRYRK+VE    EC + DYA AL+V YFNV+APEFDRRLWRQ
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

XP_022983991.1 uncharacterized protein LOC111482442 [Cucurbita maxima]1.7e-22681.78Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R LLVIS VASFSLFFYLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTW+ERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KN+TRGFVWLEEKPEF W +SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR++KET+ELGL+NVRWFVMGDDDTVFFTENLVE+L KYDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        GGNSESVEQD +HSY MAYGG GFA+SYPLAA LV ILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        Y+Q IFPAM++PDS+KKLH AY+TDPSRALQH+FCYD AR WSVSVSWGYSVQLYPWLAT KEL+TPFLTFQTWK+  NE FTF+TRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FLD AERFGG      RT+T YRK+VE   KEC++ DYA AL+V YFNV+APEFDRRLWRQ
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

XP_023526502.1 uncharacterized protein LOC111789987 [Cucurbita pepo subsp. pepo]1.3e-22380.26Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R  LVIS VASFSLFFYLT  D+  RC GC+G+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTW+ERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KN+TRGFVWLEEKPEFPWP+SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET+ELGL+NVRWFVMGDDDTVFFTENLV+LL KYDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQD +HSY MAYGG GFA+SYPLAA LV ILDGCINRYADMYGSDQKIQGCIS+IGVPLTKELGFHQVDIRGN YG+LAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        Y++ IFP M+ PDS+KKLH AY+TDP RALQH+FCYD A  WSVSVSWGYSVQLYPWLAT KEL+TPFLTFQTWK+  NE FTF+TRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FLD AERFGG      RT+T YRK+VE    EC++ DYA AL+V YFNV+APEFDRRLWRQ
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

XP_038904437.1 uncharacterized protein LOC120090799 [Benincasa hispida]1.2e-22482.51Show/hide
Query:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWW-KNVTRGFVWL
        RP  +  PF+RA L+ISAVASFSLF +LTF+DQ   C GCY + R+SNHRKV+AF AGEQPTN+SH+VFGIGGSVKTWNERRHYCELWW KNVTRGFVWL
Subjt:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWW-KNVTRGFVWL

Query:  EEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSY
        EEKPEFPWPESSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET+ELGLENVRWFVMGDDDTVFFTENLV+LL KYDHNQM+YIGGNSESVEQDV+HSY
Subjt:  EEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSY

Query:  TMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSL
        TMAYGGGGFA+SYPLA  LV ILDGCINRYADMYGSDQKIQGCISEIGVP+TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDYVQ IFP M+QPD+L
Subjt:  TMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSL

Query:  KKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERFGGGGVGG
        KKLHNAY+TDPSRALQH+FCYDT R WSVSVSWGYS+QLYPWL T KELET FLT+QTW++  NEPFTF+TRPVSSDPCERPILYFLD+AER GG     
Subjt:  KKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERFGGGGVGG

Query:  RRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
         RT+T YR+F+EEA   C+R DYAPALAV  FNV+APEFDRRLWRQ
Subjt:  RRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

TrEMBL top hitse value%identityAlignment
A0A1S3B6M6 uncharacterized protein LOC1034863661.4e-22379.18Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP D+F+P +RA+LV+  VASFSLFFYLTFSDQN  C GCY + R+SNHRKV+AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPE+ WPESSPPYRIS DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF +NL+++L +YDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSY MAYGGGGFA+SYPLA  LV ILDGCI+RYA MYGSDQKIQGCI+EIGVPLTKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        YVQ+IFPAM+QPDSLKKL+ AY+TDPSRALQH+FCYDT R WSVSVSWGYSVQLYPWL T KE+ET FLT+QTWK+  NEPFTF+T+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FL++ ER G       RT+T Y+++ EEA   C+R DYAPALAV  FNV+APEFDRRLW Q
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

A0A5D3DN33 Uncharacterized protein1.4e-22379.18Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP D+F+P +RA+LV+  VASFSLFFYLTFSDQN  C GCY + R+SNHRKV+AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPE+ WPESSPPYRIS DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF +NL+++L +YDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSY MAYGGGGFA+SYPLA  LV ILDGCI+RYA MYGSDQKIQGCI+EIGVPLTKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        YVQ+IFPAM+QPDSLKKL+ AY+TDPSRALQH+FCYDT R WSVSVSWGYSVQLYPWL T KE+ET FLT+QTWK+  NEPFTF+T+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FL++ ER G       RT+T Y+++ EEA   C+R DYAPALAV  FNV+APEFDRRLW Q
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

A0A6J1CXD9 uncharacterized protein LOC1110151462.9e-22481.7Show/hide
Query:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQN-YRCLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL
        ++  KVLVLRP D  +PFIRA+ VISAVASFSLFFYLTFSDQN   C+GC+G+LR+SNHRK++A   D GE + TN+SH+VFGIGGSV TWNERRHYCEL
Subjt:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQN-YRCLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL

Query:  WW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG
        WW KNVTRGFVWLEEKP FPWPE+SPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKETFELGLENVRWFVMGDDDTVFFTENLVELL +YDHNQMYYIG 
Subjt:  WW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG

Query:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
        NSESVEQD++HSYTMAYGG GFA+SYPLA  LV ILDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
Subjt:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV

Query:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFL
        QTIFPAMSQPDSLK+L++AY+TDPSRALQHTFCYD AR WSVSVSWGY+VQLYPWLAT K+LETPFLTFQTWK+  NEPF F+TRPVSSDPC+RPIL+FL
Subjt:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFL

Query:  DAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        DAA+R     + GRRTVTRYR++VEE  KECER+DY PAL VR+F+V+APEFDRRLWRQ
Subjt:  DAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

A0A6J1FA50 uncharacterized protein LOC1114421983.4e-22580.91Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R  LVIS VASFSLF YLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPEFPWP+SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET++LGL+NVRWFVMGDDDTVFFTENLV+LL KYDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQD +HSY MAYGG GFA+SYPLAA LV ILDGCINRYADMYGSDQKIQGCIS+IGVPLTKELGFHQVDIRG+ YG+LAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        Y++ IFPAM++PDS+KKLH AY+TDP RALQH+FCYD AR WSVSVSWGYSVQLYPWLAT KEL+TPFLTFQTWK+  NE FTF+TRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FLD AERFGG      RT+TRYRK+VE    EC + DYA AL+V YFNV+APEFDRRLWRQ
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

A0A6J1J3X9 uncharacterized protein LOC1114824428.1e-22781.78Show/hide
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R LLVIS VASFSLFFYLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTW+ERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KN+TRGFVWLEEKPEF W +SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR++KET+ELGL+NVRWFVMGDDDTVFFTENLVE+L KYDHNQMYYI
Subjt:  ELWW-KNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        GGNSESVEQD +HSY MAYGG GFA+SYPLAA LV ILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY
        Y+Q IFPAM++PDS+KKLH AY+TDPSRALQH+FCYD AR WSVSVSWGYSVQLYPWLAT KEL+TPFLTFQTWK+  NE FTF+TRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILY

Query:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
        FLD AERFGG      RT+T YRK+VE   KEC++ DYA AL+V YFNV+APEFDRRLWRQ
Subjt:  FLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37730.1 Protein of unknown function (DUF604)1.6e-15864.27Show/hide
Query:  QPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKP--EFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV
        + T++SH+ FGIGGS++TW +R  Y ELWW+ NVTRGF+WL+E+P     W  +SPPY++S DTS+F+YTCWYG RSAIR+AR+IKETFELGL +VRWF+
Subjt:  QPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKP--EFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV

Query:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF
        MGDDDTVFF +NL+ +L+KYDHNQMYYIGGNSESVEQD++HSY MAYGGGG A+SYPLA  LV +LDGCI+RYA +YGSDQKI+ C+SEIGVPLTKELGF
Subjt:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF

Query:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQ
        HQVDIRGNPYGLLAAHPVAPLV+LHHLDYV  IFP  +Q D+L++L +AY+TDPSR +QH+FC+D  R W VSVSWGY++Q+YP L T KELETPFLTF+
Subjt:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQ

Query:  TWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERFGGGGVGGRRTVTRYRKFVEEA-GKECERADYAPALAVRYFNVAAPEFDRRLWR
        +W++  +EPF+F+TRP+S DPCERP++YFLD     G G     +T+T YRK VE     +C   DY+ A  V + +V+       LW+
Subjt:  TWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERFGGGGVGGRRTVTRYRKFVEEA-GKECERADYAPALAVRYFNVAAPEFDRRLWR

AT3G11420.1 Protein of unknown function (DUF604)1.2e-11848.12Show/hide
Query:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLG-CYGSLRHSNHRKVQAF--DAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKNVTRGFV
        RP D    F R  ++   + S SL    TF   + R     YG    +  +K  A    A   PTN+SH+ F I G+ +TW +R  Y  LWW+N TRGFV
Subjt:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLG-CYGSLRHSNHRKVQAF--DAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKNVTRGFV

Query:  WLEEKPEFPWPES----SPPYRISE-DTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVE
        WL+E  + P   S    S P R+S+   ++F ++     R+A+R+AR+I +++ L L NVRWFVMGDDDTVFFTENLV++LSKYDH QM+YIGGNSESVE
Subjt:  WLEEKPEFPWPES----SPPYRISE-DTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVE

Query:  QDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPA
        QDV+H+Y MA+GGGGFA+S PLAA L   +D C+ RY   YGSDQ+I  CISEIGVP T+E GFHQ+DIRG+PYG LAAHP+APLVSLHHL Y+  +FP 
Subjt:  QDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPA

Query:  MSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERF
         +  +SL+ L   Y  DP+R LQ   C+D  R+WS+S+SWGY++Q+Y +  T  EL TP  TF+TW+S  + PF F TRP+  DPCERP+ YF+D AE  
Subjt:  MSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERF

Query:  GGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ
           G     T T Y    +  G  C + ++     V+   V + + D   W +
Subjt:  GGGGVGGRRTVTRYRKFVEEAGKECERADYAPALAVRYFNVAAPEFDRRLWRQ

AT4G11350.1 Protein of unknown function (DUF604)8.8e-10150.14Show/hide
Query:  VQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEE----KPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETF-
        V+A  A ++ T+++H+VFGI  S K W +R+ Y ++W+K    RG+VWL+E    K E    ES P  RIS DTS F YT   G RSAIR++R++ ET  
Subjt:  VQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEE----KPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETF-

Query:  ---ELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGC
               +NVRWFVMGDDDTVF T+NL+ +L KYDH QMYYIG  SES  Q++I SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD ++Q C
Subjt:  ---ELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGC

Query:  ISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWL
        ++E+GVPLTKE+GFHQ D+ GN +GLLAAHP+ P VS+HHLD V+ IFP M++  ++KKL    + D +  LQ + CYD  + W++SVSWG++VQ++   
Subjt:  ISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWL

Query:  ATPKELETPFLTFQTW-KSYGNEPFTFETRPVSSDPCERPILYFLDAAE
         +P+E+E P  TF  W K      + F TRPVS + C++P ++ + +A+
Subjt:  ATPKELETPFLTFQTW-KSYGNEPFTFETRPVSSDPCERPILYFLDAAE

AT4G23490.1 Protein of unknown function (DUF604)3.4e-10047.88Show/hide
Query:  RHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKPEFPWPESS-----PPYRISEDTSKFNYTCWYGFRSAIRVA
        R  N  + +     ++ T+++H+VFGI  S K W +R+ Y ++W+K    RG+VWL+++ +    +       PP +IS  T+ F YT   G RSA+R++
Subjt:  RHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKPEFPWPESS-----PPYRISEDTSKFNYTCWYGFRSAIRVA

Query:  RVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQK
        R++ ET  LG +NVRWFVMGDDDTVF  +NL+ +L KYDH QMYYIG  SES  Q++  SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD +
Subjt:  RVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQK

Query:  IQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQL
        +Q C++E+GVPLTKELGFHQ D+ GN +GLLAAHPV P VS+HHLD V+ IFP M++  +LKK+    + D +  LQ + CYD  + W++SVSWGY+VQ+
Subjt:  IQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQL

Query:  YPWLATPKELETPFLTFQTW-KSYGNEPFTFETRPVSSDPCERPILYFLDAAE
        +  + +P+E+E P  TF  W K      + F TRPVS +PC++P ++++ + +
Subjt:  YPWLATPKELETPFLTFQTW-KSYGNEPFTFETRPVSSDPCERPILYFLDAAE

AT5G41460.1 Protein of unknown function (DUF604)9.4e-10353.78Show/hide
Query:  TNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKPEFPWPE----SSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV
        T   H+VFGI  S + W +R+ Y ++W+K N  R +VWL EKP     E    S PP +IS DTSKF Y    G RSAIR++R++ ET +LGL++VRWFV
Subjt:  TNMSHLVFGIGGSVKTWNERRHYCELWWK-NVTRGFVWLEEKPEFPWPE----SSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV

Query:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF
        MGDDDTVF  ENL+ +L KYDHNQMYYIG  SES  Q++  SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD ++Q C++E+GVPLTKELGF
Subjt:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF

Query:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQ
        HQ D+ GN +GLLAAHPVAPLV+LHHLD V+ IFP M++ D+LK L    + D +  +Q + CYD  RKW+VSVSWG++VQ++  + + +E+E P  TF 
Subjt:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQ

Query:  TW-KSYGNEPFTFETRPVSSDPCERPILYFL
         W +      + F TRPVS  PC++P ++++
Subjt:  TW-KSYGNEPFTFETRPVSSDPCERPILYFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGATTCAGAATCCATTAGAAGCCTGCAAAGTTTTGGTCTTGCGGCCAACGGATGTTTTTGCTCCTTTCATTAGAGCCCTTCTGGTTATCTCCGCCGTTGCTTCGTT
TTCTCTCTTCTTTTACCTGACTTTTTCCGACCAAAACTACCGTTGTCTTGGCTGCTACGGTTCACTTCGGCACTCAAACCACCGGAAAGTTCAGGCTTTCGATGCCGGAG
AACAGCCGACGAATATGTCCCATCTTGTGTTTGGCATTGGTGGCTCCGTCAAGACATGGAACGAGCGACGCCATTATTGCGAGCTGTGGTGGAAGAATGTTACTCGTGGG
TTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCGGAATCGTCCCCGCCGTACCGAATCTCCGAGGACACCTCGAAATTCAACTACACTTGCTGGTACGGGTTTCG
GTCGGCGATTCGAGTGGCTAGGGTTATCAAGGAGACTTTTGAACTAGGGTTGGAGAATGTGAGGTGGTTCGTGATGGGGGACGATGATACAGTCTTCTTCACAGAGAATT
TGGTGGAGTTATTGAGTAAATACGATCACAACCAAATGTACTATATCGGAGGTAATTCTGAGAGCGTGGAGCAAGATGTAATTCATTCTTACACCATGGCCTACGGCGGC
GGCGGATTCGCTGTTAGCTACCCTCTCGCGGCGGCGCTGGTCGGAATTTTAGACGGCTGCATCAATCGTTATGCTGATATGTATGGCTCCGATCAGAAAATTCAGGGTTG
CATCAGTGAGATCGGCGTTCCCCTCACCAAAGAGCTTGGATTCCACCAGGTGGATATAAGAGGAAACCCATATGGTTTATTAGCTGCTCATCCAGTTGCGCCGTTAGTGT
CGCTCCACCACCTGGACTACGTGCAGACCATATTCCCAGCCATGTCCCAACCCGACTCGCTCAAGAAGCTCCACAACGCCTACCAAACGGACCCGAGTCGAGCCCTTCAG
CACACCTTCTGCTACGACACGGCTCGTAAGTGGTCCGTCTCGGTGTCGTGGGGTTACAGTGTTCAGTTGTATCCATGGCTGGCCACGCCCAAGGAACTCGAGACGCCATT
TCTTACGTTCCAAACCTGGAAGTCATATGGCAACGAGCCCTTCACTTTCGAAACCCGACCCGTAAGTTCGGACCCGTGTGAAAGACCCATTTTGTATTTCTTGGATGCGG
CGGAGAGATTCGGCGGCGGCGGCGTCGGAGGACGGCGGACGGTGACGAGGTACCGGAAATTTGTGGAGGAGGCTGGGAAGGAGTGTGAGCGGGCGGATTACGCTCCTGCA
TTGGCTGTTCGGTATTTTAACGTCGCGGCGCCGGAGTTCGACCGCCGTCTGTGGAGGCAGATCAACACCTCGCCGCTGTTGCCCCCGCCTCGCCGGAAACGCCATTGCTC
TTCGGTGGGTTTCATTCTTCTTCCTCCTCCGTTCAACGTCCAATTGCAGATCTCTCCGTTCGCGAATTTAGTGGCAGATCGTGACCCACGATGCCATTCCAGCAGGCGAC
CCAGGTGCGAATCGAAGGAGTCGGTCTGGTGGGTTTCGTTTCATATCGAGATTCGTGGGTCTGCCGGCATCGAAAAATCGCGTGATCTGTGTGTCTTGAAGAAATGGCTA
ATGGGCGTTTTCAGATCGCGTTGTGTGTCTCCAAAAGCTTGTGTCTCACGCCCAGATCTGTGTGGGTTTCTTGCTACTTGGGCGTTTTCAGATCATTTGAATCTTGAAGA
AATGGGTTGTTGTGATGGTGCTTTGGCGAGAAAGGCTTCACGAACAGGTTGTCTCCGACAGAGGGTGCGAAGAAATCCGAATAGCAGCAACTGCGGCGAGCGTTCGGCGT
GGTTGTGGGTTTTCAAGCTTCAGCGGGTGATCGAGTTTCCACAACGGCGTGGGTCTCGGCTCAACGACAGTCCAGATCGAAGAGAGAGAGAGAAATCTGGTGGCGTCGTG
GGTTTTGTTTCTTCGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGATTCAGAATCCATTAGAAGCCTGCAAAGTTTTGGTCTTGCGGCCAACGGATGTTTTTGCTCCTTTCATTAGAGCCCTTCTGGTTATCTCCGCCGTTGCTTCGTT
TTCTCTCTTCTTTTACCTGACTTTTTCCGACCAAAACTACCGTTGTCTTGGCTGCTACGGTTCACTTCGGCACTCAAACCACCGGAAAGTTCAGGCTTTCGATGCCGGAG
AACAGCCGACGAATATGTCCCATCTTGTGTTTGGCATTGGTGGCTCCGTCAAGACATGGAACGAGCGACGCCATTATTGCGAGCTGTGGTGGAAGAATGTTACTCGTGGG
TTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCGGAATCGTCCCCGCCGTACCGAATCTCCGAGGACACCTCGAAATTCAACTACACTTGCTGGTACGGGTTTCG
GTCGGCGATTCGAGTGGCTAGGGTTATCAAGGAGACTTTTGAACTAGGGTTGGAGAATGTGAGGTGGTTCGTGATGGGGGACGATGATACAGTCTTCTTCACAGAGAATT
TGGTGGAGTTATTGAGTAAATACGATCACAACCAAATGTACTATATCGGAGGTAATTCTGAGAGCGTGGAGCAAGATGTAATTCATTCTTACACCATGGCCTACGGCGGC
GGCGGATTCGCTGTTAGCTACCCTCTCGCGGCGGCGCTGGTCGGAATTTTAGACGGCTGCATCAATCGTTATGCTGATATGTATGGCTCCGATCAGAAAATTCAGGGTTG
CATCAGTGAGATCGGCGTTCCCCTCACCAAAGAGCTTGGATTCCACCAGGTGGATATAAGAGGAAACCCATATGGTTTATTAGCTGCTCATCCAGTTGCGCCGTTAGTGT
CGCTCCACCACCTGGACTACGTGCAGACCATATTCCCAGCCATGTCCCAACCCGACTCGCTCAAGAAGCTCCACAACGCCTACCAAACGGACCCGAGTCGAGCCCTTCAG
CACACCTTCTGCTACGACACGGCTCGTAAGTGGTCCGTCTCGGTGTCGTGGGGTTACAGTGTTCAGTTGTATCCATGGCTGGCCACGCCCAAGGAACTCGAGACGCCATT
TCTTACGTTCCAAACCTGGAAGTCATATGGCAACGAGCCCTTCACTTTCGAAACCCGACCCGTAAGTTCGGACCCGTGTGAAAGACCCATTTTGTATTTCTTGGATGCGG
CGGAGAGATTCGGCGGCGGCGGCGTCGGAGGACGGCGGACGGTGACGAGGTACCGGAAATTTGTGGAGGAGGCTGGGAAGGAGTGTGAGCGGGCGGATTACGCTCCTGCA
TTGGCTGTTCGGTATTTTAACGTCGCGGCGCCGGAGTTCGACCGCCGTCTGTGGAGGCAGATCAACACCTCGCCGCTGTTGCCCCCGCCTCGCCGGAAACGCCATTGCTC
TTCGGTGGGTTTCATTCTTCTTCCTCCTCCGTTCAACGTCCAATTGCAGATCTCTCCGTTCGCGAATTTAGTGGCAGATCGTGACCCACGATGCCATTCCAGCAGGCGAC
CCAGGTGCGAATCGAAGGAGTCGGTCTGGTGGGTTTCGTTTCATATCGAGATTCGTGGGTCTGCCGGCATCGAAAAATCGCGTGATCTGTGTGTCTTGAAGAAATGGCTA
ATGGGCGTTTTCAGATCGCGTTGTGTGTCTCCAAAAGCTTGTGTCTCACGCCCAGATCTGTGTGGGTTTCTTGCTACTTGGGCGTTTTCAGATCATTTGAATCTTGAAGA
AATGGGTTGTTGTGATGGTGCTTTGGCGAGAAAGGCTTCACGAACAGGTTGTCTCCGACAGAGGGTGCGAAGAAATCCGAATAGCAGCAACTGCGGCGAGCGTTCGGCGT
GGTTGTGGGTTTTCAAGCTTCAGCGGGTGATCGAGTTTCCACAACGGCGTGGGTCTCGGCTCAACGACAGTCCAGATCGAAGAGAGAGAGAGAAATCTGGTGGCGTCGTG
GGTTTTGTTTCTTCGATTTGA
Protein sequenceShow/hide protein sequence
MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNYRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKNVTRG
FVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGG
GGFAVSYPLAAALVGILDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQ
HTFCYDTARKWSVSVSWGYSVQLYPWLATPKELETPFLTFQTWKSYGNEPFTFETRPVSSDPCERPILYFLDAAERFGGGGVGGRRTVTRYRKFVEEAGKECERADYAPA
LAVRYFNVAAPEFDRRLWRQINTSPLLPPPRRKRHCSSVGFILLPPPFNVQLQISPFANLVADRDPRCHSSRRPRCESKESVWWVSFHIEIRGSAGIEKSRDLCVLKKWL
MGVFRSRCVSPKACVSRPDLCGFLATWAFSDHLNLEEMGCCDGALARKASRTGCLRQRVRRNPNSSNCGERSAWLWVFKLQRVIEFPQRRGSRLNDSPDRREREKSGGVV
GFVSSI