CuGenDBv2

Spg033460 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg033460
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein of unknown function (DUF604)
Genome location: scaffold5:1799924..1802760
RNA-Seq Expression: Spg033460
Synteny: Spg033460
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domains: IPR006740 - Protein of unknown function DUF604
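
The genome location field follows a `sequence:start..end` pattern. A minimal sketch of splitting it into its parts is given here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold5:1799924..1802760")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end - start + 1
print(seqid, start, end, span)
```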


Homology
GenBank top hits (e value, % identity, alignment)
XP_008442515.1 PREDICTED: uncharacterized protein LOC103486366 [Cucumis melo] | e value: 2.1e-228 | % identity: 79.78
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP D+F+P +RA+LV+  VASFSLFFYLTFSDQN  C GCY + R+SNHRKV+AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKNVTRGFVWLEEKPE+ WPESSPPYRIS DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF +NL+++L +YDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSY MAYGGGGFA+SYPLA  LV +LDGCI+RYA MYGSDQKIQGCI+EIGVPLTKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        YVQ+IFPAM+QPDSLKKL+ AY+TDPSRALQH+FCYDT R WSVSVSWGYSVQLYPWL TAKE+ETAFLT+QTWK+ SNEPFTFDT+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FL++ ER G    R  RT+T Y+++ EEA   C+R DYAPALAV  FNVSAPEFDRRLW Q  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

XP_011652840.2 uncharacterized protein LOC101203954 [Cucumis sativus] | e value: 4.7e-228 | % identity: 79.78
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP DVF+P ++A+LV+S VASFSLFFYLTFSDQN  C GCY + R+SNHRK++AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKNVTRGFVW+EEKPEF WPESSPPYR+S+DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF ENL+++L +YDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSYTMAYGGGGFA+SYPLA  LV +LDGCINRYA MYGSDQKIQGCISEIGVPLTKE GFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        YVQTIFP M+QPDSLKKLH AY+TDPSRALQHTFCYDT   WSVS+SWGYSVQLYP L TAKE+ETAFLT+QTW++ SNEPFTFDT+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FL++AER G    R  +T+T Y+++VEEA   C+R DYAPALAV +FNVSA EFDRRLW Q  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

XP_022935260.1 uncharacterized protein LOC111442198 [Cucurbita moschata] | e value: 8.1e-228 | % identity: 80.65
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R  LVIS VASFSLF YLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPEFPWP+SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET++LGL+NVRWFVMGDDDTVFFTENLV+LL KYDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQD +HSY MAYGG GFA+SYPLAA LV +LDGCINRYADMYGSDQKIQGCIS+IGVPLTKELGFHQVDIRG+ YG+LAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        Y++ IFPAM++PDS+KKLH AY+TDP RALQH+FCYD AR WSVSVSWGYSVQLYPWLATAKEL+T FLTFQTWK+ +NE FTFDTRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FLD AERFGG   R  RT+TRYRK+VE    EC + DYA AL+V YFNVSAPEFDRRLWRQ  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

XP_022983991.1 uncharacterized protein LOC111482442 [Cucurbita maxima] | e value: 5.1e-230 | % identity: 81.72
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R LLVIS VASFSLFFYLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTW+ERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKN+TRGFVWLEEKPEF W +SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR++KET+ELGL+NVRWFVMGDDDTVFFTENLVE+L KYDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        GGNSESVEQD +HSY MAYGG GFA+SYPLAA LV +LDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        Y+Q IFPAM++PDS+KKLH AY+TDPSRALQH+FCYD AR WSVSVSWGYSVQLYPWLATAKEL+T FLTFQTWK+ +NE FTFDTRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FLD AERFGG   R  RT+T YRK+VE   KEC++ DYA AL+V YFNVSAPEFDRRLWRQ  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

XP_038904437.1 uncharacterized protein LOC120090799 [Benincasa hispida] | e value: 1.1e-229 | % identity: 83.11
Query:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWL
        RP  +  PF+RA L+ISAVASFSLF +LTF+DQ   C GCY + R+SNHRKV+AF AGEQPTN+SH+VFGIGGSVKTWNERRHYCELWWKKNVTRGFVWL
Subjt:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWL

Query:  EEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSY
        EEKPEFPWPESSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET+ELGLENVRWFVMGDDDTVFFTENLV+LL KYDHNQM+YIGGNSESVEQDV+HSY
Subjt:  EEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSY

Query:  TMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSL
        TMAYGGGGFA+SYPLA  LV +LDGCINRYADMYGSDQKIQGCISEIGVP+TKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLDYVQ IFP M+QPD+L
Subjt:  TMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSL

Query:  KKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAERFGGGGVRG
        KKLHNAY+TDPSRALQH+FCYDT R WSVSVSWGYS+QLYPWL TAKELETAFLT+QTW++ SNEPFTFDTRPVSSDPCERPILYFLD+AER GG   R 
Subjt:  KKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAERFGGGGVRG

Query:  RRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
         RT+T YR+F+EEA   C+R DYAPALAV  FNVSAPEFDRRLWRQ  ++
Subjt:  RRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
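
In the alignment blocks above, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below counts these for the first 17 columns of the Cucumis melo alignment. The function name `midline_stats` is hypothetical, and the result describes only this short excerpt, not the full-alignment % identity reported for the hit.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned length) for a midline string."""
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 17 columns of the first GenBank hit (query line and midline).
query = "MSIQNPLEACKVLVLRP"
midline = "MSIQN L+  K  VLRP"
assert len(query) == len(midline)

identities, positives, length = midline_stats(midline)
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```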

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B6M6 uncharacterized protein LOC103486366 | e value: 1.0e-228 | % identity: 79.78
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP D+F+P +RA+LV+  VASFSLFFYLTFSDQN  C GCY + R+SNHRKV+AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKNVTRGFVWLEEKPE+ WPESSPPYRIS DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF +NL+++L +YDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSY MAYGGGGFA+SYPLA  LV +LDGCI+RYA MYGSDQKIQGCI+EIGVPLTKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        YVQ+IFPAM+QPDSLKKL+ AY+TDPSRALQH+FCYDT R WSVSVSWGYSVQLYPWL TAKE+ETAFLT+QTWK+ SNEPFTFDT+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FL++ ER G    R  RT+T Y+++ EEA   C+R DYAPALAV  FNVSAPEFDRRLW Q  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

A0A5D3DN33 Uncharacterized protein | e value: 1.0e-228 | % identity: 79.78
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MSIQN L+  K  VLRP D+F+P +RA+LV+  VASFSLFFYLTFSDQN  C GCY + R+SNHRKV+AFDAGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKNVTRGFVWLEEKPE+ WPESSPPYRIS DTSKFNYTCWYGFRSAIRVAR+IKET+E+GLENVRWFVMGDDDTVFF +NL+++L +YDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQDV+HSY MAYGGGGFA+SYPLA  LV +LDGCI+RYA MYGSDQKIQGCI+EIGVPLTKELGFHQ+DIRGNPYG+LAAHP+APLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        YVQ+IFPAM+QPDSLKKL+ AY+TDPSRALQH+FCYDT R WSVSVSWGYSVQLYPWL TAKE+ETAFLT+QTWK+ SNEPFTFDT+PVSSDPC+RPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FL++ ER G    R  RT+T Y+++ EEA   C+R DYAPALAV  FNVSAPEFDRRLW Q  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

A0A6J1CXD9 uncharacterized protein LOC111015146 | e value: 2.5e-227 | % identity: 81.64
Query:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHR-CLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL
        ++  KVLVLRP D  +PFIRA+ VISAVASFSLFFYLTFSDQN   C+GC+G+LR+SNHRK++A   D GE + TN+SH+VFGIGGSV TWNERRHYCEL
Subjt:  LEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHR-CLGCYGSLRHSNHRKVQA--FDAGE-QPTNMSHLVFGIGGSVKTWNERRHYCEL

Query:  WWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG
        WWKKNVTRGFVWLEEKP FPWPE+SPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKETFELGLENVRWFVMGDDDTVFFTENLVELL +YDHNQMYYIG 
Subjt:  WWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGG

Query:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
        NSESVEQD++HSYTMAYGG GFA+SYPLA  LV +LDGCINRYADM  SDQKIQGC+SEIGV +TKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV
Subjt:  NSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYV

Query:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFL
        QTIFPAMSQPDSLK+L++AY+TDPSRALQHTFCYD AR WSVSVSWGY+VQLYPWLATAK+LET FLTFQTWK+ SNEPF FDTRPVSSDPC+RPIL+FL
Subjt:  QTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFL

Query:  DAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        DAA+R     + GRRTVTRYR++VEE +KECER+DY PAL VR+F+VSAPEFDRRLWRQ  ++
Subjt:  DAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

A0A6J1FA50 uncharacterized protein LOC111442198 | e value: 3.9e-228 | % identity: 80.65
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R  LVIS VASFSLF YLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTWNERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWW KNVTRGFVWLEEKPEFPWP+SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR+IKET++LGL+NVRWFVMGDDDTVFFTENLV+LL KYDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        G NSESVEQD +HSY MAYGG GFA+SYPLAA LV +LDGCINRYADMYGSDQKIQGCIS+IGVPLTKELGFHQVDIRG+ YG+LAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        Y++ IFPAM++PDS+KKLH AY+TDP RALQH+FCYD AR WSVSVSWGYSVQLYPWLATAKEL+T FLTFQTWK+ +NE FTFDTRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FLD AERFGG   R  RT+TRYRK+VE    EC + DYA AL+V YFNVSAPEFDRRLWRQ  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

A0A6J1J3X9 uncharacterized protein LOC111482442 | e value: 2.4e-230 | % identity: 81.72
Query:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC
        MS QN L+A K +V RP DVFA F+R LLVIS VASFSLFFYLT  D+  RC GCYG+LR SNHR+V+AF AGEQPTN+SHLVFGIGGSVKTW+ERRHYC
Subjt:  MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYC

Query:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI
        ELWWKKN+TRGFVWLEEKPEF W +SSPPYRIS+DTS+FNYTCWYGFRSAIRVAR++KET+ELGL+NVRWFVMGDDDTVFFTENLVE+L KYDHNQMYYI
Subjt:  ELWWKKNVTRGFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYI

Query:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD
        GGNSESVEQD +HSY MAYGG GFA+SYPLAA LV +LDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGN YGLLAAHPVAPLVSLHHLD
Subjt:  GGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLD

Query:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY
        Y+Q IFPAM++PDS+KKLH AY+TDPSRALQH+FCYD AR WSVSVSWGYSVQLYPWLATAKEL+T FLTFQTWK+ +NE FTFDTRPVSS+PCERPILY
Subjt:  YVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILY

Query:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        FLD AERFGG   R  RT+T YRK+VE   KEC++ DYA AL+V YFNVSAPEFDRRLWRQ  ++
Subjt:  FLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G37730.1 Protein of unknown function (DUF604) | e value: 3.1e-161 | % identity: 64.47
Query:  QPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKP--EFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV
        + T++SH+ FGIGGS++TW +R  Y ELWW+ NVTRGF+WL+E+P     W  +SPPY++S DTS+F+YTCWYG RSAIR+AR+IKETFELGL +VRWF+
Subjt:  QPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKP--EFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV

Query:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF
        MGDDDTVFF +NL+ +L+KYDHNQMYYIGGNSESVEQD++HSY MAYGGGG A+SYPLA  LV +LDGCI+RYA +YGSDQKI+ C+SEIGVPLTKELGF
Subjt:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF

Query:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQ
        HQVDIRGNPYGLLAAHPVAPLV+LHHLDYV  IFP  +Q D+L++L +AY+TDPSR +QH+FC+D  R W VSVSWGY++Q+YP L TAKELET FLTF+
Subjt:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQ

Query:  TWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAERFGGGGVRGRRTVTRYRKFVEEAEK-ECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
        +W++ S+EPF+FDTRP+S DPCERP++YFLD     G G     +T+T YRK VE  E  +C   DY+ A  V + +VS       LW+   ++
Subjt:  TWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAERFGGGGVRGRRTVTRYRKFVEEAEK-ECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

AT3G11420.1 Protein of unknown function (DUF604) | e value: 4.3e-118 | % identity: 47.82
Query:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLG-CYGSLRHSNHRKVQAF--DAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGF
        RP D    F R  ++   + S SL    TF   + R     YG    +  +K  A    A   PTN+SH+ F I G+ +TW +R  Y  LWW +N TRGF
Subjt:  RPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLG-CYGSLRHSNHRKVQAF--DAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGF

Query:  VWLEEKPEFPWPES----SPPYRISE-DTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESV
        VWL+E  + P   S    S P R+S+   ++F ++     R+A+R+AR+I +++ L L NVRWFVMGDDDTVFFTENLV++LSKYDH QM+YIGGNSESV
Subjt:  VWLEEKPEFPWPES----SPPYRISE-DTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESV

Query:  EQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFP
        EQDV+H+Y MA+GGGGFA+S PLAA L   +D C+ RY   YGSDQ+I  CISEIGVP T+E GFHQ+DIRG+PYG LAAHP+APLVSLHHL Y+  +FP
Subjt:  EQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFP

Query:  AMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAER
          +  +SL+ L   Y  DP+R LQ   C+D  R+WS+S+SWGY++Q+Y +  TA EL T   TF+TW+S S+ PF F+TRP+  DPCERP+ YF+D AE 
Subjt:  AMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAER

Query:  FGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK
             VR   T T Y    ++    C + ++     V+   V++ + D   W +  ++
Subjt:  FGGGGVRGRRTVTRYRKFVEEAEKECERADYAPALAVRYFNVSAPEFDRRLWRQVNKK

AT4G11350.1 Protein of unknown function (DUF604) | e value: 4.0e-100 | % identity: 49.57
Query:  VQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEE----KPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETF-
        V+A  A ++ T+++H+VFGI  S K W +R+ Y ++W+K    RG+VWL+E    K E    ES P  RIS DTS F YT   G RSAIR++R++ ET  
Subjt:  VQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEE----KPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETF-

Query:  ---ELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGC
               +NVRWFVMGDDDTVF T+NL+ +L KYDH QMYYIG  SES  Q++I SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD ++Q C
Subjt:  ---ELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGC

Query:  ISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWL
        ++E+GVPLTKE+GFHQ D+ GN +GLLAAHP+ P VS+HHLD V+ IFP M++  ++KKL    + D +  LQ + CYD  + W++SVSWG++VQ++   
Subjt:  ISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWL

Query:  ATAKELETAFLTFQTW-KSYSNEPFTFDTRPVSSDPCERPILYFLDAAE
         + +E+E    TF  W K      + F+TRPVS + C++P ++ + +A+
Subjt:  ATAKELETAFLTFQTW-KSYSNEPFTFDTRPVSSDPCERPILYFLDAAE

AT4G23490.1 Protein of unknown function (DUF604) | e value: 2.0e-99 | % identity: 47.31
Query:  RHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKPEFPWPESS-----PPYRISEDTSKFNYTCWYGFRSAIRVA
        R  N  + +     ++ T+++H+VFGI  S K W +R+ Y ++W+K    RG+VWL+++ +    +       PP +IS  T+ F YT   G RSA+R++
Subjt:  RHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKPEFPWPESS-----PPYRISEDTSKFNYTCWYGFRSAIRVA

Query:  RVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQK
        R++ ET  LG +NVRWFVMGDDDTVF  +NL+ +L KYDH QMYYIG  SES  Q++  SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD +
Subjt:  RVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQK

Query:  IQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQL
        +Q C++E+GVPLTKELGFHQ D+ GN +GLLAAHPV P VS+HHLD V+ IFP M++  +LKK+    + D +  LQ + CYD  + W++SVSWGY+VQ+
Subjt:  IQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQL

Query:  YPWLATAKELETAFLTFQTW-KSYSNEPFTFDTRPVSSDPCERPILYFLDAAE
        +  + + +E+E    TF  W K      + F+TRPVS +PC++P ++++ + +
Subjt:  YPWLATAKELETAFLTFQTW-KSYSNEPFTFDTRPVSSDPCERPILYFLDAAE

AT5G41460.1 Protein of unknown function (DUF604) | e value: 1.3e-103 | % identity: 53.78
Query:  TNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKPEFPWPE----SSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV
        T   H+VFGI  S + W +R+ Y ++W+K N  R +VWL EKP     E    S PP +IS DTSKF Y    G RSAIR++R++ ET +LGL++VRWFV
Subjt:  TNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTRGFVWLEEKPEFPWPE----SSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFV

Query:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF
        MGDDDTVF  ENL+ +L KYDHNQMYYIG  SES  Q++  SY MAYGGGGFA+SYPLA AL  + D CI RY  +YGSD ++Q C++E+GVPLTKELGF
Subjt:  MGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYGGGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGF

Query:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQ
        HQ D+ GN +GLLAAHPVAPLV+LHHLD V+ IFP M++ D+LK L    + D +  +Q + CYD  RKW+VSVSWG++VQ++  + +A+E+E    TF 
Subjt:  HQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRALQHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQ

Query:  TW-KSYSNEPFTFDTRPVSSDPCERPILYFL
         W +      + F+TRPVS  PC++P ++++
Subjt:  TW-KSYSNEPFTFDTRPVSSDPCERPILYFL


Sequences
CDS sequence
ATGTCGATTCAGAATCCATTAGAAGCCTGCAAAGTTTTGGTCTTGCGGCCAACGGATGTTTTTGCTCCTTTCATTAGAGCCCTTCTGGTTATCTCCGCCGTCGCTTCGTT
TTCTCTCTTCTTTTACCTAACTTTTTCCGACCAAAACCACCGTTGTCTTGGCTGCTACGGTTCACTTCGGCACTCAAACCACCGGAAAGTTCAGGCTTTCGATGCCGGAG
AACAGCCGACGAATATGTCCCATCTTGTGTTTGGCATTGGTGGCTCCGTCAAGACGTGGAACGAGCGACGCCATTATTGCGAGCTGTGGTGGAAGAAAAATGTTACTCGT
GGGTTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCGGAATCGTCCCCGCCGTACCGAATTTCCGAGGACACCTCAAAATTCAACTACACTTGCTGGTACGGGTT
TCGGTCGGCGATTCGAGTGGCTAGGGTTATCAAGGAGACTTTTGAACTAGGGTTGGAGAATGTGAGGTGGTTCGTGATGGGGGACGATGATACAGTTTTCTTCACAGAGA
ATTTGGTAGAGTTATTGAGTAAATACGATCACAATCAAATGTACTATATCGGAGGTAATTCTGAGAGCGTGGAGCAAGATGTAATTCATTCTTACACCATGGCCTACGGC
GGCGGCGGATTCGCCGTTAGCTACCCTCTCGCGGCGGCGCTGGTCGGAGTTTTAGACGGCTGCATCAATCGTTATGCTGATATGTATGGCTCCGATCAGAAAATTCAGGG
TTGCATCAGTGAGATCGGCGTTCCCCTCACCAAAGAGCTTGGATTCCACCAGGTGGATATAAGAGGAAACCCATATGGTTTATTAGCTGCTCATCCAGTTGCGCCGTTAG
TGTCGCTCCACCACCTGGACTACGTGCAGACCATATTCCCAGCCATGTCCCAACCCGACTCGCTCAAGAAGCTCCACAACGCCTACCAAACGGACCCGAGTCGAGCCCTT
CAGCACACCTTCTGCTACGACACGGCTCGTAAGTGGTCCGTCTCGGTGTCGTGGGGTTACAGTGTTCAGTTGTATCCATGGCTGGCCACTGCCAAGGAACTCGAAACGGC
ATTTCTTACGTTCCAAACGTGGAAGTCATATAGCAATGAGCCCTTCACTTTCGATACCCGACCCGTAAGTTCGGACCCGTGTGAAAGACCCATTTTGTATTTCTTGGATG
CGGCGGAGAGATTCGGCGGCGGCGGCGTCCGAGGACGGCGGACGGTGACGAGGTACCGGAAATTTGTGGAGGAGGCTGAGAAGGAGTGTGAGCGGGCGGATTACGCTCCT
GCATTGGCTGTTCGGTATTTTAACGTCTCGGCGCCGGAGTTCGACCGCCGTCTGTGGAGGCAGGTAAACAAGAAATTTGTTTAA
mRNA sequence
ATGTCGATTCAGAATCCATTAGAAGCCTGCAAAGTTTTGGTCTTGCGGCCAACGGATGTTTTTGCTCCTTTCATTAGAGCCCTTCTGGTTATCTCCGCCGTCGCTTCGTT
TTCTCTCTTCTTTTACCTAACTTTTTCCGACCAAAACCACCGTTGTCTTGGCTGCTACGGTTCACTTCGGCACTCAAACCACCGGAAAGTTCAGGCTTTCGATGCCGGAG
AACAGCCGACGAATATGTCCCATCTTGTGTTTGGCATTGGTGGCTCCGTCAAGACGTGGAACGAGCGACGCCATTATTGCGAGCTGTGGTGGAAGAAAAATGTTACTCGT
GGGTTTGTTTGGCTTGAAGAGAAGCCTGAATTTCCCTGGCCGGAATCGTCCCCGCCGTACCGAATTTCCGAGGACACCTCAAAATTCAACTACACTTGCTGGTACGGGTT
TCGGTCGGCGATTCGAGTGGCTAGGGTTATCAAGGAGACTTTTGAACTAGGGTTGGAGAATGTGAGGTGGTTCGTGATGGGGGACGATGATACAGTTTTCTTCACAGAGA
ATTTGGTAGAGTTATTGAGTAAATACGATCACAATCAAATGTACTATATCGGAGGTAATTCTGAGAGCGTGGAGCAAGATGTAATTCATTCTTACACCATGGCCTACGGC
GGCGGCGGATTCGCCGTTAGCTACCCTCTCGCGGCGGCGCTGGTCGGAGTTTTAGACGGCTGCATCAATCGTTATGCTGATATGTATGGCTCCGATCAGAAAATTCAGGG
TTGCATCAGTGAGATCGGCGTTCCCCTCACCAAAGAGCTTGGATTCCACCAGGTGGATATAAGAGGAAACCCATATGGTTTATTAGCTGCTCATCCAGTTGCGCCGTTAG
TGTCGCTCCACCACCTGGACTACGTGCAGACCATATTCCCAGCCATGTCCCAACCCGACTCGCTCAAGAAGCTCCACAACGCCTACCAAACGGACCCGAGTCGAGCCCTT
CAGCACACCTTCTGCTACGACACGGCTCGTAAGTGGTCCGTCTCGGTGTCGTGGGGTTACAGTGTTCAGTTGTATCCATGGCTGGCCACTGCCAAGGAACTCGAAACGGC
ATTTCTTACGTTCCAAACGTGGAAGTCATATAGCAATGAGCCCTTCACTTTCGATACCCGACCCGTAAGTTCGGACCCGTGTGAAAGACCCATTTTGTATTTCTTGGATG
CGGCGGAGAGATTCGGCGGCGGCGGCGTCCGAGGACGGCGGACGGTGACGAGGTACCGGAAATTTGTGGAGGAGGCTGAGAAGGAGTGTGAGCGGGCGGATTACGCTCCT
GCATTGGCTGTTCGGTATTTTAACGTCTCGGCGCCGGAGTTCGACCGCCGTCTGTGGAGGCAGGTAAACAAGAAATTTGTTTAA
Protein sequence
MSIQNPLEACKVLVLRPTDVFAPFIRALLVISAVASFSLFFYLTFSDQNHRCLGCYGSLRHSNHRKVQAFDAGEQPTNMSHLVFGIGGSVKTWNERRHYCELWWKKNVTR
GFVWLEEKPEFPWPESSPPYRISEDTSKFNYTCWYGFRSAIRVARVIKETFELGLENVRWFVMGDDDTVFFTENLVELLSKYDHNQMYYIGGNSESVEQDVIHSYTMAYG
GGGFAVSYPLAAALVGVLDGCINRYADMYGSDQKIQGCISEIGVPLTKELGFHQVDIRGNPYGLLAAHPVAPLVSLHHLDYVQTIFPAMSQPDSLKKLHNAYQTDPSRAL
QHTFCYDTARKWSVSVSWGYSVQLYPWLATAKELETAFLTFQTWKSYSNEPFTFDTRPVSSDPCERPILYFLDAAERFGGGGVRGRRTVTRYRKFVEEAEKECERADYAP
ALAVRYFNVSAPEFDRRLWRQVNKKFV