; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G09150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G09150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationClcChr01:9998842..10016259
RNA-Seq ExpressionClc01G09150
SyntenyClc01G09150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022993223.1 uncharacterized protein LOC111489311 isoform X1 [Cucurbita maxima]1.0e-27785.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

XP_022993225.1 uncharacterized protein LOC111489311 isoform X2 [Cucurbita maxima]1.0e-27785.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

XP_022993227.1 uncharacterized protein LOC111489311 isoform X3 [Cucurbita maxima]1.0e-27785.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

XP_022993228.1 uncharacterized protein LOC111489311 isoform X4 [Cucurbita maxima]1.0e-27785.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

XP_038874728.1 uncharacterized protein LOC120067269 [Benincasa hispida]9.8e-28484.81Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDN+S+G RL IENP FEN M   VLDRVRVESTS LSGTL DGVD FASAGVAVTKVKNEMF+DF+EDLDHV LI+RLRMLLSRRALG TN HVE GSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSGELL C LKQREKSMFA EELM+  NVLHDRT SHAPRLCSPSVVCSPNATLP SYFSSNH+ NKSTESGD+MELK+DKI   EKVAT+L S+PLTD
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVPKANLLSSTKVKDEPY HVDDCN+   D  NV SNTV IKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVF FTSMD EHPKPSDPGCSILV EP S
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISR----------------------LFSQR
         MNIK RRK KKTATNSIETALEEDAPGLLQILVDKG+Q+DEIKLYGEIE+DDDLDESFSED FGELEDVISR                      LFSQR
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISR----------------------LFSQR

Query:  HSFLKFPSIRCMKNSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSC
        HSFLKFPSIRCMK+SRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYG+ATYFFELVDS+P+NWQIKRLVIAMKLTSC
Subjt:  HSFLKFPSIRCMKNSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSC

Query:  SRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        SRISLLENRPLLVGEDLTEGEA VLL+YGW PNSGLGTMLNYR RVVHDRNNEDISEWRSKIGKLLM+GYNGGALVQENTSKKVAEYSS QTTQVKLEL
Subjt:  SRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

TrEMBL top hitse value%identityAlignment
A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X12.5e-27785.29Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTKVKDEPYDH + C++   DM NV+SNT+SIKSETT+PDE YENK+D+M LQDRMKFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRCMK SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

A0A6J1JS72 uncharacterized protein LOC111489311 isoform X35.1e-27885.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

A0A6J1JY04 uncharacterized protein LOC111489311 isoform X25.1e-27885.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

A0A6J1JZL8 uncharacterized protein LOC111489311 isoform X15.1e-27885.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

A0A6J1K1L2 uncharacterized protein LOC111489311 isoform X45.1e-27885.12Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VD FA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD
        +PSG+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKIY  +KVATELGSR LT+
Subjt:  MPSGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTKVKDEPYDH + C++   DM NV+ NT+S+KSETT+PDE +ENK+D+M LQDRMKFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKVKDEPYDHVDDCNM---DMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL
          N KRRRK KKTATNSIETALEEDAPGLLQILV+KGIQ+DEIKLYGE ESDDDLDES SED F ELEDVI+RLF QRHSFLKFPS IRC+K SRASYCL
Subjt:  LMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPS-IRCMKNSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+S+PI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNYR RVVHDR+NEDISEWRSKIGKLLM+GYNGGALV ENT KKVAEYSSSQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGALVQENTSKKVAEYSSSQTTQVKLEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16610.1 unknown protein1.3e-9545.36Show/hide
Query:  EGGSGMPSGELLQCFLKQREKSMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTE--SGDNMELKEDKIYLMEKVATE
        E G    SG     FL++ +  +  +  +  ++G+ L    ES  P     S   SP A+LP    SSN    K  +    + + L E++I   +     
Subjt:  EGGSGMPSGELLQCFLKQREKSMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTE--SGDNMELKEDKIYLMEKVATE

Query:  LGSRPLTDHVPKANLLS-STKVKDEPYDH---VDDCNMDMKNV---FSNTVS-------IKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDS
        L    + D+  K  + S    VK E   H   +D+  +D   +    +   S       +K+E     E  E+ +D+M+L DR+K  S    F  +    
Subjt:  LGSRPLTDHVPKANLLS-STKVKDEPYDH---VDDCNMDMKNV---FSNTVS-------IKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDS

Query:  EHPKPSDPGCSILVSEPASLMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSF
        +   PS         E      + R  K KKTAT+SIETALEEDAPGLLQ+L+ +G+ +DE++LYG    D   D+S   + F ELEDVIS+LF +R + 
Subjt:  EHPKPSDPGCSILVSEPASLMNIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSF

Query:  LKFPSIRCMKNSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRI
         K  +    K SR SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV+AMKL SC R 
Subjt:  LKFPSIRCMKNSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRI

Query:  SLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNE-DISEWRSKIGKLLMNGYNGGALV
         L+EN+PLLVGED+T GEA VL+ YGW  N+GLGTMLNYRDRV HDR  +   SEWRSKI +LL++GYN G +V
Subjt:  SLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNE-DISEWRSKIGKLLMNGYNGGALV

AT5G16610.2 unknown protein2.1e-9841.95Show/hide
Query:  SGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHV---------------------EGGSGMPSGELLQCFLKQREK
        SG +    +I      AV         +  +DL+H+ L ER +MLL R A+     +V                     E G    SG     FL++ + 
Subjt:  SGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHV---------------------EGGSGMPSGELLQCFLKQREK

Query:  SMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTE--SGDNMELKEDKIYLMEKVATELGSRPLTDHVPKANLLS-STK
         +  +  +  ++G+ L    ES  P     S   SP A+LP    SSN    K  +    + + L E++I   +     L    + D+  K  + S    
Subjt:  SMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTE--SGDNMELKEDKIYLMEKVATELGSRPLTDHVPKANLLS-STK

Query:  VKDEPYDH---VDDCNMDMKNV---FSNTVS-------IKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLM
        VK E   H   +D+  +D   +    +   S       +K+E     E  E+ +D+M+L DR+K  S    F  +    +   PS         E     
Subjt:  VKDEPYDH---VDDCNMDMKNV---FSNTVS-------IKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLM

Query:  NIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPSIRCMKNSRASYCLACL
         + R  K KKTAT+SIETALEEDAPGLLQ+L+ +G+ +DE++LYG    D   D+S   + F ELEDVIS+LF +R +  K  +    K SR SYCL CL
Subjt:  NIKRRRKWKKTATNSIETALEEDAPGLLQILVDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPSIRCMKNSRASYCLACL

Query:  VSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGV
         SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV+AMKL SC R  L+EN+PLLVGED+T GEA V
Subjt:  VSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGV

Query:  LLSYGWTPNSGLGTMLNYRDRVVHDRNNE-DISEWRSKIGKLLMNGYNGGALV
        L+ YGW  N+GLGTMLNYRDRV HDR  +   SEWRSKI +LL++GYN G +V
Subjt:  LLSYGWTPNSGLGTMLNYRDRVVHDRNNE-DISEWRSKIGKLLMNGYNGGALV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAACTATAGTGAGGGGGCTCGATTAATTATAGAGAACCCGGCTTTTGAGAATCCAATGCCACCTAACGTTCTTGATCGGGTAAGGGTGGAATCCACAAGCATCTT
ATCAGGTACCTTGGCAGATGGTGTAGATATCTTTGCTTCTGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTCGATGAAGATCTTGATCATGTTT
TATTGATAGAGCGACTAAGGATGCTGCTTTCAAGGCGAGCATTGGGTTCGACAAATCAACATGTGGAGGGTGGTTCTGGTATGCCGTCGGGAGAACTTCTACAATGCTTC
CTCAAACAGAGGGAGAAGTCCATGTTTGCTAGTGAAGAACTGATGAAAAATGGAAATGTGTTGCATGATAGAACTGAAAGTCATGCTCCTCGTCTTTGCAGCCCTTCAGT
AGTTTGTTCACCTAATGCAACTCTTCCGGGATCTTATTTCTCAAGCAATCATACTTTGAACAAATCAACTGAATCAGGCGACAATATGGAACTTAAAGAAGATAAGATCT
ACTTAATGGAAAAGGTAGCTACAGAATTAGGATCACGGCCTTTGACTGATCATGTTCCTAAAGCAAATTTACTGAGTTCCACGAAAGTGAAGGATGAACCTTATGATCAT
GTGGATGACTGCAACATGGATATGAAAAATGTCTTCAGCAACACTGTGTCAATAAAGAGTGAAACAACCATTCCCGATGAACATTATGAAAATAAGTTAGACAATATGCG
ATTGCAAGACCGAATGAAGTTTTTCTCTTCTCGGAAGGTTTTTGATTTTACATCTATGGATTCTGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTTCAG
AACCTGCTAGTTTAATGAACATTAAACGGAGACGCAAATGGAAGAAAACTGCCACGAATTCAATTGAAACAGCACTGGAGGAGGATGCTCCTGGCCTTCTCCAGATACTT
GTTGACAAAGGCATACAAATTGATGAGATCAAGCTTTATGGGGAGATAGAAAGTGATGATGATCTAGATGAGTCTTTTAGTGAAGACAGGTTTGGGGAGCTTGAAGATGT
GATATCAAGGCTTTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCCTCTATAAGATGCATGAAAAATTCAAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTCATTG
AGCAGACGAGATATCTTCATTTCCGAAACTGGCCTGTCGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTGATGGAA
CGCCCTGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTGGATTCCGTACCCATCAACTGGCAGATAAAGCGGTTGGTGATTGCTATGAAGCTTACGAGTTGTAGCAG
AATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCGAAGGTGAGGCAGGGGTTTTATTGAGCTATGGATGGACGCCAAATAGTGGCTTGGGTACAA
TGCTGAACTACCGTGACAGAGTCGTTCACGATCGGAATAATGAGGACATCTCGGAATGGAGATCAAAAATAGGGAAGCTACTGATGAATGGTTATAATGGAGGAGCTCTT
GTGCAAGAAAATACTTCAAAGAAGGTTGCAGAATACAGCAGTTCTCAAACCACACAAGTTAAGCTGGAACTCTGA
mRNA sequenceShow/hide mRNA sequence
GTAATGACTTTTAGTTTGATTGTGTTGGAAGAATGAAGTTATATTGTCATTTATTAAAAAGAAACTGCTTGAATTTCTTTATTCACCATAAACTCATTCAATATAAAACA
TGGATTGTTTCCAAATTGATGACTATAATTTAAAGAGGAACTGATCAATGGATTATGCTGAGGTTGTTGTGATTATGAGTAGGTGTTGTAATTGCTCATTTAGGACGTGA
TATTTTTTTACTGCCGCTGATTGACCTTACTACACTATCTTCGGTTGATACTATTGCCCAAATATGTAGGAATTTGGTTGCATCTTAGTGAGTTGTTGTGGTGGTGATTG
ATCCATTTTCTGTTGCATTGTTTGTTGAAAAAAATAAATATTTCTAGACTTCTTTGTAATAAACGTTGTGAGAACTTGTATGTTTATATGTTTTTAGAACTTTGCGAGAG
TGTTAATACGAGAGATTTCTAAGTAGGCAATCTATTTTAAAATGTGCATCAATCATACCTATGTTCCTATGCCTAATAATTATTCTATTAGTCCTTTTTTTGCTAATTTT
TGAATGCAGTTTTATCTATTTGCATTTTGCACGATACTTTTGAAAATTGATGTGTAGAATTTGATTGATAACTACAGAAAAGTAGCAAAAAATAGGTATCACTATGAGGA
ATTTAATCTGAGATGTTTTTGCCTTTGGGTCATGTCCTATATGAGAAGAGAATCACCAAACTCATTAATGTTGGAAAGGACAAGAAAGCACTATAGTTAAGAGCAAGTTG
GAAACTCACAAAGAGCTAAGTAGAAGTGAGAACATGTTCAACGTTCCCAGCAATATGAGGTACCACTACGTAGATGACTTTCAGTTTCCCTCATTATGATTGAATCTTTT
ATCCTCAAGTCCCTTTTGACATGCTTACGTGGGATGAATTATGATGGAATTAGAACTTATATATTCCCACATATCAAATTTGTTGTATATAAAAATAATTTGTTCGTTTT
CTTCTCATTTTCCCCCTCGTTTGATACTAAAACAACAGCGAAAGAATACATTTTGTGATAACCATCCGATTCTTTGTGTCAAAATATGTCAAAATACGATTAGTTGGTGA
TCGAAATCATGCGTAATGACAGGTTCATAAATATAACTCTATCAATATAATCATTTTATTTCACATTTATCGAATTTAATCGACAATGTTAATGTCATAAATATGCATTG
TCCGTCAACATTTAGTTAGTTGTGTGAAGTTGATACATATGTTACTTGTTTCATCGAGATAGTCTGATGCTGTCAAGGTAGACATATGTGTGAAAGAAACCTTAGATTGC
TAGTTGTGAACTTGTTGAAGTGTTATCTTAAATATAAACATATATTTTCCACACGAGTTCTTGAAGTTTTATCTTATAATTTTTAGGCTCCCTGGTTTCCATGGTGCTGA
TGATAGGTGCTATAGGTAATTTTAGATGGGAACTTCGATGTTTTGAGGCATCATACTTGTAAAAAAACCATTAGGAACTAAAGTACCATTTTAATGTTGATATGATTGTT
GCCTTTTTAATGTACATATATGTTGATATTTATGTACACCTTTTCAACATTTTTTGGATTCATTGTCTTGCACATGTTGATGTACTCTGCAAATGATATATACTTTTGGA
GGTATTACAGGTTGTTCTTATAATATATTGAGAATGCATTTGAATACCAGAAGACCTTGGACGATGGTGGTGGTAGAAAAAAGCAGCAAAGTCGGGGAGAATTCACTCTT
CCATCATGAATATAAACATTAGAAGCTTCTTGCGGAAATGCTCATATACAAGGTCATCTTATGGACAACTATAGTGAGGGGGCTCGATTAATTATAGAGAACCCGGCTTT
TGAGAATCCAATGCCACCTAACGTTCTTGATCGGGTAAGGGTGGAATCCACAAGCATCTTATCAGGTACCTTGGCAGATGGTGTAGATATCTTTGCTTCTGCTGGTGTGG
CTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTCGATGAAGATCTTGATCATGTTTTATTGATAGAGCGACTAAGGATGCTGCTTTCAAGGCGAGCATTGGGTTCG
ACAAATCAACATGTGGAGGGTGGTTCTGGTATGCCGTCGGGAGAACTTCTACAATGCTTCCTCAAACAGAGGGAGAAGTCCATGTTTGCTAGTGAAGAACTGATGAAAAA
TGGAAATGTGTTGCATGATAGAACTGAAAGTCATGCTCCTCGTCTTTGCAGCCCTTCAGTAGTTTGTTCACCTAATGCAACTCTTCCGGGATCTTATTTCTCAAGCAATC
ATACTTTGAACAAATCAACTGAATCAGGCGACAATATGGAACTTAAAGAAGATAAGATCTACTTAATGGAAAAGGTAGCTACAGAATTAGGATCACGGCCTTTGACTGAT
CATGTTCCTAAAGCAAATTTACTGAGTTCCACGAAAGTGAAGGATGAACCTTATGATCATGTGGATGACTGCAACATGGATATGAAAAATGTCTTCAGCAACACTGTGTC
AATAAAGAGTGAAACAACCATTCCCGATGAACATTATGAAAATAAGTTAGACAATATGCGATTGCAAGACCGAATGAAGTTTTTCTCTTCTCGGAAGGTTTTTGATTTTA
CATCTATGGATTCTGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTTCAGAACCTGCTAGTTTAATGAACATTAAACGGAGACGCAAATGGAAGAAAACT
GCCACGAATTCAATTGAAACAGCACTGGAGGAGGATGCTCCTGGCCTTCTCCAGATACTTGTTGACAAAGGCATACAAATTGATGAGATCAAGCTTTATGGGGAGATAGA
AAGTGATGATGATCTAGATGAGTCTTTTAGTGAAGACAGGTTTGGGGAGCTTGAAGATGTGATATCAAGGCTTTTTTCTCAACGCCATTCCTTTTTGAAGTTTCCCTCTA
TAAGATGCATGAAAAATTCAAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTCATTGAGCAGACGAGATATCTTCATTTCCGAAACTGGCCTGTCGAATGGGGGTGG
TGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTGATGGAACGCCCTGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTGGATTCCGT
ACCCATCAACTGGCAGATAAAGCGGTTGGTGATTGCTATGAAGCTTACGAGTTGTAGCAGAATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCG
AAGGTGAGGCAGGGGTTTTATTGAGCTATGGATGGACGCCAAATAGTGGCTTGGGTACAATGCTGAACTACCGTGACAGAGTCGTTCACGATCGGAATAATGAGGACATC
TCGGAATGGAGATCAAAAATAGGGAAGCTACTGATGAATGGTTATAATGGAGGAGCTCTTGTGCAAGAAAATACTTCAAAGAAGGTTGCAGAATACAGCAGTTCTCAAAC
CACACAAGTTAAGCTGGAACTCTGATTGCTCTTAGTTTCTCATAGTTTGCTGTTTCCGCACAATTTGTACAAATATTCCATTTGCTGTGAAAGTTTAGAATAATTTCGAT
CTGTCCAAAGCTTGTTCATTAATCTGAAGTGAAGAATAATTAGCTTTCATTTTTATATGCATAAAATGACTCAGCTATATAGTTGGAAGGTGACTATTAGTTTATGTTAA
TTTTAGGGGTGAAAGTGTGAATTGAAAAACCGATTAAACCGTCCAAACCAAACTGCAAATTGTTGGCAGTTCGTTTTCAGATTGGGTTAGGTGGTCGGCATGGGTTTTGA
GGAAGACAGAGTTGTGGTGGTCGGCTTTGGGCAGCAATAGTAGCACACTGATACAAACTCAAAGCTGACTGATACTACATAATAAGAAATCCATCTTTTTCCCAAGTGCA
TTGTGCGGTCGATTGGTGAAAGGAGCGCCCGAAAACCTTTATGTGGTGTTTGGGATAAGGAGTGGAATAGTGAGGAGTAAGGAGTTATGTCGTATAGGGAGTTGTGAAAT
CTTGGAACCCACA
Protein sequenceShow/hide protein sequence
MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDIFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPSGELLQCF
LKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKIYLMEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDH
VDDCNMDMKNVFSNTVSIKSETTIPDEHYENKLDNMRLQDRMKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIKRRRKWKKTATNSIETALEEDAPGLLQIL
VDKGIQIDEIKLYGEIESDDDLDESFSEDRFGELEDVISRLFSQRHSFLKFPSIRCMKNSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVME
RPEYGYATYFFELVDSVPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYRDRVVHDRNNEDISEWRSKIGKLLMNGYNGGAL
VQENTSKKVAEYSSSQTTQVKLEL