; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G009190 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G009190
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationCicolChr01:10463559..10478730
RNA-Seq ExpressionCcUC01G009190
SyntenyCcUC01G009190
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008460540.1 PREDICTED: uncharacterized protein LOC103499334 isoform X1 [Cucumis melo]6.8e-27785.66Show/hide
Query:  EGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPLGE
        EGARL IEN   EN MPP  LD VRVES S LSGTLADGVDN  SAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG TN+HVEGG G+P GE
Subjt:  EGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPLGE

Query:  LLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKA
        LLQCFLK+R+KSMFASEELM+  NVLH RT SHAPR CSPS VCSP+ TL GSY S NH +NKSTESGD+MELKEDKI   EKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKA

Query:  NLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIK
        NLLSST +KDEPYDHVDD N+   DM NVFSN+VSIKSE T PDEHYENK+DNMRLQDR+KFFSSRK F FT +D EHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIK

Query:  RRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSL
        RRCKRKKT TNS+ETALEEDAPGLLQILVDKGV +DEIKLYGE ESD+DLD SF ED FGELEDVISRLFSQRHSF++FPSIRCMKSSR SYCLACLVSL
Subjt:  RRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSL

Query:  IEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLS
        IEQTRYLHFR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVI+MKLTSCSRISLLEN PLLVGEDLTEGEA VLLS
Subjt:  IEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLS

Query:  YGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        YGW PNSGLGTMLNY  RVVHDRNNEDISEW+SKIGKLLMDGYNGGAL+ ENTS KVAEYS+SQTTQVKLEL
Subjt:  YGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

XP_022939493.1 uncharacterized protein LOC111445382 isoform X1 [Cucurbita moschata]2.4e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

XP_022939495.1 uncharacterized protein LOC111445382 isoform X2 [Cucurbita moschata]2.4e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

XP_022939497.1 uncharacterized protein LOC111445382 isoform X3 [Cucurbita moschata]2.4e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

XP_038874728.1 uncharacterized protein LOC120067269 [Benincasa hispida]2.7e-28184.14Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDN+S+G RL IENP FEN M   VLDRVRVESTS LSGTL DGVD+FASAGVAVTKVKNEMF+DF+EDLDHV LI+RLRMLLSRRALG TN HVE GSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P GELL C LKQREKSMFA EELM+  NVLHDRT SHAPRLCSPSVVCSPNATLP SYFSSNH+ NKSTESGD+MELK+DKI   EKVAT+L S+PLTD
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVPKANLLSSTK+KDEPY HVDDCN+   D  NV SN+V IKSETTIPDEHYENKLDNMRLQDR+KFFSSRKVF FTSMD EHPKPSDPGCSILV EP S
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISR----------------------LFSQR
         MNIK R KRKKTATNSIETALEEDAPGLLQILVDKGVQ+DEIKLYGEIE+DDDLD SFSED FGELEDVISR                      LFSQR
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISR----------------------LFSQR

Query:  HSFLEFPSIRCMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSC
        HSFL+FPSIRCMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYG+ATYFFELVDSLP+NWQIKRLVIAMKLTSC
Subjt:  HSFLEFPSIRCMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSC

Query:  SRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        SRISLLENRPLLVGEDLTEGEA VLL+YGW PNSGLGTMLNY  RVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYS+ QTTQVKLEL
Subjt:  SRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

TrEMBL top hitse value%identityAlignment
A0A1S3CCT1 uncharacterized protein LOC103499334 isoform X13.3e-27785.66Show/hide
Query:  EGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPLGE
        EGARL IEN   EN MPP  LD VRVES S LSGTLADGVDN  SAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG TN+HVEGG G+P GE
Subjt:  EGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPLGE

Query:  LLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKA
        LLQCFLK+R+KSMFASEELM+  NVLH RT SHAPR CSPS VCSP+ TL GSY S NH +NKSTESGD+MELKEDKI   EKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKA

Query:  NLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIK
        NLLSST +KDEPYDHVDD N+   DM NVFSN+VSIKSE T PDEHYENK+DNMRLQDR+KFFSSRK F FT +D EHPKPSDPGCSILVSEPASL NIK
Subjt:  NLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIK

Query:  RRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSL
        RRCKRKKT TNS+ETALEEDAPGLLQILVDKGV +DEIKLYGE ESD+DLD SF ED FGELEDVISRLFSQRHSF++FPSIRCMKSSR SYCLACLVSL
Subjt:  RRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSL

Query:  IEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLS
        IEQTRYLHFR WPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVI+MKLTSCSRISLLEN PLLVGEDLTEGEA VLLS
Subjt:  IEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLS

Query:  YGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        YGW PNSGLGTMLNY  RVVHDRNNEDISEW+SKIGKLLMDGYNGGAL+ ENTS KVAEYS+SQTTQVKLEL
Subjt:  YGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

A0A6J1FGY5 uncharacterized protein LOC111445382 isoform X31.2e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X11.2e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

A0A6J1FMV3 uncharacterized protein LOC111445382 isoform X21.2e-27484.08Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTL  GVDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   EKVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ NLLSSTK+KDEPYDH + C++   DM NV+SN++SIKSETT+PDE YENK+D+M LQDR+KFFSSRK   FTSMD EHPKPSDPGCS+LVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K+KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRCMK+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        A VLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQ TQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

A0A6J1JZL8 uncharacterized protein LOC111489311 isoform X12.0e-27483.74Show/hide
Query:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG
        MDNYSEGARL  EN  FENP PP VLDRVRVESTSILSGTLA  VDNFA AGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRR+LG  NQHVEGGSG
Subjt:  MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSG

Query:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD
        +P G+LLQCFLKQ+ KSMFASEE M+ GNVLHD++ S+APR CSPSVVCSPNATL GSYFSSNH+LNKSTESG++MELKEDKI   +KVATELGSR LT+
Subjt:  MPLGELLQCFLKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTD

Query:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS
        HVP+ANLLSSTK+KDEPYDH + C++   DM NV+ N++S+KSETT+PDE +ENK+D+M LQDR+KFFSSRK F FTSMD EHPKPSDPGCSILVSEP +
Subjt:  HVPKANLLSSTKMKDEPYDHVDDCNM---DMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPAS

Query:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL
          N KRR K KKTATNSIETALEEDAPGLLQILV+KG+Q+DEIKLYGE ESDDDLD S SED F ELEDVI+RLF QRHSFL+FPS IRC+K+SRASYCL
Subjt:  LMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPS-IRCMKSSRASYCL

Query:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVF+RHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVIAMKLT+CSRISLLENRPLLVGEDLTEGE
Subjt:  ACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL
        AGVLLSYGW  NSGLGTMLNY  RVVHDR+NEDISEWRSKIGKLLMDGYNGGALV ENT KKVAEYS+SQTTQVKLEL
Subjt:  AGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGALVQENTSKKVAEYSNSQTTQVKLEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16610.1 unknown protein2.8e-9547.31Show/hide
Query:  KNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKAN------LLSSTKMKDEPYD
        ++G+ L    ES  P     S   SP A+LP    SSN    K  +      L    +N  E  +T++   PL D V + N            +K E   
Subjt:  KNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKAN------LLSSTKMKDEPYD

Query:  H---VDDCNMDMKNV---FSNSVS-------IKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIKRRCK
        H   +D+  +D   +    +   S       +K+E     E  E+ +D+M+L DR+K  S    F  +    +   PS         E      + R  K
Subjt:  H---VDDCNMDMKNV---FSNSVS-------IKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIKRRCK

Query:  RKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSLIEQT
        RKKTAT+SIETALEEDAPGLLQ+L+ +GV +DE++LYG    D   D S   + F ELEDVIS+LF +R +  +  +    K+SR SYCL CL SLIEQ 
Subjt:  RKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSLIEQT

Query:  RYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLSYGWT
        RYL FR WPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV+AMKL SC R  L+EN+PLLVGED+T GEA VL+ YGW 
Subjt:  RYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLSYGWT

Query:  PNSGLGTMLNYHDRVVHDRNNE-DISEWRSKIGKLLMDGYNGGALV
         N+GLGTMLNY DRV HDR  +   SEWRSKI +LL+DGYN G +V
Subjt:  PNSGLGTMLNYHDRVVHDRNNE-DISEWRSKIGKLLMDGYNGGALV

AT5G16610.2 unknown protein3.5e-9841.27Show/hide
Query:  DRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHV---------------------EGGSGMPLGE
        D   +   + + G  ++ V+NF   G            +  +DL+H+ L ER +MLL R A+     +V                     E G     G 
Subjt:  DRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHV---------------------EGGSGMPLGE

Query:  LLQCFLKQREKSMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPK
            FL++ +  +  +  +  ++G+ L    ES  P     S   SP A+LP    SSN    K  +      L    +N  E  +T++   PL D V +
Subjt:  LLQCFLKQREKSMFASEEL-MKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPK

Query:  AN------LLSSTKMKDEPYDH---VDDCNMDMKNV---FSNSVS-------IKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSD
         N            +K E   H   +D+  +D   +    +   S       +K+E     E  E+ +D+M+L DR+K  S    F  +    +   PS 
Subjt:  AN------LLSSTKMKDEPYDH---VDDCNMDMKNV---FSNSVS-------IKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSD

Query:  PGCSILVSEPASLMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIR
                E      + R  KRKKTAT+SIETALEEDAPGLLQ+L+ +GV +DE++LYG    D   D S   + F ELEDVIS+LF +R +  +  +  
Subjt:  PGCSILVSEPASLMNIKRRCKRKKTATNSIETALEEDAPGLLQILVDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIR

Query:  CMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRP
          K+SR SYCL CL SLIEQ RYL FR WPVEWGWCRDLQSFIFVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV+AMKL SC R  L+EN+P
Subjt:  CMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRP

Query:  LLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNE-DISEWRSKIGKLLMDGYNGGALV
        LLVGED+T GEA VL+ YGW  N+GLGTMLNY DRV HDR  +   SEWRSKI +LL+DGYN G +V
Subjt:  LLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNE-DISEWRSKIGKLLMDGYNGGALV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAACTATAGCGAGGGGGCTCGATTAATTATAGAGAACCCGGCTTTTGAGAATCCAATGCCACCTAACGTTCTTGATCGGGTAAGGGTGGAATCCACAAGCATCTT
ATCAGGTACCTTGGCAGATGGCGTAGATAACTTTGCTTCTGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTCGATGAAGATCTTGATCATGTTT
TATTGATAGAGCGACTAAGGATGCTGCTTTCAAGGCGAGCATTGGGTTCGACAAATCAACATGTGGAGGGTGGTTCTGGTATGCCGTTGGGAGAACTTCTACAATGCTTC
CTGAAACAGAGGGAGAAGTCCATGTTTGCTAGTGAAGAACTGATGAAAAATGGAAATGTGTTGCATGATAGAACTGAAAGTCATGCTCCTCGTCTTTGCAGCCCTTCAGT
AGTTTGTTCACCTAATGCAACTCTTCCGGGATCTTATTTCTCAAGCAATCATACTTTGAACAAATCAACTGAATCAGGCGACAATATGGAACTTAAAGAAGATAAGATCA
ACTTAATGGAAAAGGTAGCTACAGAATTAGGTTCACGGCCTTTGACTGATCATGTTCCTAAAGCAAATTTATTGAGTTCCACGAAAATGAAGGATGAACCTTATGATCAT
GTGGATGACTGCAACATGGATATGAAAAATGTCTTCAGCAACAGTGTGTCAATAAAGAGTGAAACAACCATTCCCGATGAACATTATGAAAATAAGTTAGACAATATGCG
ATTGCAAGATCGAATTAAGTTTTTCTCTTCTCGGAAGGTTTTTGATTTTACATCTATGGATTCCGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTTCAG
AACCTGCTAGCTTAATGAACATTAAACGGAGATGCAAACGGAAAAAAACTGCCACGAATTCAATTGAAACAGCACTGGAGGAGGATGCTCCTGGCCTTCTCCAGATACTT
GTTGACAAAGGCGTACAAATTGATGAGATCAAGCTTTATGGGGAGATAGAAAGTGATGATGATCTAGATGTGTCTTTTAGTGAAGACAGGTTTGGGGAACTTGAAGATGT
GATATCAAGGCTTTTTTCTCAACGCCATTCCTTTTTGGAGTTTCCCTCTATAAGATGCATGAAAAGTTCAAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTCATTG
AGCAGACGAGATATCTTCATTTCCGAAACTGGCCTGTCGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTGATGGAA
CGCCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAACTGGCAGATAAAGCGGTTGGTGATTGCTATGAAGCTTACGAGTTGTAGCAG
AATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCGAAGGTGAGGCAGGGGTTTTATTGAGCTATGGATGGACGCCGAATAGTGGCTTGGGTACAA
TGCTGAACTACCATGACAGAGTCGTTCACGATCGGAATAATGAGGACATCTCGGAATGGAGATCAAAAATAGGGAAGCTACTGATGGATGGTTATAATGGAGGAGCTCTT
GTGCAAGAAAATACTTCAAAGAAGGTTGCAGAATACAGCAATTCCCAAACCACACAAGTTAAGCTGGAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAACTATAGCGAGGGGGCTCGATTAATTATAGAGAACCCGGCTTTTGAGAATCCAATGCCACCTAACGTTCTTGATCGGGTAAGGGTGGAATCCACAAGCATCTT
ATCAGGTACCTTGGCAGATGGCGTAGATAACTTTGCTTCTGCTGGTGTGGCTGTAACTAAGGTTAAAAATGAGATGTTTGATGACTTCGATGAAGATCTTGATCATGTTT
TATTGATAGAGCGACTAAGGATGCTGCTTTCAAGGCGAGCATTGGGTTCGACAAATCAACATGTGGAGGGTGGTTCTGGTATGCCGTTGGGAGAACTTCTACAATGCTTC
CTGAAACAGAGGGAGAAGTCCATGTTTGCTAGTGAAGAACTGATGAAAAATGGAAATGTGTTGCATGATAGAACTGAAAGTCATGCTCCTCGTCTTTGCAGCCCTTCAGT
AGTTTGTTCACCTAATGCAACTCTTCCGGGATCTTATTTCTCAAGCAATCATACTTTGAACAAATCAACTGAATCAGGCGACAATATGGAACTTAAAGAAGATAAGATCA
ACTTAATGGAAAAGGTAGCTACAGAATTAGGTTCACGGCCTTTGACTGATCATGTTCCTAAAGCAAATTTATTGAGTTCCACGAAAATGAAGGATGAACCTTATGATCAT
GTGGATGACTGCAACATGGATATGAAAAATGTCTTCAGCAACAGTGTGTCAATAAAGAGTGAAACAACCATTCCCGATGAACATTATGAAAATAAGTTAGACAATATGCG
ATTGCAAGATCGAATTAAGTTTTTCTCTTCTCGGAAGGTTTTTGATTTTACATCTATGGATTCCGAGCATCCAAAACCTTCTGACCCTGGATGCAGCATTCTTGTTTCAG
AACCTGCTAGCTTAATGAACATTAAACGGAGATGCAAACGGAAAAAAACTGCCACGAATTCAATTGAAACAGCACTGGAGGAGGATGCTCCTGGCCTTCTCCAGATACTT
GTTGACAAAGGCGTACAAATTGATGAGATCAAGCTTTATGGGGAGATAGAAAGTGATGATGATCTAGATGTGTCTTTTAGTGAAGACAGGTTTGGGGAACTTGAAGATGT
GATATCAAGGCTTTTTTCTCAACGCCATTCCTTTTTGGAGTTTCCCTCTATAAGATGCATGAAAAGTTCAAGAGCAAGCTATTGCTTAGCTTGTCTAGTTTCACTCATTG
AGCAGACGAGATATCTTCATTTCCGAAACTGGCCTGTCGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATATTTGTATTTGAGAGACATAAAAGAATAGTGATGGAA
CGCCCCGAGTATGGCTATGCTACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAACTGGCAGATAAAGCGGTTGGTGATTGCTATGAAGCTTACGAGTTGTAGCAG
AATTTCACTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCGAAGGTGAGGCAGGGGTTTTATTGAGCTATGGATGGACGCCGAATAGTGGCTTGGGTACAA
TGCTGAACTACCATGACAGAGTCGTTCACGATCGGAATAATGAGGACATCTCGGAATGGAGATCAAAAATAGGGAAGCTACTGATGGATGGTTATAATGGAGGAGCTCTT
GTGCAAGAAAATACTTCAAAGAAGGTTGCAGAATACAGCAATTCCCAAACCACACAAGTTAAGCTGGAACTCTGA
Protein sequenceShow/hide protein sequence
MDNYSEGARLIIENPAFENPMPPNVLDRVRVESTSILSGTLADGVDNFASAGVAVTKVKNEMFDDFDEDLDHVLLIERLRMLLSRRALGSTNQHVEGGSGMPLGELLQCF
LKQREKSMFASEELMKNGNVLHDRTESHAPRLCSPSVVCSPNATLPGSYFSSNHTLNKSTESGDNMELKEDKINLMEKVATELGSRPLTDHVPKANLLSSTKMKDEPYDH
VDDCNMDMKNVFSNSVSIKSETTIPDEHYENKLDNMRLQDRIKFFSSRKVFDFTSMDSEHPKPSDPGCSILVSEPASLMNIKRRCKRKKTATNSIETALEEDAPGLLQIL
VDKGVQIDEIKLYGEIESDDDLDVSFSEDRFGELEDVISRLFSQRHSFLEFPSIRCMKSSRASYCLACLVSLIEQTRYLHFRNWPVEWGWCRDLQSFIFVFERHKRIVME
RPEYGYATYFFELVDSLPINWQIKRLVIAMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLSYGWTPNSGLGTMLNYHDRVVHDRNNEDISEWRSKIGKLLMDGYNGGAL
VQENTSKKVAEYSNSQTTQVKLEL