; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0007437 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0007437
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr09:11183932..11210830
RNA-Seq ExpressionPI0007437
SyntenyPI0007437
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648436.1 hypothetical protein Csa_008851 [Cucumis sativus]0.0e+0090.64Show/hide
Query:  LKSSKFTYLLAVDETNSQYSQLDLDELEWNSHLMDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDE
        L +S F     VDE NSQYS+LDLDELE NSHLMDNCEGA+LNIEN TFEN + PEVLD VRVESISSL GTLADGVDNFGSA VAVTKVKNEM DDFDE
Subjt:  LKSSKFTYLLAVDETNSQYSQLDLDELEWNSHLMDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDE

Query:  DLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNK
        DLDHVLLIERLRMLLSRRALGLTNRH EGG GV SGE LQCFLK REKSMFASEE MEIEN+LHSRTGSHAP PC  SE+CSP  TLTGS CSGN CVNK
Subjt:  DLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNK

Query:  SSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFF
        S+ES DDMELKEDKICSTEKVATELGSRPLTDHVPKANLLS TKVKDEPYDHVDD+NIYGKDMNNVFS+TV IKSEAT+PDEHYENKVDNMRLQDRMKFF
Subjt:  SSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFF

Query:  SSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELE
        SS+KDFGFTPM+YEHPKPS+PGCSILVSEPASL NIKRRRKRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESFSEDSF EL+
Subjt:  SSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELE

Query:  DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKR
        DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQ RYL FRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKR
Subjt:  DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKR

Query:  LVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSS
        LVISMKLTSCSRISLLEN PLLVGEDLTEGEAGVL  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSS
Subjt:  LVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSS

Query:  QTTQVKLEL
        QTTQVKLEL
Subjt:  QTTQVKLEL

XP_008460540.1 PREDICTED: uncharacterized protein LOC103499334 isoform X1 [Cucumis melo]2.9e-30493.36Show/hide
Query:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE
        EGA+LNIEN T ENL+ PE LD VRVESISSLSGTLADGVDN GSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGG GVPSGE
Subjt:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE

Query:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA
        LLQCFLK R+KSMFASEELMEIEN+LHSRTGSHAPRPCS SE+CSPS TLTGS CSGNHCVNKS+ESGDDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA

Query:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK
        NLLSST VKDEPYDHVDD+NIYGKDMNNVFSNTV IKSEAT PDEHYENKVDNMRLQDRMKFFSSRKDFGFTP+DYEHPKPS+PGCSILVSEPASLTNIK
Subjt:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK

Query:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESF ED FGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG
        IEQ RYLHFR WPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGEA VLL 
Subjt:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_008460542.1 PREDICTED: uncharacterized protein LOC103499334 isoform X2 [Cucumis melo]1.6e-28388.64Show/hide
Query:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE
        EGA+LNIEN T ENL+ PE LD VRVESISSLSGTLADGVDN GSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGG GVPSGE
Subjt:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE

Query:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA
        LLQCFLK R+KSMFASEELMEIEN+LHSRTGSHAPRPCS SE+CSPS TLTGS CSGNHCVNKS+ESGDDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA

Query:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK
        NLLSST VKDEPYDHVDD+NIY                             DNMRLQDRMKFFSSRKDFGFTP+DYEHPKPS+PGCSILVSEPASLTNIK
Subjt:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK

Query:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESF ED FGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG
        IEQ RYLHFR WPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGEA VLL 
Subjt:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_011655236.1 uncharacterized protein LOC101212787 isoform X1 [Cucumis sativus]2.5e-30092.01Show/hide
Query:  MDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGV
        MDNCEGA+LNIEN TFEN + PEVLD VRVESISSL GTLADGVDNFGSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGG GV
Subjt:  MDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGV

Query:  PSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDH
         SGE LQCFLK REKSMFASEE MEIEN+LHSRTGSHAP PC  SE+CSP  TLTGS CSGN CVNKS+ES DDMELKEDKICSTEKVATELGSRPLTDH
Subjt:  PSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDH

Query:  VPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASL
        VPKANLLS TKVKDEPYDHVDD+NIYGKDMNNVFS+TV IKSEAT+PDEHYENKVDNMRLQDRMKFFSS+KDFGFTPM+YEHPKPS+PGCSILVSEPASL
Subjt:  VPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASL

Query:  TNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLAC
         NIKRRRKRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESFSEDSF EL+DVISRLFSQRHSFMKFPSIRCMKSSRVSYCLAC
Subjt:  TNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLAC

Query:  LVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAG
        LVSLIEQ RYL FRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGEAG
Subjt:  LVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAG

Query:  VLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        VL  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  VLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

XP_038874728.1 uncharacterized protein LOC120067269 [Benincasa hispida]1.0e-27780.55Show/hide
Query:  AVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIE
        A D  N +  QL     +   HLMDN  +G +LNIENPTFEN ++ +VLD+VRVES S+LSGTL DGVD+F SA VAVTKVKNEM +DF+EDLDHV LI+
Subjt:  AVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIE

Query:  RLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDME
        RLRMLLSRRALGLTN HVE GSGVPSGELL C LK REKSMFA EELMEIEN+LH RTGSHAPR CS S +CSP+ TL  S  S NH  NKS+ESGDDME
Subjt:  RLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDME

Query:  LKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFT
        LK+DKICSTEKVAT+L S+PLTDHVPKANLLSSTKVKDEPY HVDD NIYGKD NNV SNTVLIKSE T+PDEHYENK+DNMRLQDRMKFFSSRK FGFT
Subjt:  LKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFT

Query:  PMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISR----
         MDYEHPKPS+PGCSILV EP S  NIK RRKRKKT TN +ETALEEDAPGLLQILVDKGV VDEIKLYGE E+DDDLDESFSEDSFGELEDVISR    
Subjt:  PMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISR----

Query:  ------------------LFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFE
                          LFSQRHSF+KFPSIRCMKSSR SYCLACLVSLIEQ RYLHFRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYG+ATYFFE
Subjt:  ------------------LFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFE

Query:  LVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLL
        LVDSLP+NWQIKRLVI+MKLTSCSRISLLENRPLLVGEDLTEGEA VLL YGWMPNSGLGTMLNYRGRVVHDRNNEDISEW+SKIGKLLMDGYNGGAL+ 
Subjt:  LVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLL

Query:  ENTSMKVAEYSSSQTTQVKLEL
        ENTS KVAEYSS QTTQVKLEL
Subjt:  ENTSMKVAEYSSSQTTQVKLEL

TrEMBL top hitse value%identityAlignment
A0A0A0KNC1 Uncharacterized protein4.9e-30292.04Show/hide
Query:  HLMDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGS
        HLMDNCEGA+LNIEN TFEN + PEVLD VRVESISSL GTLADGVDNFGSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRH EGG 
Subjt:  HLMDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGS

Query:  GVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLT
        GV SGE LQCFLK REKSMFASEE MEIEN+LHSRTGSHAP PC  SE+CSP  TLTGS CSGN CVNKS+ES DDMELKEDKICSTEKVATELGSRPLT
Subjt:  GVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLT

Query:  DHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPA
        DHVPKANLLS TKVKDEPYDHVDD+NIYGKDMNNVFS+TV IKSEAT+PDEHYENKVDNMRLQDRMKFFSS+KDFGFTPM+YEHPKPS+PGCSILVSEPA
Subjt:  DHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPA

Query:  SLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCL
        SL NIKRRRKRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESFSEDSF EL+DVISRLFSQRHSFMKFPSIRCMKSSRVSYCL
Subjt:  SLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCL

Query:  ACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGE
        ACLVSLIEQ RYL FRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGE
Subjt:  ACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGE

Query:  AGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        AGVL  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTS+KVAEYSSSQTTQVKLEL
Subjt:  AGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A1S3CCP8 uncharacterized protein LOC103499334 isoform X27.9e-28488.64Show/hide
Query:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE
        EGA+LNIEN T ENL+ PE LD VRVESISSLSGTLADGVDN GSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGG GVPSGE
Subjt:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE

Query:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA
        LLQCFLK R+KSMFASEELMEIEN+LHSRTGSHAPRPCS SE+CSPS TLTGS CSGNHCVNKS+ESGDDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA

Query:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK
        NLLSST VKDEPYDHVDD+NIY                             DNMRLQDRMKFFSSRKDFGFTP+DYEHPKPS+PGCSILVSEPASLTNIK
Subjt:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK

Query:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESF ED FGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG
        IEQ RYLHFR WPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGEA VLL 
Subjt:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A1S3CCT1 uncharacterized protein LOC103499334 isoform X11.4e-30493.36Show/hide
Query:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE
        EGA+LNIEN T ENL+ PE LD VRVESISSLSGTLADGVDN GSA VAVTKVKNEM DDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGG GVPSGE
Subjt:  EGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGE

Query:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA
        LLQCFLK R+KSMFASEELMEIEN+LHSRTGSHAPRPCS SE+CSPS TLTGS CSGNHCVNKS+ESGDDMELKEDKICSTEKVATEL SRPLTDHVPK 
Subjt:  LLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDMELKEDKICSTEKVATELGSRPLTDHVPKA

Query:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK
        NLLSST VKDEPYDHVDD+NIYGKDMNNVFSNTV IKSEAT PDEHYENKVDNMRLQDRMKFFSSRKDFGFTP+DYEHPKPS+PGCSILVSEPASLTNIK
Subjt:  NLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIK

Query:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
        RR KRKKTVTN VETALEEDAPGLLQILVDKGVLVDEIKLYGETESD+DLDESF ED FGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL
Subjt:  RRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSL

Query:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG
        IEQ RYLHFR WPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLEN PLLVGEDLTEGEA VLL 
Subjt:  IEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG

Query:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
        YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL
Subjt:  YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL

A0A6J1FLT1 uncharacterized protein LOC111445382 isoform X16.9e-27280.17Show/hide
Query:  TYLLAVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHV
        T + A D  N    QL+    + N H MDN  EGA+LN EN TFEN   PEVLD+VRVES S LSGTL  GVDNF  A VAVTKVKNEM DDFDEDLDHV
Subjt:  TYLLAVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHV

Query:  LLIERLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESG
        LLIERLRMLLSRRALGL N+HVEGGSGVPSG+LLQCFLK + KSMFASEE MEI N+LH ++GS+APR CS S +CSP+ TL+GS  S NH +NKS+ESG
Subjt:  LLIERLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESG

Query:  DDMELKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKD
        +DMELKEDKICS+EKVATELGSR LT+HVP+ NLLSSTKVKDEPYDH +  +IYGKDMNNV+SNT+ IKSE T+PDE YENKVD+M LQDRMKFFSSRKD
Subjt:  DDMELKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKD

Query:  FGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISR
         GFT MDYEHPKPS+PGCS+LVSEP +  N KRRRK+KKT TN +ETALEEDAPGLLQILV+KG+ VDEIKLYGETESDDDLDES SEDSF ELEDVI+R
Subjt:  FGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISR

Query:  LFSQRHSFMKFPS-IRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIS
        LF QRHSF+KFPS IRCMK+SR SYCLACLVSLIEQ RYLHFRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVI+
Subjt:  LFSQRHSFMKFPS-IRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVIS

Query:  MKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQ
        MKLT+CSRISLLENRPLLVGEDLTEGEA VLL YGWM NSGLGTMLNYRGRVVHDR+NEDISEW+SKIGKLLMDGYNGGAL+LENT  KVAEYSSSQ TQ
Subjt:  MKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQ

Query:  VKLEL
        VKLEL
Subjt:  VKLEL

A0A6J1FMV3 uncharacterized protein LOC111445382 isoform X22.0e-27180.53Show/hide
Query:  AVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIE
        A D  N    QL+    + N H MDN  EGA+LN EN TFEN   PEVLD+VRVES S LSGTL  GVDNF  A VAVTKVKNEM DDFDEDLDHVLLIE
Subjt:  AVDETNSQYSQLDLDELEWNSHLMDN-CEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIE

Query:  RLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDME
        RLRMLLSRRALGL N+HVEGGSGVPSG+LLQCFLK + KSMFASEE MEI N+LH ++GS+APR CS S +CSP+ TL+GS  S NH +NKS+ESG+DME
Subjt:  RLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDDME

Query:  LKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFT
        LKEDKICS+EKVATELGSR LT+HVP+ NLLSSTKVKDEPYDH +  +IYGKDMNNV+SNT+ IKSE T+PDE YENKVD+M LQDRMKFFSSRKD GFT
Subjt:  LKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFT

Query:  PMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQ
         MDYEHPKPS+PGCS+LVSEP +  N KRRRK+KKT TN +ETALEEDAPGLLQILV+KG+ VDEIKLYGETESDDDLDES SEDSF ELEDVI+RLF Q
Subjt:  PMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQ

Query:  RHSFMKFPS-IRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLT
        RHSF+KFPS IRCMK+SR SYCLACLVSLIEQ RYLHFRNWPVEWGWCRDLQSF+FVFERHKRIVMERPEYGYATYFFELV+SLPI+WQIKRLVI+MKLT
Subjt:  RHSFMKFPS-IRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLT

Query:  SCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLE
        +CSRISLLENRPLLVGEDLTEGEA VLL YGWM NSGLGTMLNYRGRVVHDR+NEDISEW+SKIGKLLMDGYNGGAL+LENT  KVAEYSSSQ TQVKLE
Subjt:  SCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLE

Query:  L
        L
Subjt:  L

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16610.1 unknown protein1.0e-9449.36Show/hide
Query:  MELKEDKICSTEKVATELGSRPLTDHVPKANLLS-STKVKDEPYDH---VDDNNIYGKDMNNVFSNTV-------LIKSEATVPDEHYENKVDNMRLQDR
        + L E++I ST+     L    + D+  K  + S    VK E   H   +D+N +    ++   +           +K+EA    E  E+ +D+M+L DR
Subjt:  MELKEDKICSTEKVATELGSRPLTDHVPKANLLS-STKVKDEPYDH---VDDNNIYGKDMNNVFSNTV-------LIKSEATVPDEHYENKVDNMRLQDR

Query:  MKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSF
        +K     + F  +    +   PS+        E    + + R  KRKKT T+ +ETALEEDAPGLLQ+L+ +GV VDE++LYG    D   D+S   +SF
Subjt:  MKFFSSRKDFGFTPMDYEHPKPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSF

Query:  GELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINW
         ELEDVIS+LF +R +  K  +    K+SR SYCL CL SLIEQARYL FR WPVEWGWCRDLQSF+FVFERH RIVMERPEYGYATYFFEL ++  I W
Subjt:  GELEDVISRLFSQRHSFMKFPSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINW

Query:  QIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL
        Q+KRLV++MKL SC R  L+EN+PLLVGED+T GEA VL+ YGW+ N+GLGTMLNYR RV HDR  +   SEW+SKI +LL+DGYN G ++
Subjt:  QIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL

AT5G16610.2 unknown protein1.3e-9740.46Show/hide
Query:  ITPEVLDQVRVESISSL--SGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGS-GVPSGELLQCFLKIREKS
        ++ E+L+ + +     L  SG +    +  G    AV         +  +DL+H+ L ER +MLL R A+ L   +VE  +      EL +   +I  ++
Subjt:  ITPEVLDQVRVESISSL--SGTLADGVDNFGSADVAVTKVKNEMSDDFDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGS-GVPSGELLQCFLKIREKS

Query:  MFASEELMEIENLLH-------------SRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSES---------GDDMELKEDKICSTEKVATELGS
          AS   ++    L              S +GS       S    SP R+   S  +    V+ S+++          + + L E++I ST+     L  
Subjt:  MFASEELMEIENLLH-------------SRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSES---------GDDMELKEDKICSTEKVATELGS

Query:  RPLTDHVPKANLLS-STKVKDEPYDH---VDDNNIYGKDMNNVFSNTV-------LIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHP
          + D+  K  + S    VK E   H   +D+N +    ++   +           +K+EA    E  E+ +D+M+L DR+K     + F  +    +  
Subjt:  RPLTDHVPKANLLS-STKVKDEPYDH---VDDNNIYGKDMNNVFSNTV-------LIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHP

Query:  KPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKF
         PS+        E    + + R  KRKKT T+ +ETALEEDAPGLLQ+L+ +GV VDE++LYG    D   D+S   +SF ELEDVIS+LF +R +  K 
Subjt:  KPSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKF

Query:  PSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLL
         +    K+SR SYCL CL SLIEQARYL FR WPVEWGWCRDLQSF+FVFERH RIVMERPEYGYATYFFEL ++  I WQ+KRLV++MKL SC R  L+
Subjt:  PSIRCMKSSRVSYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLL

Query:  ENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL
        EN+PLLVGED+T GEA VL+ YGW+ N+GLGTMLNYR RV HDR  +   SEW+SKI +LL+DGYN G ++
Subjt:  ENRPLLVGEDLTEGEAGVLLGYGWMPNSGLGTMLNYRGRVVHDRNNE-DISEWKSKIGKLLMDGYNGGALL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGGGTCACGCACGAGGATGGAGACACGTGTTGTGTCAAACGGGTCTCAGAGCGGGTCGGGTCGGATGAAACAGCTCTCATTTGGAATTTGATCGGAGAAAGGGT
GGAAGAAGAACTACTACAAATGCCCCCGTCCGCCGACCACCGCTCAGCCGACCACCCGCCTCTACGACGTTACTCGCTGCCGTCGTCTTCGACGTCCGGCCAGTCCCCCA
CGCGCCGTGCGAAGGATCATAGCCGGTCATCTCGACTCCTTAAGGTTGGAAAGGACAAAAGCACTATAGTTAAAGGAAAGTTGGAAACTCACAATGAGCTAAGTAAAAGT
GAGAACATGTTCAACGTTCCCATTAATATGAGGTGGTGGTTAAAAAGCAGCAAATTCACTTATTTGCTTGCAGTTGACGAGACAAACTCACAGTATAGTCAATTGGATCT
GGATGAGCTAGAGTGGAATAGTCATCTTATGGACAACTGCGAGGGAGCTCAATTAAATATAGAGAACCCGACTTTTGAGAATCTAATAACACCTGAGGTTCTGGATCAGG
TAAGGGTGGAATCCATAAGCAGCTTATCAGGTACCTTGGCAGATGGTGTAGATAACTTTGGTTCTGCTGATGTGGCTGTAACTAAGGTTAAAAATGAGATGTCTGATGAT
TTTGATGAAGATCTTGATCATGTTTTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGCGAGCATTGGGTTTGACAAATCGACATGTGGAGGGTGGTTCTGGTGTGCC
TTCGGGAGAACTTCTACAATGCTTCTTGAAAATAAGAGAGAAATCCATGTTTGCTAGTGAAGAACTGATGGAAATTGAAAATTTGTTGCATTCTAGAACTGGAAGTCATG
CTCCTCGTCCTTGCAGCTCTTCAGAAATTTGTTCACCTAGTCGAACTCTTACGGGATCATGTTGCTCAGGCAATCATTGTGTGAACAAGTCATCTGAATCAGGCGATGAT
ATGGAACTGAAAGAAGATAAGATCTGCTCAACAGAGAAGGTAGCTACAGAATTAGGTTCACGGCCTTTGACTGATCATGTTCCTAAAGCAAATTTATTGAGTTCCACGAA
AGTGAAGGATGAACCTTATGATCATGTGGATGACAACAACATATATGGTAAGGATATGAATAATGTCTTCAGTAATACTGTGTTGATAAAGAGTGAAGCAACCGTTCCCG
ATGAACATTATGAAAACAAGGTAGACAATATGCGATTGCAAGATCGAATGAAGTTTTTCTCTTCTAGGAAGGATTTTGGTTTTACACCTATGGATTACGAGCATCCAAAA
CCTTCTAACCCTGGATGCAGCATTCTTGTTTCAGAACCTGCTAGTTTAACGAACATTAAACGAAGACGCAAACGGAAAAAAACTGTCACGAATTTAGTTGAAACAGCATT
GGAGGAAGATGCTCCTGGCCTTCTCCAGATACTAGTTGACAAAGGTGTACTAGTTGATGAAATCAAGCTTTATGGGGAGACAGAAAGTGATGATGATCTAGATGAGTCTT
TTAGCGAAGACAGCTTTGGTGAGCTTGAAGATGTGATATCAAGGCTTTTTTCTCAACGCCATTCCTTTATGAAGTTTCCCTCCATAAGATGCATGAAAAGTTCAAGAGTA
AGCTATTGTTTAGCTTGTCTAGTTTCACTTATTGAGCAGGCAAGATATCTTCATTTCCGGAACTGGCCTGTTGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATGTT
TGTATTTGAGAGACATAAAAGAATAGTGATGGAACGTCCTGAGTATGGCTATGCGACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAACTGGCAGATAAAGCGGT
TGGTGATTTCCATGAAGCTTACGAGTTGTAGCAGAATTTCATTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCGAAGGCGAGGCAGGGGTTTTATTGGGC
TATGGGTGGATGCCGAATAGTGGCTTGGGTACAATGCTGAACTACCGTGGCAGAGTTGTTCATGACCGGAATAATGAGGACATCTCCGAATGGAAATCAAAAATAGGGAA
GCTATTGATGGATGGTTATAATGGCGGAGCTCTTTTGCTAGAAAATACTTCAATGAAGGTTGCAGAATACAGCAGTTCCCAAACCACACAAGTTAAGCTAGAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACGGGTCACGCACGAGGATGGAGACACGTGTTGTGTCAAACGGGTCTCAGAGCGGGTCGGGTCGGATGAAACAGCTCTCATTTGGAATTTGATCGGAGAAAGGGT
GGAAGAAGAACTACTACAAATGCCCCCGTCCGCCGACCACCGCTCAGCCGACCACCCGCCTCTACGACGTTACTCGCTGCCGTCGTCTTCGACGTCCGGCCAGTCCCCCA
CGCGCCGTGCGAAGGATCATAGCCGGTCATCTCGACTCCTTAAGGTTGGAAAGGACAAAAGCACTATAGTTAAAGGAAAGTTGGAAACTCACAATGAGCTAAGTAAAAGT
GAGAACATGTTCAACGTTCCCATTAATATGAGGTGGTGGTTAAAAAGCAGCAAATTCACTTATTTGCTTGCAGTTGACGAGACAAACTCACAGTATAGTCAATTGGATCT
GGATGAGCTAGAGTGGAATAGTCATCTTATGGACAACTGCGAGGGAGCTCAATTAAATATAGAGAACCCGACTTTTGAGAATCTAATAACACCTGAGGTTCTGGATCAGG
TAAGGGTGGAATCCATAAGCAGCTTATCAGGTACCTTGGCAGATGGTGTAGATAACTTTGGTTCTGCTGATGTGGCTGTAACTAAGGTTAAAAATGAGATGTCTGATGAT
TTTGATGAAGATCTTGATCATGTTTTATTGATAGAGCGACTAAGGATGCTGCTATCAAGGCGAGCATTGGGTTTGACAAATCGACATGTGGAGGGTGGTTCTGGTGTGCC
TTCGGGAGAACTTCTACAATGCTTCTTGAAAATAAGAGAGAAATCCATGTTTGCTAGTGAAGAACTGATGGAAATTGAAAATTTGTTGCATTCTAGAACTGGAAGTCATG
CTCCTCGTCCTTGCAGCTCTTCAGAAATTTGTTCACCTAGTCGAACTCTTACGGGATCATGTTGCTCAGGCAATCATTGTGTGAACAAGTCATCTGAATCAGGCGATGAT
ATGGAACTGAAAGAAGATAAGATCTGCTCAACAGAGAAGGTAGCTACAGAATTAGGTTCACGGCCTTTGACTGATCATGTTCCTAAAGCAAATTTATTGAGTTCCACGAA
AGTGAAGGATGAACCTTATGATCATGTGGATGACAACAACATATATGGTAAGGATATGAATAATGTCTTCAGTAATACTGTGTTGATAAAGAGTGAAGCAACCGTTCCCG
ATGAACATTATGAAAACAAGGTAGACAATATGCGATTGCAAGATCGAATGAAGTTTTTCTCTTCTAGGAAGGATTTTGGTTTTACACCTATGGATTACGAGCATCCAAAA
CCTTCTAACCCTGGATGCAGCATTCTTGTTTCAGAACCTGCTAGTTTAACGAACATTAAACGAAGACGCAAACGGAAAAAAACTGTCACGAATTTAGTTGAAACAGCATT
GGAGGAAGATGCTCCTGGCCTTCTCCAGATACTAGTTGACAAAGGTGTACTAGTTGATGAAATCAAGCTTTATGGGGAGACAGAAAGTGATGATGATCTAGATGAGTCTT
TTAGCGAAGACAGCTTTGGTGAGCTTGAAGATGTGATATCAAGGCTTTTTTCTCAACGCCATTCCTTTATGAAGTTTCCCTCCATAAGATGCATGAAAAGTTCAAGAGTA
AGCTATTGTTTAGCTTGTCTAGTTTCACTTATTGAGCAGGCAAGATATCTTCATTTCCGGAACTGGCCTGTTGAATGGGGGTGGTGCCGGGATCTCCAGTCTTTTATGTT
TGTATTTGAGAGACATAAAAGAATAGTGATGGAACGTCCTGAGTATGGCTATGCGACATATTTTTTTGAGCTTGTGGATTCCTTACCCATCAACTGGCAGATAAAGCGGT
TGGTGATTTCCATGAAGCTTACGAGTTGTAGCAGAATTTCATTACTTGAGAACAGACCATTATTGGTTGGGGAAGATTTGACCGAAGGCGAGGCAGGGGTTTTATTGGGC
TATGGGTGGATGCCGAATAGTGGCTTGGGTACAATGCTGAACTACCGTGGCAGAGTTGTTCATGACCGGAATAATGAGGACATCTCCGAATGGAAATCAAAAATAGGGAA
GCTATTGATGGATGGTTATAATGGCGGAGCTCTTTTGCTAGAAAATACTTCAATGAAGGTTGCAGAATACAGCAGTTCCCAAACCACACAAGTTAAGCTAGAACTCTGA
Protein sequenceShow/hide protein sequence
MKRVTHEDGDTCCVKRVSERVGSDETALIWNLIGERVEEELLQMPPSADHRSADHPPLRRYSLPSSSTSGQSPTRRAKDHSRSSRLLKVGKDKSTIVKGKLETHNELSKS
ENMFNVPINMRWWLKSSKFTYLLAVDETNSQYSQLDLDELEWNSHLMDNCEGAQLNIENPTFENLITPEVLDQVRVESISSLSGTLADGVDNFGSADVAVTKVKNEMSDD
FDEDLDHVLLIERLRMLLSRRALGLTNRHVEGGSGVPSGELLQCFLKIREKSMFASEELMEIENLLHSRTGSHAPRPCSSSEICSPSRTLTGSCCSGNHCVNKSSESGDD
MELKEDKICSTEKVATELGSRPLTDHVPKANLLSSTKVKDEPYDHVDDNNIYGKDMNNVFSNTVLIKSEATVPDEHYENKVDNMRLQDRMKFFSSRKDFGFTPMDYEHPK
PSNPGCSILVSEPASLTNIKRRRKRKKTVTNLVETALEEDAPGLLQILVDKGVLVDEIKLYGETESDDDLDESFSEDSFGELEDVISRLFSQRHSFMKFPSIRCMKSSRV
SYCLACLVSLIEQARYLHFRNWPVEWGWCRDLQSFMFVFERHKRIVMERPEYGYATYFFELVDSLPINWQIKRLVISMKLTSCSRISLLENRPLLVGEDLTEGEAGVLLG
YGWMPNSGLGTMLNYRGRVVHDRNNEDISEWKSKIGKLLMDGYNGGALLLENTSMKVAEYSSSQTTQVKLEL