; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039014 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039014
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold12:3254573..3269676
RNA-Seq ExpressionSpg039014
SyntenySpg039014
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047377.1 kinesin light chain 3 isoform X1 [Cucumis melo var. makuwa]2.6e-7142.08Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS                       GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMKTFDKAEPMYLEAINILEESYG+EDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE-----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS
                                    RVGSALHNLGQFYLVQRKLKE+C+CYE     IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDS
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE-----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS

Query:  IRILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        IRILEEGGLGESILCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  IRILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

TYK14053.1 kinesin light chain 3 [Cucumis melo var. makuwa]8.0e-7343.3Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS           GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK

Query:  AEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIHCGSILIHIDLVL
        AEPMYLEAINILEESYG+EDI                                                                               
Subjt:  AEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIHCGSILIHIDLVL

Query:  EQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSESNFSVSCIVTYR
                                                                                                            
Subjt:  EQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSESNFSVSCIVTYR

Query:  VYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI
                        RVGSALHNLGQFYLVQRKLKE+C+CYE    IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI
Subjt:  VYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI

Query:  LCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        LCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  LCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

XP_008457118.2 PREDICTED: uncharacterized protein LOC103496867 [Cucumis melo]2.0e-7142.17Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS                       GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMKTFDKAEPMYLEAINILEESYG+EDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
                                    RVGSALHNLGQFYLVQRKLKE+C+CYE    IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI

Query:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        RILEEGGLGESILCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

XP_022143432.1 kinesin light chain 1 isoform X1 [Momordica charantia]5.8e-7141.52Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPV AKD +FKPSSENGIEE+DT+GLRKVEDGS                       GRLEEAER F+SA+QEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMK +DKAEPMYLEAINILEESYGSEDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
                                    RVGSALHNLGQFYLVQRKLKEAC+CYE    IKGRVLGHGH+DYADTMYHLGTVLYLLG+EKDSEALIQDS+
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI

Query:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        RILEEGGLGESILCIRRLRYLAK  + L +   +        ++HI+E+ +  W S   +
Subjt:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

XP_031738901.1 uncharacterized protein LOC101207905 isoform X2 [Cucumis sativus]6.8e-7242.27Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        +A+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS                       GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMKTFDKAEPMYLEAI ILEESYG+EDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYEIKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILE
                                    RVGSALHNLGQ YLVQRKLKE+C+CYEIKGRVLG+GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILE
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYEIKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILE

Query:  EGGLGESILCIRRLRYLAKDIM---SLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        EGGLGESILCIRRLRYLAK  M   +LL T  +        ++HI+E+ +  W S   +
Subjt:  EGGLGESILCIRRLRYLAKDIM---SLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

TrEMBL top hitse value%identityAlignment
A0A1S3C5G7 uncharacterized protein LOC1034968679.6e-7242.17Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS                       GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMKTFDKAEPMYLEAINILEESYG+EDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
                                    RVGSALHNLGQFYLVQRKLKE+C+CYE    IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI

Query:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        RILEEGGLGESILCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

A0A5A7TZJ1 Kinesin light chain 3 isoform X11.3e-7142.08Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS                       GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMKTFDKAEPMYLEAINILEESYG+EDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE-----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS
                                    RVGSALHNLGQFYLVQRKLKE+C+CYE     IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDS
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE-----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS

Query:  IRILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        IRILEEGGLGESILCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  IRILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

A0A5D3CQC2 Kinesin light chain 33.9e-7343.3Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK
        AA+ILG+NSNPVLA++ASFKPSSENGIE+ +TVGLRKVEDGS           GRLE+AE+YFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDK

Query:  AEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIHCGSILIHIDLVL
        AEPMYLEAINILEESYG+EDI                                                                               
Subjt:  AEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIHCGSILIHIDLVL

Query:  EQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSESNFSVSCIVTYR
                                                                                                            
Subjt:  EQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSESNFSVSCIVTYR

Query:  VYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI
                        RVGSALHNLGQFYLVQRKLKE+C+CYE    IKGRVLG GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI
Subjt:  VYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESI

Query:  LCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        LCIRRLRYLAK  M + +   +        ++HI+E+ +  W S   +
Subjt:  LCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

A0A6J1CQT4 kinesin light chain 1 isoform X12.8e-7141.52Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPV AKD +FKPSSENGIEE+DT+GLRKVEDGS                       GRLEEAER F+SA+QEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRVMK +DKAEPMYLEAINILEESYGSEDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
                                    RVGSALHNLGQFYLVQRKLKEAC+CYE    IKGRVLGHGH+DYADTMYHLGTVLYLLG+EKDSEALIQDS+
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI

Query:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        RILEEGGLGESILCIRRLRYLAK  + L +   +        ++HI+E+ +  W S   +
Subjt:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

A0A6J1KEU5 uncharacterized protein LOC1114926204.8e-7142.39Show/hide
Query:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL
        AA+ILG+NSNPVLAKDA  KPSSENGIEESDT+GLRKVEDGS                       G+LEEAERYF+SAIQEAKEGFGERDPHVASAFNNL
Subjt:  AAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVASAFNNL

Query:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH
        AELYRV KTFDKAEPMYLEAINILEESYGSEDI                                                                   
Subjt:  AELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIH

Query:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE
                                                                                                            
Subjt:  CGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSE

Query:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
                                    RVGSALHNLGQFYLVQRKLKEA +CYE    IKG VLG+GHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI
Subjt:  SNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYE----IKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSI

Query:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL
        RILEEGGLGES LC+RRLRYLAK  M     F          V+HI+E+ +  W S   +
Subjt:  RILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSRL

SwissProt top hitse value%identityAlignment
A5HK05 Amyloid protein-binding protein 25.2e-0641.54Show/hide
Query:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED
        L+EA    +S++Q AK+ FGE +   A  + NL  LY+ M+ F +AE M+++AI I E+  G ED
Subjt:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED

Q92624 Amyloid protein-binding protein 25.2e-0641.54Show/hide
Query:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED
        L+EA    +S++Q AK+ FGE +   A  + NL  LY+ M+ F +AE M+++AI I E+  G ED
Subjt:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED

Q9DAX9 Amyloid protein-binding protein 25.2e-0641.54Show/hide
Query:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED
        L+EA    +S++Q AK+ FGE +   A  + NL  LY+ M+ F +AE M+++AI I E+  G ED
Subjt:  LEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSED

Arabidopsis top hitse value%identityAlignment
AT5G37590.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.5e-4833.49Show/hide
Query:  IDLCGAAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVAS
        I L G A ILG   N VLA+D S K  S + ++ES   GL K+EDGS                       G+LE AER F SAIQEAKEGFGE+DPHVAS
Subjt:  IDLCGAAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGS-----------------------GRLEEAERYFISAIQEAKEGFGERDPHVAS

Query:  AFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCS
        A NNLAELYRV K FDKAEP+YLEA++ILEE YG +D+                                                              
Subjt:  AFNNLAELYRVMKTFDKAEPMYLEAINILEESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCS

Query:  PLVIHCGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTRER
                                                                                                            
Subjt:  PLVIHCGSILIHIDLVLEQFDSDMELENVSRKNLARRGWEGFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTRER

Query:  EHVSESNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYEIKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS
                                         RVG+ LHNLGQ YLVQRKL+EA  CYE+KGRVLG+ H DYA+TMYHLGTVL++LG+  D+EALI DS
Subjt:  EHVSESNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQRKLKEACHCYEIKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDS

Query:  IRILEEGGLGESILCIRRLRYLAK
        ++ILEEGG GES+  IRRLRYL++
Subjt:  IRILEEGGLGESILCIRRLRYLAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATCCATTGACCTCTGTGGTGCAGCTGTAATCCTTGGAATGAACTCCAATCCTGTCTTAGCAAAAGATGCATCTTTTAAGCCAAGTTCTGAAAATGGTATTGAGGA
GAGTGATACTGTGGGACTACGCAAAGTGGAGGATGGTTCTGGAAGACTTGAAGAAGCTGAAAGATATTTTATTTCTGCGATTCAAGAAGCTAAAGAAGGCTTTGGGGAGA
GGGATCCTCATGTTGCGTCTGCCTTCAATAATCTGGCAGAACTGTATAGAGTCATGAAAACATTTGACAAAGCAGAACCGATGTATTTGGAAGCCATCAACATATTGGAG
GAATCCTATGGCTCTGAAGATATAAGGGGTGTAAGGCAGAACATGGAGTGCTTGCAGGACATGGAGGGATGTTTGATGATGGCATGTGTTGTGGATGACTGCAATGGGTG
TGTAAGCATGATGGCACAAACTTTTGATGACAAATGTGAGAGGAATGTACCTCTATTGGGAGAGTCCAAGTACCTCGAATACTTGGGTTTATCGTGTAGTCCTTTAGTAA
TACATTGTGGTTCTATTCTGATTCATATCGATTTGGTATTAGAGCAGTTCGATTCTGACATGGAATTAGAAAATGTTAGCAGGAAGAACCTCGCAAGAAGGGGCTGGGAG
GGGTTTAAAGTGAAGGAGGCGGTATGCCCAAGGGAGGTGCACCTGCAAGGAAAAATCGTCGACAATGGAATTGTAAAGAGAGATCGTTTGTTACACACCGCCTCACAACC
TTGTGTTTGTATTGGTCTGATGTCACCAACTGTCGCTGCTTATCCATCAAGGTTGTTAACTCGTGAAAGGGAACACGTATCGGAATCAAATTTTAGTGTGTCGTGTATCG
TAACGTATCGTGTATACTGTATCTTAGGACACTCAGCTTCTTTAGTCCTACCACTTGGGAGAGTTGGTTCTGCACTTCACAACCTTGGACAGTTTTATCTAGTTCAGAGG
AAGCTAAAAGAAGCCTGCCACTGCTATGAGATCAAAGGTCGTGTTCTCGGACACGGCCATGTTGATTATGCAGATACTATGTACCATCTTGGAACAGTGCTATACCTTCT
AGGGGAGGAAAAAGATTCTGAGGCCCTGATCCAGGATTCGATAAGGATACTGGAGGAAGGTGGCTTAGGCGAGTCAATTCTCTGCATCAGAAGATTGCGATATCTTGCTA
AGGACATAATGAGTTTGCTAACCACCTTTGGGATCGTTGGAGGAGCTCATTCGATGTTAGTGGTCCACATAATAGAGATGGATGAGGATCGTTGGAGGAGTTTCTCCCGA
CTTTGCGACCTTTCCCTGAACCAAAAGTTCTTTTCTGAGAATAGAGTTGAAGATGCGATCCTATGGGTGGAAAAGATCACAAACAAAAAGGGCCACTCAGCAGAGATAGC
AAAGCTTGGACATAATGGGGGTTTAAATAAGATTACTATACCTGTGGGTGCTGAGAGAAAAGGCTGGTGTAGTTTCATTGAATGCATTAACTCTCTCATCAACAACCCTT
CAGCTGTCCTCCCTCCACCCATAAAAGAAGCCTCGGTCACCACTTCATATAAGGCAGCTTTGAAGAATCCTCCAAAAGAACCCCAGAATCATCCCACCTTGGTACCGTTA
CAACTCGATGATCCAGCCTCCCACACTCCGTTGGCAACCTTGTACTTATCCTCGGCTGTTATTGTCCTAGCCAACATCAAAGACTGGTACAAGGTTGGACAATATCAGGT
TAGATTTTTCCCATGGTCCTCTGAATACATGAGTGGAGAACAAAAGGTTCCATCATATGGCGGATGGATAAAGATTTGCAACCTTCCCATTGATAAATGGTCCCTTGAAA
CCTTGAAAAAGATTGGTGATGAATGTGGTGGATACCTGGAAACAGCAACCAAGTCCCTCACCAGATTAGATATGATGGAGGTTATGATTAAGGTGAAAAACAACTACACT
GGCTTCATCCCAGCCGAGGTTCACATTCCATCATCATCAAGAAGCCCCATCACGGTTAGAATCGACCCATTCTTTATGGAGGATTATTATATCGGATATATGGCCGGAAT
CCATGGAAAAATTCCTCTAGGCCCGTCGACAGTTGTGGAAGCATGCGTCGGAGCCCCTAAACAAGCTTGTCTCTCAGAAGACTTTCAAACGCAATCAGTATGGACCCCAC
AGCTTTCAGAAGATCACGTGATTCCCCAATCCAAAGCAGCCACCACCACCGGATCCAAAAAACCCTATCTTCACAAAATCATAAACCCTATTCCCCAATCAGCGGCCCAC
ACCAATCCACAGCCCCCAAACCAGCCCGCTCCAATAAGCCCACTTATCCTAGATGGCCCACCCCATGCCTCCACCAATCCGCATCCACCAACCTGCCCCGATACCGAAGC
CCCTACCCCCTCTTTTAGCTCCACACCAAATAGCCCAATAACAACCAATCATGACCAACATGGCCGGAAAAAGCCCATCACCATTAACAAGGAAACCTACCTCCTTACAG
GCACGATGCATTCGACTGGAACGACTTTTCACCCATCTGATTCGGAAGGAGCTCTCTCCTCTCCGTGTTCTCCGAATTTGGATGAATCTCCACCTAATCAGCAGCAAAAA
ACCACCAATCCGATACATTATCCACCAACTATCTCTCACTTATTCGAATCTATCGAAGACCAGGCTGATATGGATTACCCCACCCCTCTGAAGATCGAGGATCCCACGGG
GGCTGTATATCAGCAAACTCTTCACATGGAGACTTCTGCTCTGATTGATATTGATATTGGAGAGATGAATGAAGATGAATATGACTCCGAGACCCACTACCAACAGAGAG
ACCCTGCGGCTTATCTTCCTTATCTTTTTCCTTGGCTGGCCGAACATGGCATGGGCATGGGCTCTTGGAAGAAAAGAGCCCTCATCAAAGACTTCATCACCTCAAAAAAT
CTTGCCATTGTCATCCTCCAAGAAACAAAGCTCCCTTCCTTTGATAGAAAGACGGCTAAATCAGTGTGGAGTTCAAAAAACATTGCTTGGACAGCTCTTCACGCCTATGG
TGCATCGAGGGGCATAGCCATTTTGTGGAATGAGTCATCTTTCCGTGTTTTGGAGATCGTCGAAGAACTCCAAGACCTTCAGGCTCTTTGCTTACCAAATTGGATAGTTG
GGGGTGATTTCAACATAACTAGATGGTCATGGGAGAAATCGACCAACTCAGCCCCCACCCGTGGAATGAGGAAATTTAACAAGTTCATTGAATCTTCAGGGCTACAGGAT
ATTCCACTCTCCAATGGAAAATACACATGGTCTAGTTTTCGGCCTAATCCCACCATGACTCTTATCGATAGATTCCTCATCTCTGACAACATCTCCCTTAAATTTTTATC
AGCACAGGTTCGAAAGCTAGAACGGAACACATCAGACCATTTCCCTTTATGTCTCACCCTTGGCAAAGAAAAATGGGGCCCACCACCGTTCCGTTTCCTCAACGGATGGC
TGTCACATAAATCTCTCTTGCAGATGGTGGATGTATGGTGGAATTCGAATATGCTACAAGGGTGGCCGGGGCACGGTTTTATGGCTAAATTAAAAGGGTTGAAAAAGGAA
ATAAAACAGTGGAATCAACAAACCTACAGCAAGCAAAGGGACCGAAAAACAGCTTTGAGCATTGAGGAAAATGGCCTCCTAACCGAACAAGATATTAGTCGTAGATTGAT
TATTAAAGCAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACATCCATTGACCTCTGTGGTGCAGCTGTAATCCTTGGAATGAACTCCAATCCTGTCTTAGCAAAAGATGCATCTTTTAAGCCAAGTTCTGAAAATGGTATTGAGGA
GAGTGATACTGTGGGACTACGCAAAGTGGAGGATGGTTCTGGAAGACTTGAAGAAGCTGAAAGATATTTTATTTCTGCGATTCAAGAAGCTAAAGAAGGCTTTGGGGAGA
GGGATCCTCATGTTGCGTCTGCCTTCAATAATCTGGCAGAACTGTATAGAGTCATGAAAACATTTGACAAAGCAGAACCGATGTATTTGGAAGCCATCAACATATTGGAG
GAATCCTATGGCTCTGAAGATATAAGGGGTGTAAGGCAGAACATGGAGTGCTTGCAGGACATGGAGGGATGTTTGATGATGGCATGTGTTGTGGATGACTGCAATGGGTG
TGTAAGCATGATGGCACAAACTTTTGATGACAAATGTGAGAGGAATGTACCTCTATTGGGAGAGTCCAAGTACCTCGAATACTTGGGTTTATCGTGTAGTCCTTTAGTAA
TACATTGTGGTTCTATTCTGATTCATATCGATTTGGTATTAGAGCAGTTCGATTCTGACATGGAATTAGAAAATGTTAGCAGGAAGAACCTCGCAAGAAGGGGCTGGGAG
GGGTTTAAAGTGAAGGAGGCGGTATGCCCAAGGGAGGTGCACCTGCAAGGAAAAATCGTCGACAATGGAATTGTAAAGAGAGATCGTTTGTTACACACCGCCTCACAACC
TTGTGTTTGTATTGGTCTGATGTCACCAACTGTCGCTGCTTATCCATCAAGGTTGTTAACTCGTGAAAGGGAACACGTATCGGAATCAAATTTTAGTGTGTCGTGTATCG
TAACGTATCGTGTATACTGTATCTTAGGACACTCAGCTTCTTTAGTCCTACCACTTGGGAGAGTTGGTTCTGCACTTCACAACCTTGGACAGTTTTATCTAGTTCAGAGG
AAGCTAAAAGAAGCCTGCCACTGCTATGAGATCAAAGGTCGTGTTCTCGGACACGGCCATGTTGATTATGCAGATACTATGTACCATCTTGGAACAGTGCTATACCTTCT
AGGGGAGGAAAAAGATTCTGAGGCCCTGATCCAGGATTCGATAAGGATACTGGAGGAAGGTGGCTTAGGCGAGTCAATTCTCTGCATCAGAAGATTGCGATATCTTGCTA
AGGACATAATGAGTTTGCTAACCACCTTTGGGATCGTTGGAGGAGCTCATTCGATGTTAGTGGTCCACATAATAGAGATGGATGAGGATCGTTGGAGGAGTTTCTCCCGA
CTTTGCGACCTTTCCCTGAACCAAAAGTTCTTTTCTGAGAATAGAGTTGAAGATGCGATCCTATGGGTGGAAAAGATCACAAACAAAAAGGGCCACTCAGCAGAGATAGC
AAAGCTTGGACATAATGGGGGTTTAAATAAGATTACTATACCTGTGGGTGCTGAGAGAAAAGGCTGGTGTAGTTTCATTGAATGCATTAACTCTCTCATCAACAACCCTT
CAGCTGTCCTCCCTCCACCCATAAAAGAAGCCTCGGTCACCACTTCATATAAGGCAGCTTTGAAGAATCCTCCAAAAGAACCCCAGAATCATCCCACCTTGGTACCGTTA
CAACTCGATGATCCAGCCTCCCACACTCCGTTGGCAACCTTGTACTTATCCTCGGCTGTTATTGTCCTAGCCAACATCAAAGACTGGTACAAGGTTGGACAATATCAGGT
TAGATTTTTCCCATGGTCCTCTGAATACATGAGTGGAGAACAAAAGGTTCCATCATATGGCGGATGGATAAAGATTTGCAACCTTCCCATTGATAAATGGTCCCTTGAAA
CCTTGAAAAAGATTGGTGATGAATGTGGTGGATACCTGGAAACAGCAACCAAGTCCCTCACCAGATTAGATATGATGGAGGTTATGATTAAGGTGAAAAACAACTACACT
GGCTTCATCCCAGCCGAGGTTCACATTCCATCATCATCAAGAAGCCCCATCACGGTTAGAATCGACCCATTCTTTATGGAGGATTATTATATCGGATATATGGCCGGAAT
CCATGGAAAAATTCCTCTAGGCCCGTCGACAGTTGTGGAAGCATGCGTCGGAGCCCCTAAACAAGCTTGTCTCTCAGAAGACTTTCAAACGCAATCAGTATGGACCCCAC
AGCTTTCAGAAGATCACGTGATTCCCCAATCCAAAGCAGCCACCACCACCGGATCCAAAAAACCCTATCTTCACAAAATCATAAACCCTATTCCCCAATCAGCGGCCCAC
ACCAATCCACAGCCCCCAAACCAGCCCGCTCCAATAAGCCCACTTATCCTAGATGGCCCACCCCATGCCTCCACCAATCCGCATCCACCAACCTGCCCCGATACCGAAGC
CCCTACCCCCTCTTTTAGCTCCACACCAAATAGCCCAATAACAACCAATCATGACCAACATGGCCGGAAAAAGCCCATCACCATTAACAAGGAAACCTACCTCCTTACAG
GCACGATGCATTCGACTGGAACGACTTTTCACCCATCTGATTCGGAAGGAGCTCTCTCCTCTCCGTGTTCTCCGAATTTGGATGAATCTCCACCTAATCAGCAGCAAAAA
ACCACCAATCCGATACATTATCCACCAACTATCTCTCACTTATTCGAATCTATCGAAGACCAGGCTGATATGGATTACCCCACCCCTCTGAAGATCGAGGATCCCACGGG
GGCTGTATATCAGCAAACTCTTCACATGGAGACTTCTGCTCTGATTGATATTGATATTGGAGAGATGAATGAAGATGAATATGACTCCGAGACCCACTACCAACAGAGAG
ACCCTGCGGCTTATCTTCCTTATCTTTTTCCTTGGCTGGCCGAACATGGCATGGGCATGGGCTCTTGGAAGAAAAGAGCCCTCATCAAAGACTTCATCACCTCAAAAAAT
CTTGCCATTGTCATCCTCCAAGAAACAAAGCTCCCTTCCTTTGATAGAAAGACGGCTAAATCAGTGTGGAGTTCAAAAAACATTGCTTGGACAGCTCTTCACGCCTATGG
TGCATCGAGGGGCATAGCCATTTTGTGGAATGAGTCATCTTTCCGTGTTTTGGAGATCGTCGAAGAACTCCAAGACCTTCAGGCTCTTTGCTTACCAAATTGGATAGTTG
GGGGTGATTTCAACATAACTAGATGGTCATGGGAGAAATCGACCAACTCAGCCCCCACCCGTGGAATGAGGAAATTTAACAAGTTCATTGAATCTTCAGGGCTACAGGAT
ATTCCACTCTCCAATGGAAAATACACATGGTCTAGTTTTCGGCCTAATCCCACCATGACTCTTATCGATAGATTCCTCATCTCTGACAACATCTCCCTTAAATTTTTATC
AGCACAGGTTCGAAAGCTAGAACGGAACACATCAGACCATTTCCCTTTATGTCTCACCCTTGGCAAAGAAAAATGGGGCCCACCACCGTTCCGTTTCCTCAACGGATGGC
TGTCACATAAATCTCTCTTGCAGATGGTGGATGTATGGTGGAATTCGAATATGCTACAAGGGTGGCCGGGGCACGGTTTTATGGCTAAATTAAAAGGGTTGAAAAAGGAA
ATAAAACAGTGGAATCAACAAACCTACAGCAAGCAAAGGGACCGAAAAACAGCTTTGAGCATTGAGGAAAATGGCCTCCTAACCGAACAAGATATTAGTCGTAGATTGAT
TATTAAAGCAGAGTAA
Protein sequenceShow/hide protein sequence
MTSIDLCGAAVILGMNSNPVLAKDASFKPSSENGIEESDTVGLRKVEDGSGRLEEAERYFISAIQEAKEGFGERDPHVASAFNNLAELYRVMKTFDKAEPMYLEAINILE
ESYGSEDIRGVRQNMECLQDMEGCLMMACVVDDCNGCVSMMAQTFDDKCERNVPLLGESKYLEYLGLSCSPLVIHCGSILIHIDLVLEQFDSDMELENVSRKNLARRGWE
GFKVKEAVCPREVHLQGKIVDNGIVKRDRLLHTASQPCVCIGLMSPTVAAYPSRLLTREREHVSESNFSVSCIVTYRVYCILGHSASLVLPLGRVGSALHNLGQFYLVQR
KLKEACHCYEIKGRVLGHGHVDYADTMYHLGTVLYLLGEEKDSEALIQDSIRILEEGGLGESILCIRRLRYLAKDIMSLLTTFGIVGGAHSMLVVHIIEMDEDRWRSFSR
LCDLSLNQKFFSENRVEDAILWVEKITNKKGHSAEIAKLGHNGGLNKITIPVGAERKGWCSFIECINSLINNPSAVLPPPIKEASVTTSYKAALKNPPKEPQNHPTLVPL
QLDDPASHTPLATLYLSSAVIVLANIKDWYKVGQYQVRFFPWSSEYMSGEQKVPSYGGWIKICNLPIDKWSLETLKKIGDECGGYLETATKSLTRLDMMEVMIKVKNNYT
GFIPAEVHIPSSSRSPITVRIDPFFMEDYYIGYMAGIHGKIPLGPSTVVEACVGAPKQACLSEDFQTQSVWTPQLSEDHVIPQSKAATTTGSKKPYLHKIINPIPQSAAH
TNPQPPNQPAPISPLILDGPPHASTNPHPPTCPDTEAPTPSFSSTPNSPITTNHDQHGRKKPITINKETYLLTGTMHSTGTTFHPSDSEGALSSPCSPNLDESPPNQQQK
TTNPIHYPPTISHLFESIEDQADMDYPTPLKIEDPTGAVYQQTLHMETSALIDIDIGEMNEDEYDSETHYQQRDPAAYLPYLFPWLAEHGMGMGSWKKRALIKDFITSKN
LAIVILQETKLPSFDRKTAKSVWSSKNIAWTALHAYGASRGIAILWNESSFRVLEIVEELQDLQALCLPNWIVGGDFNITRWSWEKSTNSAPTRGMRKFNKFIESSGLQD
IPLSNGKYTWSSFRPNPTMTLIDRFLISDNISLKFLSAQVRKLERNTSDHFPLCLTLGKEKWGPPPFRFLNGWLSHKSLLQMVDVWWNSNMLQGWPGHGFMAKLKGLKKE
IKQWNQQTYSKQRDRKTALSIEENGLLTEQDISRRLIIKAE