; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G006090 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G006090
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationchr09:6611630..6617339
RNA-Seq ExpressionLsi09G006090
SyntenyLsi09G006090
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.0e+0083.01Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLD---EKEEEANSINDATSQVNGRTSDLEDGDHSSDE
        MMNR+SV+VAVSIAAY+I+QLTIRSWTS FLPTNCS                 ENGED +KNGLD   E+EEEA+SINDATSQVNGRTSDLEDGDHSSDE
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLD---EKEEEANSINDATSQVNGRTSDLEDGDHSSDE

Query:  FRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKI
         + LLP R  E+  L   KKEEKVPE   E+NKIE ERLLKLVMELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKN+DISMLN TISSLQAERKI
Subjt:  FRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKI

Query:  LKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLE
        LKEEI+KGALMKKELEEAR KI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKDFEVELGELK KNRELQHEKQELTSKLE
Subjt:  LKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLE

Query:  VMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEY
        VMKARIKTLTKM ESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQLM+EY
Subjt:  VMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEY

Query:  AGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQ
        AG+E GQ ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSF+E+P  N+SLKKLIRN+GGLSAVS P I GSSHRWKDPLEAV+ALSAETLTLSEVRLQ
Subjt:  AGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQ

Query:  VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQY
        VSSRKSVNSVATSFQLMSK+VEESL+QKYSTYKEH+KLA+GSEK+IKEKAE+E+ KSSG++SSLN+EY D SMRKK A LPL+LAQMK NK SC+P+SQ 
Subjt:  VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQY

Query:  DNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDAL
        DNDS N+ISNP S+GGE HRG ELV+FN+KMMK EVKA METQGD LVVALAMEVREA F+NMED+VSFVI LDEKLSSLVDGMEILEHFDWP  KTDAL
Subjt:  DNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDAL

Query:  REAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNA
        REAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV  L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYMKRIV EHNA
Subjt:  REAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNA

Query:  LSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        LSGP+KEPNREFLLFQGVRFASRVHK           FAGGFDS+SMKAFEELR+RV TE GQKN
Subjt:  LSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]0.0e+0082.04Show/hide
Query:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE
        MNR+SV+VAVSIAAY+I+QLTIRSWTS FLP TNCS                 ENGEDV+KN          GL +E+EEEANSI+D TSQVNGRTSDLE
Subjt:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE

Query:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI
        DGDHSSDEF+ LL   R  E+  LDDN+KEEKVPE  IEN+KIELERLLKL+MELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKNDDISMLN TI
Subjt:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI

Query:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE
        SSLQAERKILKEEI+KGALMKKELEE RGKI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKDFEVE GELK KNRELQHE
Subjt:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE

Query:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE
         QELTSKLEVMKARIKTLTKM E+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AGESARYLNKSSSPKSKE
Subjt:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE

Query:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET
        KAKQLM+EYAG E G+AETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF E+P  N+SLKKLIRN+GG SAVS P  I SSHRWKDPLEAV+ALSAET
Subjt:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET

Query:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK
        LTLSEVRLQVSSRKSVNSVATSFQLMSK+VE+SL+QKYSTYKEHHKLA+GSEK+IKEK ENER KSSG++SS N+EY+D SMRK  A L L+LAQMKMNK
Subjt:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK

Query:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD
         SC+P+SQYDN+S N IS+P S+GGE HRG ELV+FN+KMMK EVK  METQ D LV+ALAMEVREASF+NMED+VSFVIWLDEKLSSLVDGMEILEHFD
Subjt:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD

Query:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM
        WPK KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVY L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYM
Subjt:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM

Query:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        KRIV EHNALSGPEKEPNREFLLFQGVRFASRVHK           FAGGFDS+SMKAFEELRSRV TE GQ+N
Subjt:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]0.0e+0082.89Show/hide
Query:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE
        MNR+SV+VAVSIAAY+I+QLTIRSWTS FLP TNCS                 ENGEDV+KN          GL +E+EEEANSI+D TSQVNGRTSDLE
Subjt:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE

Query:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI
        DGDHSSDEF+ LL   R  E+  LDDN+KEEKVPE  IEN+KIELERLLKL+MELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKNDDISMLN TI
Subjt:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI

Query:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE
        SSLQAERKILKEEI+KGALMKKELEE RGKI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKDFEVE GELK KNRELQHE
Subjt:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE

Query:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE
         QELTSKLEVMKARIKTLTKM E+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AGESARYLNKSSSPKSKE
Subjt:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE

Query:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET
        KAKQLM+EYAG E G+AETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF E+P  N+SLKKLIRN+GG SAVS P  I SSHRWKDPLEAV+ALSAET
Subjt:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET

Query:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK
        LTLSEVRLQVSSRKSVNSVATSFQLMSK+VE+SL+QKYSTYKEHHKLA+GSEK+IKEK ENER KSSG++SS N+EY+D SMRK  A L L+LAQMKMNK
Subjt:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK

Query:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD
         SC+P+SQYDN+S N IS+P S+GGE HRG ELV+FN+KMMK EVK  METQ D LV+ALAMEVREASF+NMED+VSFVIWLDEKLSSLVDGMEILEHFD
Subjt:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD

Query:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM
        WPK KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVY L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYM
Subjt:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM

Query:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKD
        KRIV EHNALSGPEKEPNREFLLFQGVRFASRVHK+
Subjt:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKD

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]0.0e+0079.06Show/hide
Query:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE
        MNR+SV+VAVSIAAY+I+QLTIRSWTS FLP TNCS                 ENGEDV+KN          GL +E+EEEANSI+D TSQVNGRTSDLE
Subjt:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN----------GL-DEKEEEANSINDATSQVNGRTSDLE

Query:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI
        DGDHSSDEF+ LL   R  E+  LDDN+KEEKVPE  IEN+KIELERLLKL+MELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKNDDISMLN TI
Subjt:  DGDHSSDEFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITI

Query:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE
        SSLQAERKILKEEI+KGALMKKELEE RGKI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKDFEVE GELK KNRELQHE
Subjt:  SSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHE

Query:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE
         QELTSKLEVMKARIKTLTKM E+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AG+               
Subjt:  KQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKE

Query:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET
                    E G+AETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF E+P  N+SLKKLIRN+GG SAVS P  I SSHRWKDPLEAV+ALSAET
Subjt:  KAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAET

Query:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK
        LTLSEVRLQVSSRKSVNSVATSFQLMSK+VE+SL+QKYSTYKEHHKLA+GSEK+IKEK ENER KSSG++SS N+EY+D SMRK  A L L+LAQMKMNK
Subjt:  LTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNK

Query:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD
         SC+P+SQYDN+S N IS+P S+GGE HRG ELV+FN+KMMK EVK  METQ D LV+ALAMEVREASF+NMED+VSFVIWLDEKLSSLVDGMEILEHFD
Subjt:  TSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFD

Query:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM
        WPK KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVY L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYM
Subjt:  WPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYM

Query:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        KRIV EHNALSGPEKEPNREFLLFQGVRFASRVHK           FAGGFDS+SMKAFEELRSRV TE GQ+N
Subjt:  KRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

XP_038906491.1 protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida]0.0e+0085.35Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE
        MMNRL +LVAVSI AY+IRQLTIRSW+SLF P NCS                 ENGED +KNGLDE+EEEANSIND TSQVNGRTSD+EDGDH SDEFR 
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE

Query:  LLPREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKILKEE
        LLPRE E+ SLDDNKKEEKVPEIQIENNKIELERL+KLVMELEERK KLEGEL MCD IKYSETDV ELRKQL+AKNDDISMLNITISSLQAERKIL+EE
Subjt:  LLPREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKILKEE

Query:  IVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMKA
        I+KGALMKKELE ARGKI+ELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEA+KKEAELYKKQKAAKDFEVELGELKRKNRELQHEK EL SKLEVMKA
Subjt:  IVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMKA

Query:  RIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGLE
        RIKTLTKM ESEI+TKEREEAQKLKSENEDLIK LERLQMNRF+EVEELVY RWINACLRYELRDNEIS GESARYLNKS SPKSKEKAKQLM+EYAGLE
Subjt:  RIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGLE

Query:  SGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQVSSR
        SGQ ETDHESNFSHPFSSGIED+DNTSIDSSRSRTSSF E+P  N+SLKKLIRN GG SAVS P IIGSSHRWKDPLEAV+ALSAETLTLSEVRLQVSS 
Subjt:  SGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQVSSR

Query:  KSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDS
        KSVNSVATSFQLMSK+V+ESLKQKYSTYKEH KLALGSEK+IKEKA NER KSSG+A SL  EYDDT++RKKPAILPL+L QMKMN+TS DP+SQ+DNDS
Subjt:  KSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDS

Query:  KNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAF
        KNMISNP S+GGE HRGPELVRFN+K+MK EV AD+ETQGDLVVALAMEVREASF+NMEDVVSF+I LDEK  SLV+GMEIL+HFDWPK KTDAL EAAF
Subjt:  KNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAF

Query:  EYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPE
         YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSL+DKVEQSVYGLF TRDT  S+YEELGIP+DWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPE
Subjt:  EYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPE

Query:  KEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        KEP+REFLLFQGVRFASR+HK           FAGGFD ESMKAFEELRSRV TEAGQKN
Subjt:  KEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

TrEMBL top hitse value%identityAlignment
A0A061ECQ9 Hydroxyproline-rich glycoprotein family protein isoform 42.0e-20149.89Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTI---RSWTSLFLPTNCSAYAFS---NSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHS
        M+ R+  +VA SIAA++++QL +   +S TSL   +     +F    N       F Y  +    +    +E+EE+   I+   ++VNG   D+ D D  
Subjt:  MMNRLSVLVAVSIAAYSIRQLTI---RSWTSLFLPTNCSAYAFS---NSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHS

Query:  SDEFRELLPREFEHRSLDD---NKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQ
          EF +LL  E E+    D     + EK+ E ++ NN  ELERL  LV ELEER+VKLEGEL    G+K  E+D+ EL++QL+ K  +I MLNITISSLQ
Subjt:  SDEFRELLPREFEHRSLDD---NKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQ

Query:  AERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQEL
        +ERK L+E+I  GA +KKELE AR KI+ELQRQIQLDANQTK  LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E+ EL+RKN+ELQHEK+EL
Subjt:  AERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQEL

Query:  TSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAK
        T KL+  +A+I  L+ M E+EI  + REE   L+  NEDL+KQ+E LQMNRFSEVEELVY RW+NACLRYELR+ +   G+ SAR LNKS SPKS+E AK
Subjt:  TSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAK

Query:  QLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAV
        QL++EYAG E GQ +TD ESNFSHP S+G EDLDN SI SS SR SS +++P++   LKK  R+K   SAVS P          RI  S H  + PLEA+
Subjt:  QLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAV

Query:  VALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSL
        +  +A                    ET T+  +R QVSS  S NSVATSF LMS++V+ SL++KY  YK+ HKLAL  EK+IK+KA+  R +  G+ S+ 
Subjt:  VALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSL

Query:  NIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGH---RGPELVRFNKKMM-----------------------------
           +   + R+KP ILP +LAQ+K        +S   ND K + S   S     H   R P + R   K                               
Subjt:  NIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGH---RGPELVRFNKKMM-----------------------------

Query:  --------------KLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSS
                         VKAD+ETQGD V +LA E+R ASFT++ED+V+FV WLDE+LS LVD   +L+HFDWP+ K DALREAAFEYQ L+KL++++SS
Subjt:  --------------KLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSS

Query:  FVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVR
        FVD+P L CE AL KM  LL+KVEQSVY L  TRD A SRY+E GIP++WLLD GVVGKIK+S V+LARKYMKR+ +E + L+GPEKEPNREF+L QG+R
Subjt:  FVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVR

Query:  FASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        FA RVH+           FAGGFD+ESMKAFEELRSRV ++ G+ N
Subjt:  FASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

A0A0A0KT25 Uncharacterized protein0.0e+0082.7Show/hide
Query:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN---GL-DEKEEEANSINDATSQVNGRTSDLEDGDHSSD
        MNR+SV+VAVSIAAY+I+QLTIRSWTS FLP TNCS                 ENGEDV+KN   GL +E+EEEANSI+D TSQVNGRTSDLEDGDHSSD
Subjt:  MNRLSVLVAVSIAAYSIRQLTIRSWTSLFLP-TNCSAYAFSNSVFIFVSFIYIENGEDVEKN---GL-DEKEEEANSINDATSQVNGRTSDLEDGDHSSD

Query:  EFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAER
        EF+ LL   R  E+  LDDN+KEEKVPE  IEN+KIELERLLKL+MELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKNDDISMLN TISSLQAER
Subjt:  EFRELL--PREFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAER

Query:  KILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSK
        KILKEEI+KGALMKKELEE RGKI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKDFEVE GELK KNRELQHE QELTSK
Subjt:  KILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSK

Query:  LEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMI
        LEVMKARIKTLTKM E+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQLM+
Subjt:  LEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMI

Query:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVR
        EYAG E G+AETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF E+P  N+SLKKLIRN+GG SAVS P  I SSHRWKDPLEAV+ALSAETLTLSEVR
Subjt:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVR

Query:  LQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNS
        LQVSSRKSVNSVATSFQLMSK+VE+SL+QKYSTYKEHHKLA+GSEK+IKEK ENER KSSG++SS N+EY+D SMRK  A L L+LAQMKMNK SC+P+S
Subjt:  LQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNS

Query:  QYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTD
        QYDN+S N IS+P S+GGE HRG ELV+FN+KMMK EVK  METQ D LV+ALAMEVREASF+NMED+VSFVIWLDEKLSSLVDGMEILEHFDWPK KTD
Subjt:  QYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTD

Query:  ALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEH
        ALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVY L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYMKRIV EH
Subjt:  ALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEH

Query:  NALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        NALSGPEKEPNREFLLFQGVRFASRVHK           FAGGFDS+SMKAFEELRSRV TE GQ+N
Subjt:  NALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

A0A1S3CSZ9 protein CHUP1, chloroplastic-like0.0e+0083.01Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLD---EKEEEANSINDATSQVNGRTSDLEDGDHSSDE
        MMNR+SV+VAVSIAAY+I+QLTIRSWTS FLPTNCS                 ENGED +KNGLD   E+EEEA+SINDATSQVNGRTSDLEDGDHSSDE
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLD---EKEEEANSINDATSQVNGRTSDLEDGDHSSDE

Query:  FRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKI
         + LLP R  E+  L   KKEEKVPE   E+NKIE ERLLKLVMELEERKVKLEGEL MCDGIKYSETDVMELRKQL+AKN+DISMLN TISSLQAERKI
Subjt:  FRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKI

Query:  LKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLE
        LKEEI+KGALMKKELEEAR KI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKDFEVELGELK KNRELQHEKQELTSKLE
Subjt:  LKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLE

Query:  VMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEY
        VMKARIKTLTKM ESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQLM+EY
Subjt:  VMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEY

Query:  AGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQ
        AG+E GQ ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSF+E+P  N+SLKKLIRN+GGLSAVS P I GSSHRWKDPLEAV+ALSAETLTLSEVRLQ
Subjt:  AGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQ

Query:  VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQY
        VSSRKSVNSVATSFQLMSK+VEESL+QKYSTYKEH+KLA+GSEK+IKEKAE+E+ KSSG++SSLN+EY D SMRKK A LPL+LAQMK NK SC+P+SQ 
Subjt:  VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQY

Query:  DNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDAL
        DNDS N+ISNP S+GGE HRG ELV+FN+KMMK EVKA METQGD LVVALAMEVREA F+NMED+VSFVI LDEKLSSLVDGMEILEHFDWP  KTDAL
Subjt:  DNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDAL

Query:  REAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNA
        REAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV  L  TRDT  SRYEELGIP+DWLLDCGVVGKIKV CVELARKYMKRIV EHNA
Subjt:  REAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNA

Query:  LSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN
        LSGP+KEPNREFLLFQGVRFASRVHK           FAGGFDS+SMKAFEELR+RV TE GQKN
Subjt:  LSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN

A0A5D3BMR7 Protein CHUP10.0e+0086.3Show/hide
Query:  DEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSE
        +E+EEEA+SINDATSQVNGRTSDLEDGDHSSDE + LLP R  E+  L   KKEEKVPE   ENNKIE ERLLKLVMELEERKVKLEGEL MCDGIKYSE
Subjt:  DEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRELLP-REFEHRSLDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSE

Query:  TDVMELRKQLEAKNDDISMLNITISSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK
        TDVMELRKQL+AKN+DISMLN TISSLQAERKILKEEI+KGALMKKELEEARGKI+ELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KK
Subjt:  TDVMELRKQLEAKNDDISMLNITISSLQAERKILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK

Query:  QKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYEL
        QKAAKDFEVELGELK KNRELQHEKQELTSKLEVMKARIKTLTKM ESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVY RWINACLRYEL
Subjt:  QKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYEL

Query:  RDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSF
        R+N+I AGESARYLNKSSSPKS+EKAKQLM+EYAG+E GQ ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSF+E+P  N+SLKKLIRN+GGLSAVS 
Subjt:  RDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERP--NMSLKKLIRNKGGLSAVSF

Query:  PRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIE
        P I GSSHRWKDPLEAV+ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSK+VEESL+QKYSTYKEHHKLA+GSEK+IKEKAE+E+ KSSG++SSLN+E
Subjt:  PRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREKSSGNASSLNIE

Query:  YDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVV
        Y D SMRKK A LPL+LAQMK NK SC+P+SQ DNDS N+ISNP S+GGE HRG ELV+FN+KMMK EVKA METQGD LVVALAMEVREA F+NMED+V
Subjt:  YDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGD-LVVALAMEVREASFTNMEDVV

Query:  SFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPM
        SFVI LDEKLSSLVDGMEILEHFDWP  KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV  L  TRDT  SRYEELGIP+
Subjt:  SFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPM

Query:  DWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHK
        DWLLDCGVVGKIKV CVELARKYMKRIV EHN LSGP+KEPNREFLLFQGVRFASRVHK
Subjt:  DWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHK

A0A6J1DWY5 protein CHUP1, chloroplastic-like1.2e-25964.43Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE
        +M +L VLVAVSIAAY+I+QLTIRSW+S  LPTNCS                 ENGE  EKNGLD +E++ NSIN A SQV+G +SD         E RE
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE

Query:  LLPREFEHRSLDDNKKEE-KVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKILKE
        LLPR+ E R LD NKKEE KVPE  +ENNKIEL+RLLKLVMELEERKVKLE EL M D +K  ++D  EL K+LEAK++D+SMLNITISSLQAERK L+E
Subjt:  LLPREFEHRSLDDNKKEE-KVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKILKE

Query:  EIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMK
        EIVKGA MKKELEEA+GKI+ELQRQ+QLDANQTKEHL  LK+RVS LQAKEEEAVKKEA+LY+K KAAK FE+ELGELK+KNR+LQ EK+ELTSKLEVM+
Subjt:  EIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMK

Query:  ARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGL
        ARI TLT + ESEIIT+EREE +KL+  NE+L KQLE LQMNRFSEVEELVY RW+NACLRYELRDNE   GESA  L+KS SPKSKEKAKQLM+EYAGL
Subjt:  ARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGL

Query:  ESGQAETDHESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFNERPNMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTL-SEVRLQVSS
          GQ ETDHESNFSHP FSSGIED DNTS  SSRSRTSSF                               RWKDPLEA VA S ETLT  SEV+ QVSS
Subjt:  ESGQAETDHESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFNERPNMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTL-SEVRLQVSS

Query:  RKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLAL--GSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYD
        R SVNSVATSFQ MS++ EES+KQKYS YKEHHKL +  G EK+IKEKAE ER K+                                   SC       
Subjt:  RKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLAL--GSEKKIKEKAENEREKSSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYD

Query:  NDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVD-GMEILEHFDWPKLKTDALR
                         +  PE VRF++K+MK EVKADMET+GDLV+ L M+V+  SFTNMEDVVSFVIWLD+K SSLVD  + ILEHFDWP+ K+DALR
Subjt:  NDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVD-GMEILEHFDWPKLKTDALR

Query:  EAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNAL
        EAA EYQ LMKL EEVSSFVD+PKLT EVAL  M+SLL K+EQSV+ +   R+ A S+YEELGIP+DWLLD GVVGK+KV  VELARKYMKRI+NE NAL
Subjt:  EAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNAL

Query:  SGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQK
        SGP KEPNREFLL QGVRFASRVH+           FAGGFD ESMKAFEELR+R+ TEAGQK
Subjt:  SGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.9e-18544.69Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE
        M  R+  +VA SIAA ++++L ++       P+  S           V   Y  N +++++   +E+EEE   IN   +Q  G  SD  D D    EF +
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE

Query:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK
        LL  E E+   DD+   EK       E+++  N  ELERL +LV ELEER+VKLEGEL    G+K  E+D++EL++QL+ K  +I MLNITI+SLQAERK
Subjt:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK

Query:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL
         L+EE+ +  +++KELE AR KI+ELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL
Subjt:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL

Query:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI
        +  +ARI TL+ M ES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVY RW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LM+
Subjt:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI

Query:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS
        EYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSF+++P +   LKK  ++K   S  S P          R+  S ++ + PLE+++  +
Subjt:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS

Query:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----
        A                    ET  L  +R Q    S  + +NSVA SF +MSK+V+  L +KY  YK+ HKLA+  EK IK KA+  R E+  GN    
Subjt:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----

Query:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------
                               ++  N   +  +      +  ++L  ++          P S     S N+                           
Subjt:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------

Query:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA
                     +      G + HR PELV F + +MK                                     L VKAD+ETQGD V +LA EVR +
Subjt:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA

Query:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS
        SFT++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQSVY L  TRD A S
Subjt:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS

Query:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+           FAGGFD+ESMKAFEELRSR +
Subjt:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR

Query:  TEAGQKN
        TE+G  N
Subjt:  TEAGQKN

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein5.5e-4738.91Show/hide
Query:  QLAQMKMNKTSCDPNSQYDNDSKNMISNP-NSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVD
        QL Q+   + +    SQ  N +K+ +++  NS  GE          N+    + +KAD+ET+G+ +  L  +V    F++MEDV+ FV WLD++L++L D
Subjt:  QLAQMKMNKTSCDPNSQYDNDSKNMISNP-NSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWLDEKLSSLVD

Query:  GMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVS
           +L+HF WP+ K D L+EAA EY++L KL++E+SS+ D+P +   VAL KM +LLDK EQ +  L   R ++   Y++  IP++W+LD G++ KIK +
Subjt:  GMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVGKIKVS

Query:  CVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRV
         ++LA+ YM R+ NE  +    ++E  +E LL QGVRFA R H+           FAGG D E++ A EE++ RV
Subjt:  CVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.4e-18644.69Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE
        M  R+  +VA SIAA ++++L ++       P+  S           V   Y  N +++++   +E+EEE   IN   +Q  G  SD  D D    EF +
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE

Query:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK
        LL  E E+   DD+   EK       E+++  N  ELERL +LV ELEER+VKLEGEL    G+K  E+D++EL++QL+ K  +I MLNITI+SLQAERK
Subjt:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK

Query:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL
         L+EE+ +  +++KELE AR KI+ELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL
Subjt:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL

Query:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI
        +  +ARI TL+ M ES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVY RW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LM+
Subjt:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI

Query:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS
        EYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSF+++P +   LKK  ++K   S  S P          R+  S ++ + PLE+++  +
Subjt:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS

Query:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----
        A                    ET  L  +R Q    S  + +NSVA SF +MSK+V+  L +KY  YK+ HKLA+  EK IK KA+  R E+  GN    
Subjt:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----

Query:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------
                               ++  N   +  +      +  ++L  ++          P S     S N+                           
Subjt:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------

Query:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA
                     +      G + HR PELV F + +MK                                     L VKAD+ETQGD V +LA EVR +
Subjt:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA

Query:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS
        SFT++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQSVY L  TRD A S
Subjt:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS

Query:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+           FAGGFD+ESMKAFEELRSR +
Subjt:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR

Query:  TEAGQKN
        TE+G  N
Subjt:  TEAGQKN

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.4e-18644.69Show/hide
Query:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE
        M  R+  +VA SIAA ++++L ++       P+  S           V   Y  N +++++   +E+EEE   IN   +Q  G  SD  D D    EF +
Subjt:  MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRE

Query:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK
        LL  E E+   DD+   EK       E+++  N  ELERL +LV ELEER+VKLEGEL    G+K  E+D++EL++QL+ K  +I MLNITI+SLQAERK
Subjt:  LLPREFEHRSLDDNKKEEKVP-----EIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERK

Query:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL
         L+EE+ +  +++KELE AR KI+ELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL
Subjt:  ILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKL

Query:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI
        +  +ARI TL+ M ES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVY RW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LM+
Subjt:  EVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMI

Query:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS
        EYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSF+++P +   LKK  ++K   S  S P          R+  S ++ + PLE+++  +
Subjt:  EYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVALS

Query:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----
        A                    ET  L  +R Q    S  + +NSVA SF +MSK+V+  L +KY  YK+ HKLA+  EK IK KA+  R E+  GN    
Subjt:  A--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN----

Query:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------
                               ++  N   +  +      +  ++L  ++          P S     S N+                           
Subjt:  -----------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM---------------------------

Query:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA
                     +      G + HR PELV F + +MK                                     L VKAD+ETQGD V +LA EVR +
Subjt:  -------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVREA

Query:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS
        SFT++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQSVY L  TRD A S
Subjt:  SFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANS

Query:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+           FAGGFD+ESMKAFEELRSR +
Subjt:  RYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVR

Query:  TEAGQKN
        TE+G  N
Subjt:  TEAGQKN

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein6.3e-16045.92Show/hide
Query:  KILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSK
        K L+EE+ +  +++KELE AR KI+ELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ K
Subjt:  KILKEEIVKGALMKKELEEARGKIRELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSK

Query:  LEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLM
        L+  +ARI TL+ M ES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVY RW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LM
Subjt:  LEVMKARIKTLTKMEESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLM

Query:  IEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVAL
        +EYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSF+++P +   LKK  ++K   S  S P          R+  S ++ + PLE+++  
Subjt:  IEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNERPNM--SLKKLIRNKGGLSAVSFP----------RIIGSSHRWKDPLEAVVAL

Query:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN---
        +A                    ET  L  +R Q    S  + +NSVA SF +MSK+V+  L +KY  YK+ HKLA+  EK IK KA+  R E+  GN   
Subjt:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENER-EKSSGN---

Query:  ------------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM--------------------------
                                ++  N   +  +      +  ++L  ++          P S     S N+                          
Subjt:  ------------------------ASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSC---DPNSQYDNDSKNM--------------------------

Query:  --------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVRE
                      +      G + HR PELV F + +MK                                     L VKAD+ETQGD V +LA EVR 
Subjt:  --------------ISNPNSNGGEGHRGPELVRFNKKMMK-------------------------------------LEVKADMETQGDLVVALAMEVRE

Query:  ASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTAN
        +SFT++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQSVY L  TRD A 
Subjt:  ASFTNMEDVVSFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTAN

Query:  SRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRV
        SRY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+           FAGGFD+ESMKAFEELRSR 
Subjt:  SRYEELGIPMDWLLDCGVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRV

Query:  RTEAGQKN
        +TE+G  N
Subjt:  RTEAGQKN

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-5141.26Show/hide
Query:  MRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWL
        +R+ P ++    + M+ + T+   +S    ++       NSN  +     E    N+ +  L +K D+ETQGD +  L  EV  A+F+++EDVV FV WL
Subjt:  MRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVVSFVIWL

Query:  DEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDC
        D++LS LVD   +L+HF+WP+ K DALREAAF Y  L KL  E S F ++P+ +   AL KM +L +K+E  VY L   R++A ++++   IP+DW+L+ 
Subjt:  DEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDC

Query:  GVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRT
        G+  +IK++ V+LA KYMKR+  E  A+ G    P  E L+ QGVRFA RVH+           FAGGFD+E+MKAFEELR + R+
Subjt:  GVVGKIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAACAGATTAAGTGTTCTTGTTGCTGTTTCAATTGCAGCTTATTCAATTAGGCAGCTCACAATCAGATCATGGACCTCATTATTCTTGCCTACCAACTGTTCAGC
TTATGCCTTTTCTAATTCTGTTTTTATCTTTGTTTCCTTTATTTACATAGAAAATGGAGAAGACGTGGAGAAGAACGGATTGGATGAAAAAGAAGAGGAAGCTAATTCAA
TAAATGATGCAACTAGTCAAGTTAATGGTAGAACCTCTGATCTTGAAGATGGAGATCATAGTTCAGATGAGTTTCGAGAACTTCTACCACGAGAGTTCGAACATCGGTCG
CTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAGATAGAAAACAATAAGATTGAATTGGAAAGGTTGCTGAAACTAGTGATGGAATTGGAGGAAAGGAAAGT
GAAACTCGAAGGTGAATTGTTCATGTGCGACGGAATAAAATATAGTGAAACTGACGTTATGGAGTTACGAAAGCAGCTCGAGGCCAAGAATGATGATATCAGTATGCTTA
ATATCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTAAAAGAAGAGATAGTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGCTAGAGGCAAGATTAGAGAG
CTGCAGAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAACATTTATTATTACTCAAACAACGAGTTTCCGCTTTGCAGGCGAAAGAGGAAGAGGCCGTCAAGAAAGA
AGCAGAACTTTATAAGAAACAGAAAGCGGCGAAGGATTTCGAGGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGGAACTAACTTCTA
AACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAGATGGAAGAGAGTGAAATAATCACGAAGGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGAT
CTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTCTACTTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGGGACAACGA
GATATCGGCAGGCGAATCCGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAGAAGGCAAAACAACTCATGATAGAGTATGCAGGATTGGAGTCTGGACAAG
CAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCTTCTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACAAGTAGTTTCAATGAG
AGGCCTAATATGAGTTTGAAGAAATTGATCAGAAACAAAGGTGGTTTGAGTGCTGTTTCGTTTCCGCGCATAATCGGTTCAAGCCACAGATGGAAGGATCCTTTAGAAGC
AGTGGTGGCTCTGAGTGCTGAAACTTTAACTCTCTCAGAAGTTAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAAA
CAGTTGAAGAAAGTCTAAAACAAAAATATTCAACTTATAAAGAGCATCATAAGTTGGCCTTAGGGAGTGAGAAGAAAATCAAGGAAAAGGCTGAGAATGAAAGGGAAAAA
AGTTCTGGTAACGCTTCGAGTCTGAATATAGAATACGACGACACTAGCATGAGAAAGAAACCTGCAATTTTACCCCTTCAACTTGCTCAAATGAAGATGAATAAAACTTC
TTGTGATCCAAATAGCCAATATGACAATGATAGCAAAAATATGATCTCCAATCCAAATTCTAACGGAGGTGAAGGTCATCGAGGACCCGAGTTAGTTCGGTTCAATAAAA
AAATGATGAAACTTGAGGTGAAAGCTGATATGGAAACTCAAGGAGACTTGGTTGTGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCACCAACATGGAAGATGTCGTC
TCCTTCGTTATATGGCTCGATGAAAAGTTATCTTCTTTGGTTGATGGTATGGAGATTCTCGAGCATTTCGATTGGCCCAAGTTAAAAACAGATGCATTAAGAGAAGCAGC
CTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATA
AGGTGGAGCAGAGTGTCTATGGATTGTTTCATACAAGGGACACGGCGAATTCGCGATACGAGGAACTCGGAATTCCCATGGATTGGTTGTTGGATTGTGGAGTTGTTGGC
AAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAATGAACATAATGCACTGAGTGGACCTGAGAAAGAACCAAACAGAGAATTTTTGCT
TTTCCAAGGAGTGCGATTTGCTTCTCGTGTTCATAAGGACTTCGTGGCTTGTTTTGCTACTCCATTTAATTTTGCAGGAGGCTTTGATTCCGAGAGCATGAAGGCATTTG
AAGAGCTGAGAAGTCGTGTCCGTACAGAGGCAGGACAGAAAAATTGA
mRNA sequenceShow/hide mRNA sequence
CAGAGGCTTAATTCTTAATTTTGTTCTTTCTGTCCACTTTCAAGTTTCAAGCTCTCTAGCTAGTTAAGATGATGAACAGATTAAGTGTTCTTGTTGCTGTTTCAATTGCA
GCTTATTCAATTAGGCAGCTCACAATCAGATCATGGACCTCATTATTCTTGCCTACCAACTGTTCAGCTTATGCCTTTTCTAATTCTGTTTTTATCTTTGTTTCCTTTAT
TTACATAGAAAATGGAGAAGACGTGGAGAAGAACGGATTGGATGAAAAAGAAGAGGAAGCTAATTCAATAAATGATGCAACTAGTCAAGTTAATGGTAGAACCTCTGATC
TTGAAGATGGAGATCATAGTTCAGATGAGTTTCGAGAACTTCTACCACGAGAGTTCGAACATCGGTCGCTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAG
ATAGAAAACAATAAGATTGAATTGGAAAGGTTGCTGAAACTAGTGATGGAATTGGAGGAAAGGAAAGTGAAACTCGAAGGTGAATTGTTCATGTGCGACGGAATAAAATA
TAGTGAAACTGACGTTATGGAGTTACGAAAGCAGCTCGAGGCCAAGAATGATGATATCAGTATGCTTAATATCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTAA
AAGAAGAGATAGTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGCTAGAGGCAAGATTAGAGAGCTGCAGAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAA
CATTTATTATTACTCAAACAACGAGTTTCCGCTTTGCAGGCGAAAGAGGAAGAGGCCGTCAAGAAAGAAGCAGAACTTTATAAGAAACAGAAAGCGGCGAAGGATTTCGA
GGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGGAACTAACTTCTAAACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAGA
TGGAAGAGAGTGAAATAATCACGAAGGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGATCTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGT
GAAGTTGAAGAGTTAGTCTACTTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGGGACAACGAGATATCGGCAGGCGAATCCGCTCGTTATCTCAATAAGAGTTC
AAGTCCAAAGTCAAAAGAGAAGGCAAAACAACTCATGATAGAGTATGCAGGATTGGAGTCTGGACAAGCAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCTT
CTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACAAGTAGTTTCAATGAGAGGCCTAATATGAGTTTGAAGAAATTGATCAGAAACAAAGGT
GGTTTGAGTGCTGTTTCGTTTCCGCGCATAATCGGTTCAAGCCACAGATGGAAGGATCCTTTAGAAGCAGTGGTGGCTCTGAGTGCTGAAACTTTAACTCTCTCAGAAGT
TAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAAACAGTTGAAGAAAGTCTAAAACAAAAATATTCAACTTATAAAG
AGCATCATAAGTTGGCCTTAGGGAGTGAGAAGAAAATCAAGGAAAAGGCTGAGAATGAAAGGGAAAAAAGTTCTGGTAACGCTTCGAGTCTGAATATAGAATACGACGAC
ACTAGCATGAGAAAGAAACCTGCAATTTTACCCCTTCAACTTGCTCAAATGAAGATGAATAAAACTTCTTGTGATCCAAATAGCCAATATGACAATGATAGCAAAAATAT
GATCTCCAATCCAAATTCTAACGGAGGTGAAGGTCATCGAGGACCCGAGTTAGTTCGGTTCAATAAAAAAATGATGAAACTTGAGGTGAAAGCTGATATGGAAACTCAAG
GAGACTTGGTTGTGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCACCAACATGGAAGATGTCGTCTCCTTCGTTATATGGCTCGATGAAAAGTTATCTTCTTTGGTT
GATGGTATGGAGATTCTCGAGCATTTCGATTGGCCCAAGTTAAAAACAGATGCATTAAGAGAAGCAGCCTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTC
TTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATAAGGTGGAGCAGAGTGTCTATGGATTGTTTCATACAAGGGACA
CGGCGAATTCGCGATACGAGGAACTCGGAATTCCCATGGATTGGTTGTTGGATTGTGGAGTTGTTGGCAAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATG
AAACGCATAGTAAATGAACATAATGCACTGAGTGGACCTGAGAAAGAACCAAACAGAGAATTTTTGCTTTTCCAAGGAGTGCGATTTGCTTCTCGTGTTCATAAGGACTT
CGTGGCTTGTTTTGCTACTCCATTTAATTTTGCAGGAGGCTTTGATTCCGAGAGCATGAAGGCATTTGAAGAGCTGAGAAGTCGTGTCCGTACAGAGGCAGGACAGAAAA
ATTGA
Protein sequenceShow/hide protein sequence
MMNRLSVLVAVSIAAYSIRQLTIRSWTSLFLPTNCSAYAFSNSVFIFVSFIYIENGEDVEKNGLDEKEEEANSINDATSQVNGRTSDLEDGDHSSDEFRELLPREFEHRS
LDDNKKEEKVPEIQIENNKIELERLLKLVMELEERKVKLEGELFMCDGIKYSETDVMELRKQLEAKNDDISMLNITISSLQAERKILKEEIVKGALMKKELEEARGKIRE
LQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDFEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMEESEIITKEREEAQKLKSENED
LIKQLERLQMNRFSEVEELVYFRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMIEYAGLESGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFNE
RPNMSLKKLIRNKGGLSAVSFPRIIGSSHRWKDPLEAVVALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKTVEESLKQKYSTYKEHHKLALGSEKKIKEKAENEREK
SSGNASSLNIEYDDTSMRKKPAILPLQLAQMKMNKTSCDPNSQYDNDSKNMISNPNSNGGEGHRGPELVRFNKKMMKLEVKADMETQGDLVVALAMEVREASFTNMEDVV
SFVIWLDEKLSSLVDGMEILEHFDWPKLKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYGLFHTRDTANSRYEELGIPMDWLLDCGVVG
KIKVSCVELARKYMKRIVNEHNALSGPEKEPNREFLLFQGVRFASRVHKDFVACFATPFNFAGGFDSESMKAFEELRSRVRTEAGQKN