; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017098 (gene) of Chayote v1 genome

Gene IDSed0017098
OrganismSechium edule (Chayote v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationLG09:39673882..39677573
RNA-Seq ExpressionSed0017098
SyntenySed0017098
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059471.1 protein CHUP1 [Cucumis melo var. makuwa]7.4e-24074.38Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +LMKP+L KFGV LAISFA F YSR ++KNKRP LPPP SSSSDDQG+KV  G GRGP+LD+Q MK AT+ +SNV+L+A D YEE+  P+VN+D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLLLPEF+E VKE+DLSAAN  FSPKKNVE PRSGLE PKAY TVE DEYE+EIRHLK KVKMLRERERNLE QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV +HAK+VSDLEAA+AKIK LKKKLRHEAEQNR QIL+LQQ+V KL DQE KTNESNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRK N RLQIENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL QEN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GKGI+  D DSD+WS+SQASSHTDPGD D SA +FPSTAK SSNK  FI KL+KLLRGK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQNL L  EK+A   E S S  YSSSNST TNA+RAEGQ+ G  T S+NSSR+SIDF  L+SQKED +K  DS RRNSDVGY NKR  +GS++SSN 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          RS+S++TE TEKS+LMKYAEVLKDT+ AKNQS +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

XP_004141788.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]3.3e-23272.36Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +L +PILFKFGV LAISFAGF YSR ++KNKRP LPPPS SSSDDQG+KV  G GRGP+LD Q       T SNV+L+A D YEE   P+VN D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLL PEF+EL+KE+DLSAAN  FS KKNVE PR GLE PKAY TVE DEYE+EIR+LK KVKMLRERERNLE+QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV DHAKSVSDLEAA+AKIK LKKKLR+EAEQNRGQIL+LQ+RV KL DQE KTN+SNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRKSN+RL+IENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL +EN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GK +N  D DSD+WS+SQASSHTDPGD D S  DFPSTAK  SNK  FISKLRKLL+GK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQN+ L  EK+A   E S S  YS+SNST TNA+RAEGQ+ G  TP  NSS HS+DFH L SQKED +K  DS+RRNSDVG  NKR  +GS++ S+ 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
         YRS++++TE TEKS+LMKYAEVLKDT+ AKN+S +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

XP_008462405.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]8.2e-23974.22Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +LMKP+L KFGV LAISFA F YSR ++KNKRP LPPP SSSSDDQG+KV  G GRGP+LD+Q MK AT+ +SNV+L+A D YEE+   +VN+D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLLLPEF+E VKE+DLSAAN  FSPKKNVE PRSGLE PKAY TVE DEYE+EIRHLK KVKMLRERERNLE QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV +HAK+VSDLEAA+AKIK LKKKLRHEAEQNR QIL+LQQ+V KL DQE KTNESNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRK N RLQIENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL QEN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GKGI+  D DSD+WS+SQASSHTDPGD D SA +FPSTAK SSNK  FI KL+KLLRGK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQNL L  EK+A   E S S  YSSSNST TNA+RAEGQ+ G  T S+NSSR+SIDF  L+SQKED +K  DS RRNSDVGY NKR  +GS++SSN 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          RS+S++TE TEKS+LMKYAEVLKDT+ AKNQS +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

XP_031744947.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]3.7e-23172.25Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPS-SSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNID
        ME K +L +PILFKFGV LAISFAGF YSR ++KNKRP LPPPS SSS+DDQG+KV  G GRGP+LD Q       T SNV+L+A D YEE   P+VN D
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPS-SSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNID

Query:  N-------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKE
        +       SNK+ VDKDGLL PEF+EL+KE+DLSAAN  FS KKNVE PR GLE PKAY TVE DEYE+EIR+LK KVKMLRERERNLE+QLLEYYGLKE
Subjt:  N-------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKE

Query:  QESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAE
        QE+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV DHAKSVSDLEAA+AKIK LKKKLR+EAEQNRGQIL+LQ+RV KL DQE KTN+SNKDA+
Subjt:  QESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAE

Query:  IKLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELR
        IKLQKIEDLEK+IE+LRKSN+RL+IENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL +EN  LTKE+EQL+A R ADVEELVYLRWINACLRYELR
Subjt:  IKLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELR

Query:  NFQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGK
        NFQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GK +N  D DSD+WS+SQASSHTDPGD D S  DFPSTAK  SNK  FISKLRKLL+GK
Subjt:  NFQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGK

Query:  SSQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSN
           SQQN+ L  EK+A   E S S  YS+SNST TNA+RAEGQ+ G  TP  NSS HS+DFH L SQKED +K  DS+RRNSDVG  NKR  +GS++ S+
Subjt:  SSQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSN

Query:  LLYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          YRS++++TE TEKS+LMKYAEVLKDT+ AKN+S +K A   SF
Subjt:  LLYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

XP_038898688.1 protein CHUP1, chloroplastic [Benincasa hispida]1.0e-24475.5Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        M+ KRDLMKPILFKFG ALAISFAGF  S+ +++NKRP L PPSSSSSDDQ SKV  G GRGP+LD+Q +K AT+ +SNV+ +A D YE+   P+VN D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDG LLPEF+ELVKE+D SAAN G  PKKNVE PRSGLE PKAY TVE DEYE+EIRHLK KVK LRERERNLE+QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFTLKIESLQ DNRRLESQV DHAKSVSDLEAAKAKIK LKKK+R+EAEQNRGQIL+LQQRV KL DQE KTNESNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        +LQKIE+LEK+IEDLRKSN++LQIENSDLS+RLDATQFLANS+ EDQEKE+LKEE ERL++EN  LTKE+EQL+A RCAD+EELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E +E GK IN  D DSD+WS+SQASSHTDPGD D SAVDFPSTAK SSNK  FISKLRKLLRGK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSL-RRNSDVGYTNKR--IGSERSSN
          SQQNL L  EK+A   E S S RYSSSNS  TNA+RAEGQ  G TTPS+NSSRHS+DFH LNSQKED  K  DS+ RRNSDVGY NK+  +GS+ SSN
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSL-RRNSDVGYTNKR--IGSERSSN

Query:  LLYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          YRS+S++TE TEKS+LMKYAEVLKDT+ AKNQSQ+K A   SF
Subjt:  LLYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

TrEMBL top hitse value%identityAlignment
A0A0A0K799 Uncharacterized protein1.6e-23272.36Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +L +PILFKFGV LAISFAGF YSR ++KNKRP LPPPS SSSDDQG+KV  G GRGP+LD Q       T SNV+L+A D YEE   P+VN D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLL PEF+EL+KE+DLSAAN  FS KKNVE PR GLE PKAY TVE DEYE+EIR+LK KVKMLRERERNLE+QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV DHAKSVSDLEAA+AKIK LKKKLR+EAEQNRGQIL+LQ+RV KL DQE KTN+SNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRKSN+RL+IENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL +EN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GK +N  D DSD+WS+SQASSHTDPGD D S  DFPSTAK  SNK  FISKLRKLL+GK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQN+ L  EK+A   E S S  YS+SNST TNA+RAEGQ+ G  TP  NSS HS+DFH L SQKED +K  DS+RRNSDVG  NKR  +GS++ S+ 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
         YRS++++TE TEKS+LMKYAEVLKDT+ AKN+S +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

A0A1S3CGW9 protein CHUP1, chloroplastic4.0e-23974.22Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +LMKP+L KFGV LAISFA F YSR ++KNKRP LPPP SSSSDDQG+KV  G GRGP+LD+Q MK AT+ +SNV+L+A D YEE+   +VN+D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLLLPEF+E VKE+DLSAAN  FSPKKNVE PRSGLE PKAY TVE DEYE+EIRHLK KVKMLRERERNLE QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV +HAK+VSDLEAA+AKIK LKKKLRHEAEQNR QIL+LQQ+V KL DQE KTNESNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRK N RLQIENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL QEN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GKGI+  D DSD+WS+SQASSHTDPGD D SA +FPSTAK SSNK  FI KL+KLLRGK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQNL L  EK+A   E S S  YSSSNST TNA+RAEGQ+ G  T S+NSSR+SIDF  L+SQKED +K  DS RRNSDVGY NKR  +GS++SSN 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          RS+S++TE TEKS+LMKYAEVLKDT+ AKNQS +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

A0A5A7V182 Protein CHUP13.6e-24074.38Show/hide
Query:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN
        ME K +LMKP+L KFGV LAISFA F YSR ++KNKRP LPPP SSSSDDQG+KV  G GRGP+LD+Q MK AT+ +SNV+L+A D YEE+  P+VN+D+
Subjt:  MEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDN

Query:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ
               SNK+ VDKDGLLLPEF+E VKE+DLSAAN  FSPKKNVE PRSGLE PKAY TVE DEYE+EIRHLK KVKMLRERERNLE QLLEYYGLKEQ
Subjt:  -------SNKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQ

Query:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI
        E+AVMELQNRLKINNMEA+LFT KIESL+ DNRRLESQV +HAK+VSDLEAA+AKIK LKKKLRHEAEQNR QIL+LQQ+V KL DQE KTNESNKDA+I
Subjt:  ESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEI

Query:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN
        KLQKIEDLEK+IE+LRK N RLQIENSDL +RLDATQFLANS+ EDQEKE+LKEE ERL QEN  LTKE+EQL+A R ADVEELVYLRWINACLRYELRN
Subjt:  KLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRN

Query:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS
        FQPP GK AARDLSKTLSPKSE KAKKLIL+YAN+E  E GKGI+  D DSD+WS+SQASSHTDPGD D SA +FPSTAK SSNK  FI KL+KLLRGK 
Subjt:  FQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKS

Query:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL
          SQQNL L  EK+A   E S S  YSSSNST TNA+RAEGQ+ G  T S+NSSR+SIDF  L+SQKED +K  DS RRNSDVGY NKR  +GS++SSN 
Subjt:  SQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKR--IGSERSSNL

Query:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF
          RS+S++TE TEKS+LMKYAEVLKDT+ AKNQS +K A   SF
Subjt:  LYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSF

A0A6J1HK30 protein CHUP1, chloroplastic isoform X22.4e-22872.12Show/hide
Query:  MMEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNS
        MME K DL+KP+LFKFGV LAISFA F YSR +I+NKRPSL PPSSSS         SGRG KLD Q MK AT+ +SN I+ AAD YEE+   + N D+S
Subjt:  MMEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNS

Query:  -------NKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQE
               N + VD++GLLLPEF+ELVK++DLSAAN GFSPKKN    R G+E PKAY  VE+D YE EI+HLK KVKMLRERERNLE+QLLEYYGLKEQE
Subjt:  -------NKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQE

Query:  SAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIK
        +AVMELQNRLKINNMEA+LFTLKIESLQ DNRRLESQVSD AKS SDLEAA+  IK LKKKLRHEAEQNR QI++LQQRV KLLDQECK NES K+ +IK
Subjt:  SAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIK

Query:  LQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNF
        LQ IEDLEK+IE+L+K+N RLQ ENSDL +RLDATQFLANSI EDQEKE+LKEER+R AQEN TLTKE+EQL+A RCADVEELVYLRWINACLRYELRNF
Subjt:  LQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNF

Query:  QPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSS
        QP  GK AARDLSKTLSPKSE KAKKLILEYAN+E +E GK IN  D DSD+WS+SQASSHTDPGDLDYSAVD   TAKPSSNK  F+SKLR LLRGKS 
Subjt:  QPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSS

Query:  QSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKRI--GSERSSNLL
         +QQ+ AL  EK+A       S RYSSS+ST TNA+RA+G   G TTPSQNSSR S+DFH LNSQKED +K  DSLRRNSDVGY NKR   GS+RSSN L
Subjt:  QSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKRI--GSERSSNLL

Query:  YRSRSEETEYT---EKSDLMKYAEVLKDTKRAKNQSQQKGAP
        YRS S+ETE T   EKS+L+KYAEVLK+++  KNQS++K AP
Subjt:  YRSRSEETEYT---EKSDLMKYAEVLKDTKRAKNQSQQKGAP

A0A6J1HMC2 protein CHUP1, chloroplastic isoform X14.4e-23072.4Show/hide
Query:  MMEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDD-QGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNI
        MME K DL+KP+LFKFGV LAISFA F YSR +I+NKRPSL PPSSSSSD+ + +KV  G GRG KLD Q MK AT+ +SN I+ AAD YEE+   + N 
Subjt:  MMEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDD-QGSKV--GSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNI

Query:  DNS-------NKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLK
        D+S       N + VD++GLLLPEF+ELVK++DLSAAN GFSPKKN    R G+E PKAY  VE+D YE EI+HLK KVKMLRERERNLE+QLLEYYGLK
Subjt:  DNS-------NKNSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLK

Query:  EQESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDA
        EQE+AVMELQNRLKINNMEA+LFTLKIESLQ DNRRLESQVSD AKS SDLEAA+  IK LKKKLRHEAEQNR QI++LQQRV KLLDQECK NES K+ 
Subjt:  EQESAVMELQNRLKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDA

Query:  EIKLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYEL
        +IKLQ IEDLEK+IE+L+K+N RLQ ENSDL +RLDATQFLANSI EDQEKE+LKEER+R AQEN TLTKE+EQL+A RCADVEELVYLRWINACLRYEL
Subjt:  EIKLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYEL

Query:  RNFQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRG
        RNFQP  GK AARDLSKTLSPKSE KAKKLILEYAN+E +E GK IN  D DSD+WS+SQASSHTDPGDLDYSAVD   TAKPSSNK  F+SKLR LLRG
Subjt:  RNFQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRG

Query:  KSSQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKRI--GSERSS
        KS  +QQ+ AL  EK+A       S RYSSS+ST TNA+RA+G   G TTPSQNSSR S+DFH LNSQKED +K  DSLRRNSDVGY NKR   GS+RSS
Subjt:  KSSQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSLNSQKEDGMKAADSLRRNSDVGYTNKRI--GSERSS

Query:  NLLYRSRSEETEYT---EKSDLMKYAEVLKDTKRAKNQSQQKGAP
        N LYRS S+ETE T   EKS+L+KYAEVLK+++  KNQS++K AP
Subjt:  NLLYRSRSEETEYT---EKSDLMKYAEVLKDTKRAKNQSQQKGAP

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic2.0e-5434.7Show/hide
Query:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE
        + G  +A S A     RL +K  +PS P  +    D + S         L+ + ++E        +     V  + R        S  + +D D  +LPE
Subjt:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE

Query:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ
        FE+L+    EY L           N+E      E  + Y  VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQES ++ELQ +LKI  +E  
Subjt:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ

Query:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN
        +  + I SLQ + ++L+ ++S +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + + E KL+ ++DLE Q+ +L++ N
Subjt:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN

Query:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT
          LQ E  +LS +LD+ +      +++ E  +   ++EE   L   N  L K++E L+ +R ++VEELVYLRW+NACLRYELRN+Q P GK +ARDLSK 
Subjt:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT

Query:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA
        LSPKS+ KAK+L+LEYA SE     +G    D++S+    +S  S   D   +D S   F S +K    K   I KL+K  + K   S Q+    +    
Subjt:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA

Query:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ
         P   S SS         +   R  G+S   TT  Q
Subjt:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ

Arabidopsis top hitse value%identityAlignment
AT1G52080.1 actin binding protein family1.1e-8939.94Show/hide
Query:  KRDLMKPILFKFGVALAISFAGFFYSRLKIKNKR--PSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNK
        KRD+   ++ + G ALA+SFAGF ++R +   KR  P+LPP    SSD+ G +  S +   +D +   E T  T    L            E ++D    
Subjt:  KRDLMKPILFKFGVALAISFAGFFYSRLKIKNKR--PSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNK

Query:  NSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNR
           +KD  LLPEFEE  K+ DL   +       + ETPRS + AP A+ + E  ++E EI  L+  V+ LRERER LE +LLEYY LKEQ+   MEL++R
Subjt:  NSVDKDGLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNR

Query:  LKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEK
        LK+N ME ++F  KI+ LQ +N +L+++  +H+K + +L+ AK+++++LKKKL    +Q+  QILSL+QRVA+L ++E K    + +A+  +Q++ DLE 
Subjt:  LKINNMEAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEK

Query:  QIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEK-ETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPA
        +I +L  +N RLQ EN +LS++L++ Q +ANS  E+ E+ ETL+E+  RL  EN  L K++EQL+ DRC D+E+LVYLRWINACLRYELR +QPP GK  
Subjt:  QIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEK-ETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPA

Query:  ARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSH--TDPGDLDYSAVD-FPSTAKPSSNKNSFISKLRKLLRGKSSQSQQN
        ARDLS TLSP SE KAK+LILEYA+SE           + D DRWS+SQ  S   TD   LD S+VD   +T    + K   + KL K+L GK ++    
Subjt:  ARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSDRWSTSQASSH--TDPGDLDYSAVD-FPSTAKPSSNKNSFISKLRKLLRGKSSQSQQN

Query:  LALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSL---NSQKEDGMKAADSLRRNSDVGYTNKRIGSERSSNLLYRSR
            ++K AG +E        SS++T  +           +TP Q  S HS+DF  L     ++ED       LRR S+             S+      
Subjt:  LALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSIDFHSL---NSQKEDGMKAADSLRRNSDVGYTNKRIGSERSSNLLYRSR

Query:  SEETEYTEKSDLMKYAEVLKDTKRAK
          ET+   K +L+K A+ L  ++  K
Subjt:  SEETEYTEKSDLMKYAEVLKDTKRAK

AT2G36650.1 unknown protein8.4e-0821.65Show/hide
Query:  GLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNM
        GL+L  F    ++ +++++    +P+ +    R   E  +A      ++ ++EI  LK + + L+ +E  +E+    +  LK+QE  ++E ++ L +   
Subjt:  GLLLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNM

Query:  EAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLK---KKLRHEAEQ------NRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIED
        +   F  ++ +++ +++R ++ V  + K V +++  +++  LL+   KKLR +++Q         +I+ +++   K +D+    N   K+ E    K++D
Subjt:  EAQLFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLK---KKLRHEAEQ------NRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIED

Query:  LEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYEL
        +E  ++ L++    L +++S+ +  + +         ED  +                + +E E+LK D    V+E++ LRW NACLR+E+
Subjt:  LEKQIEDLRKSNMRLQIENSDLSQRLDATQFLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYEL

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.4e-5534.7Show/hide
Query:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE
        + G  +A S A     RL +K  +PS P  +    D + S         L+ + ++E        +     V  + R        S  + +D D  +LPE
Subjt:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE

Query:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ
        FE+L+    EY L           N+E      E  + Y  VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQES ++ELQ +LKI  +E  
Subjt:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ

Query:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN
        +  + I SLQ + ++L+ ++S +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + + E KL+ ++DLE Q+ +L++ N
Subjt:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN

Query:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT
          LQ E  +LS +LD+ +      +++ E  +   ++EE   L   N  L K++E L+ +R ++VEELVYLRW+NACLRYELRN+Q P GK +ARDLSK 
Subjt:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT

Query:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA
        LSPKS+ KAK+L+LEYA SE     +G    D++S+    +S  S   D   +D S   F S +K    K   I KL+K  + K   S Q+    +    
Subjt:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA

Query:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ
         P   S SS         +   R  G+S   TT  Q
Subjt:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.4e-5534.7Show/hide
Query:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE
        + G  +A S A     RL +K  +PS P  +    D + S         L+ + ++E        +     V  + R        S  + +D D  +LPE
Subjt:  KFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGLLLPE

Query:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ
        FE+L+    EY L           N+E      E  + Y  VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQES ++ELQ +LKI  +E  
Subjt:  FEELVK---EYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQ

Query:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN
        +  + I SLQ + ++L+ ++S +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + + E KL+ ++DLE Q+ +L++ N
Subjt:  LFTLKIESLQTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSN

Query:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT
          LQ E  +LS +LD+ +      +++ E  +   ++EE   L   N  L K++E L+ +R ++VEELVYLRW+NACLRYELRN+Q P GK +ARDLSK 
Subjt:  MRLQIENSDLSQRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKT

Query:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA
        LSPKS+ KAK+L+LEYA SE     +G    D++S+    +S  S   D   +D S   F S +K    K   I KL+K  + K   S Q+    +    
Subjt:  LSPKSEGKAKKLILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTA

Query:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ
         P   S SS         +   R  G+S   TT  Q
Subjt:  GPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQ

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein8.6e-3736.31Show/hide
Query:  DNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSNMRLQIENSDLS
        +++ L+ ++S +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + + E KL+ ++DLE Q+ +L++ N  LQ E  +LS
Subjt:  DNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSNMRLQIENSDLS

Query:  QRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKTLSPKSEGKAKK
         +LD+ +      +++ E  +   ++EE   L   N  L K++E L+ +R ++VEELVYLRW+NACLRYELRN+Q P GK +ARDLSK LSPKS+ KAK+
Subjt:  QRLDATQ---FLANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKTLSPKSEGKAKK

Query:  LILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTAGPAEYSYSSRY
        L+LEYA SE     +G    D++S+    +S  S   D   +D S   F S +K    K   I KL+K  + K   S Q+    +     P   S SS  
Subjt:  LILEYANSEVVEGGKGINTIDIDSD-RWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTAGPAEYSYSSRY

Query:  SSSNSTRTNASRAEGQSFGCTTPSQ
               +   R  G+S   TT  Q
Subjt:  SSSNSTRTNASRAEGQSFGCTTPSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGAGGTGAAGAGAGATTTGATGAAGCCAATATTGTTCAAATTTGGGGTTGCTTTGGCTATCTCCTTTGCTGGTTTCTTCTATTCCCGCTTAAAAATCAAGAACAA
AAGACCTTCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGAAGTAAAGTTGGCTCGGGAAGAGGACCTAAACTTGATAGCCAAGCAATGAAGGAAGCAACAT
CAACAACCTCTAATGTTATTCTTTATGCAGCTGATGTCTATGAGGAAATACGAAATCCGGAAGTCAACATTGACAATTCAAATAAGAACAGTGTAGACAAAGATGGGCTG
CTTCTCCCAGAGTTTGAGGAACTTGTCAAGGAATATGATCTTTCTGCAGCAAATGTTGGGTTTTCTCCTAAGAAAAATGTTGAAACACCGAGGTCAGGACTCGAAGCTCC
GAAAGCTTATCTGACCGTTGAGACGGATGAGTATGAAGAAGAGATCAGACACCTCAAAATCAAGGTGAAAATGCTTCGAGAGAGGGAAAGAAACCTCGAGATTCAGCTAC
TCGAGTATTATGGCCTGAAAGAGCAAGAATCTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAGGCTCAACTTTTCACTCTCAAGATTGAGTCCCTT
CAGACTGATAATCGACGATTAGAATCACAGGTTTCCGATCATGCGAAATCAGTGTCCGACCTCGAGGCTGCAAAAGCAAAGATTAAGCTTCTCAAGAAAAAACTTAGACA
TGAAGCAGAACAGAACAGGGGACAGATCTTAAGTCTTCAGCAAAGAGTTGCTAAGCTGCTTGATCAAGAATGTAAAACAAATGAAAGCAATAAAGATGCTGAAATCAAAC
TGCAAAAGATTGAAGATTTGGAGAAACAAATAGAGGACTTAAGAAAGTCAAATATGAGATTACAAATAGAAAATTCTGATCTCAGTCAGAGATTAGATGCTACTCAGTTT
CTTGCAAATTCTATTTTTGAAGACCAAGAAAAAGAAACACTTAAAGAAGAAAGGGAGCGTTTGGCGCAAGAAAACGTGACGTTGACAAAGGAAATGGAGCAGCTTAAAGC
AGACCGATGTGCGGATGTCGAAGAACTAGTCTATCTCCGCTGGATTAATGCTTGCTTAAGATACGAGCTGCGGAATTTTCAGCCTCCAACTGGAAAACCAGCAGCAAGAG
ACCTAAGCAAAACATTAAGTCCCAAATCCGAGGGGAAAGCGAAGAAGCTCATACTCGAATACGCAAATTCAGAAGTAGTTGAAGGGGGGAAGGGCATCAACACTATAGAT
ATCGATTCAGATCGGTGGTCAACCTCGCAAGCTTCCTCTCATACCGATCCCGGAGATCTGGATTATTCAGCTGTTGATTTTCCATCAACAGCCAAACCAAGTTCAAACAA
AAACAGTTTCATTAGCAAACTGAGGAAACTCTTGAGGGGAAAAAGTAGTCAAAGTCAACAAAACCTCGCTTTGTCTACCGAAAAAACTGCTGGACCTGCAGAATATAGTT
ATTCTTCACGTTACAGTTCAAGTAATTCTACAAGGACCAATGCTTCTAGAGCCGAGGGACAAAGTTTTGGATGTACAACTCCATCTCAGAATTCATCAAGACATTCAATA
GATTTTCACAGCTTAAACAGCCAAAAGGAAGACGGCATGAAAGCTGCAGATTCCCTTAGAAGGAACAGCGATGTCGGCTACACTAACAAGAGAATAGGTAGTGAACGATC
AAGCAACTTGTTGTATAGATCTCGTAGCGAGGAAACAGAATACACTGAGAAGTCGGATTTGATGAAATATGCTGAAGTTCTGAAAGACACTAAGCGAGCTAAGAACCAGT
CACAGCAGAAGGGTGCACCCGACCGCTCGTTTTACACCCGAAAGCTTGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGAGGTGAAGAGAGATTTGATGAAGCCAATATTGTTCAAATTTGGGGTTGCTTTGGCTATCTCCTTTGCTGGTTTCTTCTATTCCCGCTTAAAAATCAAGAACAA
AAGACCTTCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGAAGTAAAGTTGGCTCGGGAAGAGGACCTAAACTTGATAGCCAAGCAATGAAGGAAGCAACAT
CAACAACCTCTAATGTTATTCTTTATGCAGCTGATGTCTATGAGGAAATACGAAATCCGGAAGTCAACATTGACAATTCAAATAAGAACAGTGTAGACAAAGATGGGCTG
CTTCTCCCAGAGTTTGAGGAACTTGTCAAGGAATATGATCTTTCTGCAGCAAATGTTGGGTTTTCTCCTAAGAAAAATGTTGAAACACCGAGGTCAGGACTCGAAGCTCC
GAAAGCTTATCTGACCGTTGAGACGGATGAGTATGAAGAAGAGATCAGACACCTCAAAATCAAGGTGAAAATGCTTCGAGAGAGGGAAAGAAACCTCGAGATTCAGCTAC
TCGAGTATTATGGCCTGAAAGAGCAAGAATCTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAGGCTCAACTTTTCACTCTCAAGATTGAGTCCCTT
CAGACTGATAATCGACGATTAGAATCACAGGTTTCCGATCATGCGAAATCAGTGTCCGACCTCGAGGCTGCAAAAGCAAAGATTAAGCTTCTCAAGAAAAAACTTAGACA
TGAAGCAGAACAGAACAGGGGACAGATCTTAAGTCTTCAGCAAAGAGTTGCTAAGCTGCTTGATCAAGAATGTAAAACAAATGAAAGCAATAAAGATGCTGAAATCAAAC
TGCAAAAGATTGAAGATTTGGAGAAACAAATAGAGGACTTAAGAAAGTCAAATATGAGATTACAAATAGAAAATTCTGATCTCAGTCAGAGATTAGATGCTACTCAGTTT
CTTGCAAATTCTATTTTTGAAGACCAAGAAAAAGAAACACTTAAAGAAGAAAGGGAGCGTTTGGCGCAAGAAAACGTGACGTTGACAAAGGAAATGGAGCAGCTTAAAGC
AGACCGATGTGCGGATGTCGAAGAACTAGTCTATCTCCGCTGGATTAATGCTTGCTTAAGATACGAGCTGCGGAATTTTCAGCCTCCAACTGGAAAACCAGCAGCAAGAG
ACCTAAGCAAAACATTAAGTCCCAAATCCGAGGGGAAAGCGAAGAAGCTCATACTCGAATACGCAAATTCAGAAGTAGTTGAAGGGGGGAAGGGCATCAACACTATAGAT
ATCGATTCAGATCGGTGGTCAACCTCGCAAGCTTCCTCTCATACCGATCCCGGAGATCTGGATTATTCAGCTGTTGATTTTCCATCAACAGCCAAACCAAGTTCAAACAA
AAACAGTTTCATTAGCAAACTGAGGAAACTCTTGAGGGGAAAAAGTAGTCAAAGTCAACAAAACCTCGCTTTGTCTACCGAAAAAACTGCTGGACCTGCAGAATATAGTT
ATTCTTCACGTTACAGTTCAAGTAATTCTACAAGGACCAATGCTTCTAGAGCCGAGGGACAAAGTTTTGGATGTACAACTCCATCTCAGAATTCATCAAGACATTCAATA
GATTTTCACAGCTTAAACAGCCAAAAGGAAGACGGCATGAAAGCTGCAGATTCCCTTAGAAGGAACAGCGATGTCGGCTACACTAACAAGAGAATAGGTAGTGAACGATC
AAGCAACTTGTTGTATAGATCTCGTAGCGAGGAAACAGAATACACTGAGAAGTCGGATTTGATGAAATATGCTGAAGTTCTGAAAGACACTAAGCGAGCTAAGAACCAGT
CACAGCAGAAGGGTGCACCCGACCGCTCGTTTTACACCCGAAAGCTTGTCTAG
Protein sequenceShow/hide protein sequence
MMEVKRDLMKPILFKFGVALAISFAGFFYSRLKIKNKRPSLPPPSSSSSDDQGSKVGSGRGPKLDSQAMKEATSTTSNVILYAADVYEEIRNPEVNIDNSNKNSVDKDGL
LLPEFEELVKEYDLSAANVGFSPKKNVETPRSGLEAPKAYLTVETDEYEEEIRHLKIKVKMLRERERNLEIQLLEYYGLKEQESAVMELQNRLKINNMEAQLFTLKIESL
QTDNRRLESQVSDHAKSVSDLEAAKAKIKLLKKKLRHEAEQNRGQILSLQQRVAKLLDQECKTNESNKDAEIKLQKIEDLEKQIEDLRKSNMRLQIENSDLSQRLDATQF
LANSIFEDQEKETLKEERERLAQENVTLTKEMEQLKADRCADVEELVYLRWINACLRYELRNFQPPTGKPAARDLSKTLSPKSEGKAKKLILEYANSEVVEGGKGINTID
IDSDRWSTSQASSHTDPGDLDYSAVDFPSTAKPSSNKNSFISKLRKLLRGKSSQSQQNLALSTEKTAGPAEYSYSSRYSSSNSTRTNASRAEGQSFGCTTPSQNSSRHSI
DFHSLNSQKEDGMKAADSLRRNSDVGYTNKRIGSERSSNLLYRSRSEETEYTEKSDLMKYAEVLKDTKRAKNQSQQKGAPDRSFYTRKLV