CuGenDBv2

PI0007380 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0007380
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr12:5321523..5325586
RNA-Seq Expression: PI0007380
Synteny: PI0007380
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
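The genome location in this record is written as `chr:start..end`. As a minimal sketch, the Python snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr12:5321523..5325586")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```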


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0057015.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]; e value: 5.9e-158; % identity: 92.43
Query:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI
        M +V+ EVRQSDSLSKRGTMGSKAM KW KT+TPAHV+QLIQAERDIKKALIIFDSATAEY NGFKHD+NTFSLMISKLISANQFRLAE LLDRMKEEKI
Subjt:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI

Query:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE
        DVTEDILLSICRAYGRIHKPLDSIRV  KM DFHCKPTEKSYISVLAILVEENQLKLAFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR 
Subjt:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE

Query:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR
        MSN G EPDSYTYGTLINGLCRFGNIVEAK+LLQEMETKGC PSVITYTSIIHGLCQLNNVDEA+RLLEDMKDK IEPNVFTYSSLMDGFCKAGHSSRAR
Subjt:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR

Query:  DILG
        DILG
Subjt:  DILG

KAE8652726.1 hypothetical protein Csa_014106 [Cucumis sativus]; e value: 5.2e-162; % identity: 91.69
Query:  MDFKILNKIHMLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAET
        MDFKILNKIH+L+V+ EVRQSDSL+KR TMGSKAM KW KT+TP HV+QLIQAERDIKKALIIFDSATAEY NGFKHDLNTFSLMISKLISANQFRLAET
Subjt:  MDFKILNKIHMLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAET

Query:  LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGT
        LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRV  KMQDFHCKPTEKSYISVLAILVEENQLK AFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGT
Subjt:  LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGT

Query:  MDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGF
        MDKAMH+FR MSN GCEPDSYTYGTLINGLCRF +IVEAK+LLQEMETKGC PSV+TYTSIIHGLCQLNNVDEAMRLLEDMKDK IEPNVFTYSSLMDGF
Subjt:  MDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGF

Query:  CKAGHSSRARDIL
        CK GHSSRARDIL
Subjt:  CKAGHSSRARDIL

XP_004146658.2 pentatricopeptide repeat-containing protein At5g46100 isoform X2 [Cucumis sativus]; e value: 2.3e-149; % identity: 92.96
Query:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK
        MGSKAM KW KT+TP HV+QLIQAERDIKKALIIFDSATAEY NGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK
Subjt:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK

Query:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING
        PLDSIRV  KMQDFHCKPTEKSYISVLAILVEENQLK AFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR MSN GCEPDSYTYGTLING
Subjt:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING

Query:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        LCRF +IVEAK+LLQEMETKGC PSV+TYTSIIHGLCQLNNVDEAMRLLEDMKDK IEPNVFTYSSLMDGFCK GHSSRARDIL
Subjt:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

XP_008442845.1 PREDICTED: pentatricopeptide repeat-containing protein At5g46100 [Cucumis melo]; e value: 5.9e-150; % identity: 93.33
Query:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK
        MGSKAM KW KT+TPAHV+QLIQAERDIKKALIIFDSATAEY NGFKHD+NTFSLMISKLISANQFRLAE LLDRMKEEKIDVTEDILLSICRAYGRIHK
Subjt:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK

Query:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING
        PLDSIRV  KM DFHCKPTEKSYISVLAILVEENQLKLAFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR MSN G EPDSYTYGTLING
Subjt:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING

Query:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDILG
        LCRFGNIVEAK+LLQEMETKGC PSVITYTSIIHGLCQLNNVDEA+RLLEDMKDK IEPNVFTYSSLMDGFCKAGHSSRARDILG
Subjt:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDILG

XP_031736238.1 pentatricopeptide repeat-containing protein At5g46100 isoform X1 [Cucumis sativus]; e value: 5.2e-162; % identity: 91.69
Query:  MDFKILNKIHMLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAET
        MDFKILNKIH+L+V+ EVRQSDSL+KR TMGSKAM KW KT+TP HV+QLIQAERDIKKALIIFDSATAEY NGFKHDLNTFSLMISKLISANQFRLAET
Subjt:  MDFKILNKIHMLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAET

Query:  LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGT
        LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRV  KMQDFHCKPTEKSYISVLAILVEENQLK AFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGT
Subjt:  LLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGT

Query:  MDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGF
        MDKAMH+FR MSN GCEPDSYTYGTLINGLCRF +IVEAK+LLQEMETKGC PSV+TYTSIIHGLCQLNNVDEAMRLLEDMKDK IEPNVFTYSSLMDGF
Subjt:  MDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGF

Query:  CKAGHSSRARDIL
        CK GHSSRARDIL
Subjt:  CKAGHSSRARDIL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LRZ4 Uncharacterized protein; e value: 7.8e-156; % identity: 92
Query:  VRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVT
        V+ EVRQSDSL+KR TMGSKAM KW KT+TP HV+QLIQAERDIKKALIIFDSATAEY NGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVT
Subjt:  VRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVT

Query:  EDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSN
        EDILLSICRAYGRIHKPLDSIRV  KMQDFHCKPTEKSYISVLAILVEENQLK AFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR MSN
Subjt:  EDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSN

Query:  RGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
         GCEPDSYTYGTLINGLCRF +IVEAK+LLQEMETKGC PSV+TYTSIIHGLCQLNNVDEAMRLLEDMKDK IEPNVFTYSSLMDGFCK GHSSRARDIL
Subjt:  RGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

A0A1S3B6P2 pentatricopeptide repeat-containing protein At5g46100; e value: 2.9e-150; % identity: 93.33
Query:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK
        MGSKAM KW KT+TPAHV+QLIQAERDIKKALIIFDSATAEY NGFKHD+NTFSLMISKLISANQFRLAE LLDRMKEEKIDVTEDILLSICRAYGRIHK
Subjt:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK

Query:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING
        PLDSIRV  KM DFHCKPTEKSYISVLAILVEENQLKLAFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR MSN G EPDSYTYGTLING
Subjt:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING

Query:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDILG
        LCRFGNIVEAK+LLQEMETKGC PSVITYTSIIHGLCQLNNVDEA+RLLEDMKDK IEPNVFTYSSLMDGFCKAGHSSRARDILG
Subjt:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDILG

A0A5D3DS89 Pentatricopeptide repeat-containing protein; e value: 2.9e-158; % identity: 92.43
Query:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI
        M +V+ EVRQSDSLSKRGTMGSKAM KW KT+TPAHV+QLIQAERDIKKALIIFDSATAEY NGFKHD+NTFSLMISKLISANQFRLAE LLDRMKEEKI
Subjt:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI

Query:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE
        DVTEDILLSICRAYGRIHKPLDSIRV  KM DFHCKPTEKSYISVLAILVEENQLKLAFRFYR MRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMH+FR 
Subjt:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE

Query:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR
        MSN G EPDSYTYGTLINGLCRFGNIVEAK+LLQEMETKGC PSVITYTSIIHGLCQLNNVDEA+RLLEDMKDK IEPNVFTYSSLMDGFCKAGHSSRAR
Subjt:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR

Query:  DILG
        DILG
Subjt:  DILG

A0A6J1CDW1 pentatricopeptide repeat-containing protein At5g46100 isoform X1; e value: 2.1e-145; % identity: 83.5
Query:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI
        ML+V+ EV + D+L KRGTMGSKAM KW KT+TP+HVEQLIQAERDI KAL+IFDSAT+EY NGFKHDLNTF LMISKL+SANQFR AETLLDRM EEK 
Subjt:  MLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI

Query:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE
        DVTEDI L+ICRAYGR+HKPLDSIR+  KM+DF CKPTEKSYI+V AILVEENQLKLA RFYR MRKMG PPTV SLNVLIKAFCKNSGTMDKAMH+ RE
Subjt:  DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFRE

Query:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR
        MSN GCEPDSYTYGTLINGLC+ G IVEAK+LLQEMETKGC PSV+TYTS+IHGLCQLNNVDEA+ LLEDM  KGIEPNVFTYSSLMDGFCKAGHSSRAR
Subjt:  MSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRAR

Query:  DIL
        D+L
Subjt:  DIL

A0A6J1KLZ3 pentatricopeptide repeat-containing protein At5g46100; e value: 1.4e-144; % identity: 89.08
Query:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK
        MGSKAM KW KT+TPAHVEQLIQAERDI KAL+IFDSATAEY NGFKHDLNTF LMI KL+SANQFRLAETLLDRMKEEK+DVTEDI LSICRAYGRIH+
Subjt:  MGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHK

Query:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING
        PLDSIRV  KMQDFHCKPTEKSYISV AILVEENQLKLAFRFYR MRK+GIPPTV SLNVLIKA CKNSGTMDKAM+MFREMSN+GCEPDSYTYGTLING
Subjt:  PLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING

Query:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        LCRFGNIVEAK+LLQEME KGC PSV+TYTS+IHGLCQLNNVDEAM LLEDM  KGIEPNVFTYSSLMDGFCKAGHS RARD+L
Subjt:  LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

SwissProt top hits (columns: e value, % identity, alignment)
Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580; e value: 4.9e-38; % identity: 30.13
Query:  GKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVL
        G  + P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+  +I KL    +F   E +L  M+E   + + E + +   + YGR  K  +++ V 
Subjt:  GKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVL

Query:  DKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING--------
        ++M  + C+PT  SY +++++LV+      A + Y +MR  GI P V S  + +K+FCK S     A+ +   MS++GCE +   Y T++ G        
Subjt:  DKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING--------

Query:  ---------------------------LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCK
                                   LC+ G++ E +KLL ++  +G  P++ TY   I GLCQ   +D A+R++  + ++G +P+V TY++L+ G CK
Subjt:  ---------------------------LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCK

Query:  AGHSSRARDILG
              A   LG
Subjt:  AGHSSRARDILG

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial; e value: 1.1e-42; % identity: 33.82
Query:  ITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQ
        ITP  + +L++   ++  ++ +F    +  +NG++H  + + ++I KL +  +F+  + LL +MK+E I   E + +SI R Y +   P  + R++ +M+
Subjt:  ITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQ

Query:  D-FHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAK
        + + C+PT KSY  VL ILV  N  K+A   +  M    IPPT+ +  V++KAFC     +D A+ + R+M+  GC P+S  Y TLI+ L +   + EA 
Subjt:  D-FHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAK

Query:  KLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI
        +LL+EM   GC P   T+  +I GLC+ + ++EA +++  M  +G  P+  TY  LM+G CK G    A+D+
Subjt:  KLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI

Q9FNL2 Pentatricopeptide repeat-containing protein At5g46100; e value: 2.5e-106; % identity: 63.38
Query:  MGSKAML-KWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIH
        MGSK M+ KW K ITP+ V +L++AE+D++K++ +FDSATAEY NG+ HD ++F  M+ +L+SAN+F+ AE L+ RMK E   V+EDILLSICR YGR+H
Subjt:  MGSKAML-KWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIH

Query:  KPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLIN
        +P DS+RV  KM+DF C P++K+Y++VLAILVEENQL LAF+FY+ MR++G+PPTV SLNVLIKA C+N GT+D  + +F EM  RGC+PDSYTYGTLI+
Subjt:  KPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLIN

Query:  GLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI
        GLCRFG I EAKKL  EM  K C P+V+TYTS+I+GLC   NVDEAMR LE+MK KGIEPNVFTYSSLMDG CK G S +A ++
Subjt:  GLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI

Q9M302 Pentatricopeptide repeat-containing protein At3g48810; e value: 1.5e-39; % identity: 33.21
Query:  TITPAHVE-------QLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS
        T +P H E       + ++ E  +  AL  F S      N FKH   TF +MI KL    Q    + LL +MK +    +ED+ +S+   Y ++     +
Subjt:  TITPAHVE-------QLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS

Query:  IRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRF
        + +  ++++F C P+ K Y  VL  L+ EN++++ +  YR M++ G  P V + NVL+KA CKN+  +D A  +  EMSN+GC PD+ +Y T+I+ +C  
Subjt:  IRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRF

Query:  GNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        G + E ++L +  E     P V  Y ++I+GLC+ ++   A  L+ +M +KGI PNV +YS+L++  C +G    A   L
Subjt:  GNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g37230; e value: 1.2e-39; % identity: 32.58
Query:  VEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCK
        V  ++   + ++ AL  F     E     +HD +T   MI  L   ++   A  +L  M E+ +   ED+ + +  +YG+     +S+++  KM+D   +
Subjt:  VEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCK

Query:  PTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEM
         T KSY S+  +++   +  +A R++ KM   G+ PT  + N+++  F   S  ++ A+  F +M  RG  PD  T+ T+ING CRF  + EA+KL  EM
Subjt:  PTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEM

Query:  ETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        +     PSV++YT++I G   ++ VD+ +R+ E+M+  GIEPN  TYS+L+ G C AG    A++IL
Subjt:  ETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 3.5e-39; % identity: 30.13
Query:  GKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVL
        G  + P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+  +I KL    +F   E +L  M+E   + + E + +   + YGR  K  +++ V 
Subjt:  GKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVL

Query:  DKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING--------
        ++M  + C+PT  SY +++++LV+      A + Y +MR  GI P V S  + +K+FCK S     A+ +   MS++GCE +   Y T++ G        
Subjt:  DKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLING--------

Query:  ---------------------------LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCK
                                   LC+ G++ E +KLL ++  +G  P++ TY   I GLCQ   +D A+R++  + ++G +P+V TY++L+ G CK
Subjt:  ---------------------------LCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCK

Query:  AGHSSRARDILG
              A   LG
Subjt:  AGHSSRARDILG

AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 8.3e-41; % identity: 32.58
Query:  VEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCK
        V  ++   + ++ AL  F     E     +HD +T   MI  L   ++   A  +L  M E+ +   ED+ + +  +YG+     +S+++  KM+D   +
Subjt:  VEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCK

Query:  PTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEM
         T KSY S+  +++   +  +A R++ KM   G+ PT  + N+++  F   S  ++ A+  F +M  RG  PD  T+ T+ING CRF  + EA+KL  EM
Subjt:  PTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAKKLLQEM

Query:  ETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        +     PSV++YT++I G   ++ VD+ +R+ E+M+  GIEPN  TYS+L+ G C AG    A++IL
Subjt:  ETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

AT3G48810.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 1.1e-40; % identity: 33.21
Query:  TITPAHVE-------QLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS
        T +P H E       + ++ E  +  AL  F S      N FKH   TF +MI KL    Q    + LL +MK +    +ED+ +S+   Y ++     +
Subjt:  TITPAHVE-------QLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS

Query:  IRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRF
        + +  ++++F C P+ K Y  VL  L+ EN++++ +  YR M++ G  P V + NVL+KA CKN+  +D A  +  EMSN+GC PD+ +Y T+I+ +C  
Subjt:  IRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRF

Query:  GNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL
        G + E ++L +  E     P V  Y ++I+GLC+ ++   A  L+ +M +KGI PNV +YS+L++  C +G    A   L
Subjt:  GNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDIL

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 1.7e-107; % identity: 63.38
Query:  MGSKAML-KWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIH
        MGSK M+ KW K ITP+ V +L++AE+D++K++ +FDSATAEY NG+ HD ++F  M+ +L+SAN+F+ AE L+ RMK E   V+EDILLSICR YGR+H
Subjt:  MGSKAML-KWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIH

Query:  KPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLIN
        +P DS+RV  KM+DF C P++K+Y++VLAILVEENQL LAF+FY+ MR++G+PPTV SLNVLIKA C+N GT+D  + +F EM  RGC+PDSYTYGTLI+
Subjt:  KPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLIN

Query:  GLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI
        GLCRFG I EAKKL  EM  K C P+V+TYTS+I+GLC   NVDEAMR LE+MK KGIEPNVFTYSSLMDG CK G S +A ++
Subjt:  GLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI

AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 8.0e-44; % identity: 33.82
Query:  ITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQ
        ITP  + +L++   ++  ++ +F    +  +NG++H  + + ++I KL +  +F+  + LL +MK+E I   E + +SI R Y +   P  + R++ +M+
Subjt:  ITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVLDKMQ

Query:  D-FHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAK
        + + C+PT KSY  VL ILV  N  K+A   +  M    IPPT+ +  V++KAFC     +D A+ + R+M+  GC P+S  Y TLI+ L +   + EA 
Subjt:  D-FHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDSYTYGTLINGLCRFGNIVEAK

Query:  KLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI
        +LL+EM   GC P   T+  +I GLC+ + ++EA +++  M  +G  P+  TY  LM+G CK G    A+D+
Subjt:  KLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDI
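In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch (the usual BLAST convention). The Python sketch below counts identical columns on that middle line. `percent_identity` is a hypothetical helper, and because it looks only at the columns passed to it, its result need not match the % identity values reported in this record.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces on the midline are often lost in copied text,
    # so pad it back to the length of the query line.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Short alignment blocks taken from the hits listed above.
print(round(percent_identity("DILG", "DILG"), 2))
print(round(percent_identity("DIL", "D+L"), 2))
```

Gap columns (`-` in the query line) count toward the denominator here, which follows the common practice of dividing by alignment length; that choice is an assumption of this sketch.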


Sequences
CDS sequence
ATGGACTTCAAAATATTGAATAAGATTCATATGCTTAAGGTTCGGTTGGAAGTTAGACAGTCTGATTCTTTAAGCAAGAGAGGAACAATGGGCAGTAAAGCGATGCTTAA
ATGGGGAAAAACAATCACACCTGCTCATGTTGAGCAGCTAATCCAAGCAGAACGAGACATAAAGAAGGCACTTATCATATTCGACTCTGCGACAGCCGAGTATGAAAATG
GTTTTAAGCATGATCTCAATACTTTTAGTCTCATGATTAGCAAGTTAATTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAAGAGAAAATT
GATGTCACTGAGGATATACTTCTCTCCATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATAAGAGTTCTCGATAAAATGCAGGATTTTCATTGCAAGCC
TACAGAAAAATCTTACATTTCAGTGCTTGCCATTCTTGTGGAAGAAAATCAATTAAAATTGGCTTTTAGATTTTATAGGAAAATGAGAAAAATGGGTATTCCCCCTACGG
TAACTTCTCTTAATGTTCTAATCAAAGCCTTTTGCAAGAATAGTGGAACCATGGATAAAGCAATGCACATGTTTCGTGAAATGTCTAATCGTGGGTGTGAACCTGATTCA
TATACTTATGGAACTTTGATCAATGGATTATGTAGATTCGGAAACATCGTTGAGGCAAAGAAATTGTTGCAGGAGATGGAGACAAAAGGTTGTCAACCTTCTGTCATCAC
ATATACTTCAATAATACATGGTCTGTGTCAGCTGAACAATGTGGATGAAGCAATGAGATTACTTGAAGATATGAAGGACAAGGGTATCGAACCTAATGTGTTTACTTACA
GTTCTCTAATGGATGGATTTTGCAAGGCTGGTCATTCTTCACGAGCTAGAGATATCTTGGGGTGA
mRNA sequence
GGGTTGTACATGTTTAAGCCATTGAATAAGTTGTCCAAGTGGGCTAAAGATTTGTGTTGTGTTTTGGCATGGGGGATGAGATTAGTGGTATTGAAGGAAACTTTATCATG
ACATGTAGATAGGCATGAGATTAAAGTTGCTTTATAAATAAGAAAGAGGAGAAGAAGAGACTATAGTTTTACTAAAGGTTACTAGGTATTAAGGTCAAATGTTTCGAGAT
GAGCTTTGAAAGGCTTAATAAGTCATGAACACTTTAGGGAGTAAATTTGCTAGTTGAAGTATAAGTGATGGTTAGAATAATTCTTTATAGCACCTAGCTAGGCCATGTCG
TCAAAGGAAGCCAAAGTAGTTGAAATAATCTTATTCTTAAAAGTGAAGAAGCAAAGTGTCGGTTTGAATTAAACAATAAAATTTTAGAGAACTTCAAAGTACTTGGACTA
TCCCTAACTTTATGCTACCAAGACTCTAGATAGTTAATTGGACAAAGAGGGAAAAAAAAGTTTTTGGAGTTTAAAAGAGATGACAAATAGTGAAGGAGAATATTCAATAG
GTTATACTTTGGTATCGAGATTATGTTTCACTTGGAAAGTGAAAAAGAGCACCCTGAATTTATTGGCATTAGCAAGGATATGTGTGAGGAAAATGAAAGTGTTTATAGAA
TGCTTTACTATTTTGAGAAGATAAAAAGGATATTAATGTTTTGAGGCTACAAGTACATAAGTTTGGTTTGACTTACTTGCTAAGATAGGATTTTGGTGTTTACAAGCCTT
TGGAGTTTGTATAAGTAACACAGTTACTACTTGCAACTTTGGATCATTGCATGAGTTGGTTGAAAGGATAGCTTGCTTAAGCTAGTGGTCATAGGACTCGGTATTAGTAG
GAGAGTTTTCTTATCAAATAAAAGATTGAGTAAGACAATCAAGTGTTGGAGTATTTAACAAGCTTTTCAGATTATAGGGGAGTTAGAATAGAAAATATTTTATGTGGTGA
AGTTTGGAAAGCCATTGTTACCAAAAACTACTCAAAGGGAAGGTTGAGGCCTTATTGACGAAAATGTAAATTATTGAAGGTGTTTGCAAAAATTAAAAGATCGTGACTTT
GGGCACAAGCTTTGGTTGGAAACCAGTGAACCATCAATTTTATTGAAAAAGGAAAATAGTTCAACTCGAGGTAGACAAAAGTTGTACAGTTATTGAAATAAGTTAGATCT
TCCAAGATCATCATGGGCACAAGGGATAGACTTGACGATAAAAACCGAGTTGAAAGCAAGGAGTTGCTATAGATTAGATGAAGTAGCTTAAGTCAGAGAAACATGAACAA
CTAAGCAAAACTTTTATGACCTAATGTTCCGCTTGTGAGCACATCAATTTTGTTTGGCAAGGAAGAAAGAAATATCCATGCAACTATGTACAAACTGTTATGGATTGAAT
ATAATAGTGTAACTAGATCGGAGGGGATTGTCGGGGCCAGTAGGTGACGAATCCGAATTTCGATTCCTGGACTTATGATGCCACGAGATATATGTATTTCCTTGTATGGA
TGACATGTTCATCAACTTTGAGGAGTATAAATGTTCCAAGTATTGAATGACTCAAGAAGTTTTTGGTCTTACAAAGGCATAATAGTCGATTCACCCATGTTTTTTTCTTT
TTTAAGGTTGAATTTGGAGTTTGTAAAGGTTTAGGCCTTTGTCTAGCAAGGATAAATTGGGTGTTTGGTTGAGTTTATTGTTTGAAGTATGGAAGGAATTTCGAATGAAA
TACAAATTTTGGGTACGTGATTTGAGGATTTCATAAAATGATTCGTTTGTATGGAGTTGGGCCTAAAAGTTAAAGTAATATATATTTTTTTGTTCATGTTGTGGACTAAA
AGAGAACATAGTTTGCTTGTGTTAAATATTTGGGCTTTAATTTTGAGGTAAGTAATTTTACTTCTAGAACTCCTTTGTACCAGGCACTCTAATTAGGATATGATTGATAG
CTTCTCTGAGACTGTCCAAAAGATTTGGATGGTTGATGTGTATGCATGAGCATGAGCATGCCACAATGTGTTCTATGATTTATGTTTGTGGTGAAATGTGCTAATGAACC
ATCTACACTTTCGTAACGATGCATGTTCTATTCTATTCCTTAAAACAATAAAATGTTAGTGGTGTGAATCATGGATTTGATGTATTTGTTGGCATCTTTCAAGGAACATT
AAACTTTCCAAGACTTTATATGCACTGTGATTATTGCGATTATTATTTTTTTTGATGGACTTCAAAATATTGAATAAGATTCATATGCTTAAGGTTCGGTTGGAAGTTAG
ACAGTCTGATTCTTTAAGCAAGAGAGGAACAATGGGCAGTAAAGCGATGCTTAAATGGGGAAAAACAATCACACCTGCTCATGTTGAGCAGCTAATCCAAGCAGAACGAG
ACATAAAGAAGGCACTTATCATATTCGACTCTGCGACAGCCGAGTATGAAAATGGTTTTAAGCATGATCTCAATACTTTTAGTCTCATGATTAGCAAGTTAATTTCTGCA
AACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAAGAGAAAATTGATGTCACTGAGGATATACTTCTCTCCATTTGTAGGGCTTATGGTCGTATCCATAA
GCCATTGGATTCCATAAGAGTTCTCGATAAAATGCAGGATTTTCATTGCAAGCCTACAGAAAAATCTTACATTTCAGTGCTTGCCATTCTTGTGGAAGAAAATCAATTAA
AATTGGCTTTTAGATTTTATAGGAAAATGAGAAAAATGGGTATTCCCCCTACGGTAACTTCTCTTAATGTTCTAATCAAAGCCTTTTGCAAGAATAGTGGAACCATGGAT
AAAGCAATGCACATGTTTCGTGAAATGTCTAATCGTGGGTGTGAACCTGATTCATATACTTATGGAACTTTGATCAATGGATTATGTAGATTCGGAAACATCGTTGAGGC
AAAGAAATTGTTGCAGGAGATGGAGACAAAAGGTTGTCAACCTTCTGTCATCACATATACTTCAATAATACATGGTCTGTGTCAGCTGAACAATGTGGATGAAGCAATGA
GATTACTTGAAGATATGAAGGACAAGGGTATCGAACCTAATGTGTTTACTTACAGTTCTCTAATGGATGGATTTTGCAAGGCTGGTCATTCTTCACGAGCTAGAGATATC
TTGGGGTGATGGTTCAAAAACGCTTGAGGCCCAACATGATCAGTTATAGTACATTGCTTAATGGACTTTGTAATGAAGGAAAAATAAATGAAGCACTTGAGATTTTTGAC
AGAATGAAACTCCAAGGTTTGAAACCAGATGCCGGGTTGTATGGGAAAATAGTTAATCGCCTGTGTGATGTTTCCAAATTCCAAGAAGCTGCAAACTTCTTGGATGAGAT
GGTCCTTTGTGGGATCACACCTAATAGACTAACATGGAGCCTTCATGTCAGGACCCATAACAGAGTAATTCACAGTCTCTGCACTATCAACGATTCAAATCGTGCATTTC
AATTGTATCTTAGTGTCTCAACACGTGGTATTAGTATCACTGTTGATACTTTTAATTCTTTGTTAAAATGTTTCTGTAACAAAAGGGATCTTCCTAAAACTTCTAGAATT
CTGGATGAGATGGTGATTAATGGATGTATCCCCCAGGAAGAAATGTGGAGTACCATAGTTAATTGTTTTTGTGATGAAAGAAAAGCTTATGATGCTATGAAATTGCTGCA
ACTCCAGTTGATGGATTGATCTCTTGGGTCTATAATATTTATTAAAAGTTGCATTTCGTGAGCAGATATTTATTTTTGTGCAGCTGCACTTTGAACTTTTTTCCTTCCTG
GACAAGTATGCACACCACTATTTGCCTACTTTTTACCCCTTGATACTATCATTATCCAGTTCATCAATATTTCCAGTCTTCTTACTGGTATTTTCTACATAATCATGGTG
TCAGTGGGAAGCCAAACAGAACATCAGCATACCATTTGGGGTGATTTGTCTTATTCCATTCATCTCCTTATATCCAGGCATTTCTGTAGCCTTGTAGGTAGCAT
Protein sequence
MDFKILNKIHMLKVRLEVRQSDSLSKRGTMGSKAMLKWGKTITPAHVEQLIQAERDIKKALIIFDSATAEYENGFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKI
DVTEDILLSICRAYGRIHKPLDSIRVLDKMQDFHCKPTEKSYISVLAILVEENQLKLAFRFYRKMRKMGIPPTVTSLNVLIKAFCKNSGTMDKAMHMFREMSNRGCEPDS
YTYGTLINGLCRFGNIVEAKKLLQEMETKGCQPSVITYTSIIHGLCQLNNVDEAMRLLEDMKDKGIEPNVFTYSSLMDGFCKAGHSSRARDILG