CuGenDBv2

Spg037968 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037968
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold12:42556840..42558636
RNA-Seq Expression: Spg037968
Synteny: Spg037968
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571513.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.2e-228 | % identity: 71.21
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNFVSSS    LQ  YV   KQTLLDRIKNCSTINELDG+Y SMIKTNATQDCFLVNQFISASLTFN VDYPVLAF QMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLESKVLP+SYTFSSLVKACT M A++LGRMIHCHIWKNGL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGAL+LGKEIHHY MS+G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRF SM RDYGI PEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AVQQL ILEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HWM+VAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASA+ HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

XP_022963550.1 pentatricopeptide repeat-containing protein At1g06143 [Cucurbita moschata] | e value: 5.9e-229 | % identity: 71.21
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNFVSSS    LQ  YV   KQTLLDRIKNCSTINELDG+Y SMIKTNATQDCFLVNQFISASLTFN VDYPVLAFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLESKVLP+SYTFSSLVKACT M A++LGRMIHCHIWKNGL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGAL+LGKEIHHY MS+G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRF SM RDYGI PEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AVQQL +LEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HWM+VAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASA+ HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

XP_022967388.1 pentatricopeptide repeat-containing protein At1g06143 [Cucurbita maxima] | e value: 1.5e-229 | % identity: 71.21
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNFVSSS S  LQ PYVP  KQTLLDRIKNCSTINELDG+Y SMIK NATQDCFLVNQFISASLTFN VDYPVLAFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLES+VLP+SYTFSSLVKACT M A++LGRMIHC IW +GL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGAL+LGKEIHHY MS+G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRFLSM RDYGIHPEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AV++L ILEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HW+EVAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASAD HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

XP_023554768.1 pentatricopeptide repeat-containing protein At1g06143 [Cucurbita pepo subsp. pepo] | e value: 7.7e-229 | % identity: 71.38
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNF SSS    LQ  YV   KQTLLDRIKNCSTINELDG+Y SMIKTNATQDCFLVNQFISASLTFN VDYPVLAFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLES+VLP+SYTFSSLVKACT M A++LGRMIHCHIWKNGL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGALDLGKEIHHY MS G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRF SM RDYGI PEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AV+QL ILEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HWMEVAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASAD HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

XP_038888390.1 pentatricopeptide repeat-containing protein At1g06143 [Benincasa hispida] | e value: 3.8e-236 | % identity: 72.56
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS   TNALKQ TRSISNFVSSSIS P Q P +P  KQTLL+RIKNCSTINELDG+Y SMIKTNATQDCFLVNQFIS SL FN VDYPV+AFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHML-ESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK-------
        VFVYNAMIRGFV+CG+PF ALQCYVHML E+KV PTSYTFSSLVKACTFM AVELGRMIHCHIWK+G +SH+FVQTALIDFYSNLERL EARK       
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHML-ESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPN
        VT+STVVSACAHVGAL+LGK IHHYVMSQG+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE I PN
Subjt:  VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGI PE+ HYGCMVDMLSKAG LDEALELIKSME EPNSIIWGALLNGCKLHGN EIA++AVQQLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEP

Query:  MNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILV
        M+SGHYNLLVSM+AEE+ WMEVAHIRAMMKEQGVEKKYPGSSWIEL+GRIHQFSASADSHPDSDEIYF+LTELDGQLKLAGYI E  VCCN LV
Subjt:  MNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LB99 Uncharacterized protein | e value: 2.5e-225 | % identity: 68.78
Query:  MFSVATTNALKQTTRSISNFVS-SSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENP
        MFS  TTNALKQ TRSI NFVS  SIS PLQ P  P  KQTLL+RIKNCSTINEL G+  SMIKTNA QDCFLV+QFISAS   N V YPV AFTQMENP
Subjt:  MFSVATTNALKQTTRSISNFVS-SSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENP

Query:  NVFVYNAMIRGFVHCGHPFRALQCYVHML-ESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK------
        NVFVYNAMI+GFV+CG+PFRALQCYVHML ES VLPTSYTFSSLVKACTFM AVELG+M+HCHIWK G +SH+FVQTAL+DFYS LE L EARK      
Subjt:  NVFVYNAMIRGFVHCGHPFRALQCYVHML-ESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMP
         VTMSTV SACAH+GAL+LGKEIHHYVMSQG+NLDVY+GSALVDMYAKCGSLD SLL+FFKL DKNLYCWNAVIEGLAVHGYAEKALRMF IMERE IMP
Subjt:  -VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMP

Query:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILE
        NGVTFISILSACTHAGLV+EGRSRFLSMTRDY I P++ HYGCMVDMLSK+G L+EALELIKSME EPNSIIWGALLNGCKLHGN EIAE+AV+QLMILE
Subjt:  NGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILE

Query:  PMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE
        PMNSGHYNLLVSM+AEE+ WMEVAHIR+MMKE+GVEKKYPGSSWIELEG IHQFSASADSHPDSD+IYF+LTELDGQLKLAGYI E SVC   L+++EE
Subjt:  PMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE

A0A5A7T9J0 Pentatricopeptide repeat-containing protein | e value: 6.8e-223 | % identity: 68.9
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS  TT ALKQ TRSI NFVS SIS PLQ P  P  KQTLL+RIKNCS INEL  VY SMIK+NA QDCFLV+QFISAS  FN V YPV AFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLE-SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK-------
        VFVYNAMI+GFV+ G+PFR LQCYVHMLE S VLP SYTFSSLVKACTFM AVELG+M+HCHIWK G +SH+FVQTAL+DFYS LE+L EARK       
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLE-SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPN
        VTMSTVVSACAHVGAL+LGKEIH YVMSQG+N DVY+GSALVDMYAKCGSLD SLL+FFKL+DKNLYCWNAVIEGLAVHGYAEKALRMF IMERE I+PN
Subjt:  VTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPN

Query:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEP
        GVTFISILSACTHAGLVEEGRSRFLSMTRDYGI PE+ HYGCMVDMLSKAGLL EALELIKSME EPNSIIWGALLNGCKLHGN  IA++AV+QLMILEP
Subjt:  GVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEP

Query:  MNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE
        MNSGHYNLLVSM AEE+ WMEVAHIR MMKEQGVEKKYPGSSWIELEG IHQFSASADSHPDSD+IYF+LTELDGQLKLAGYI E SVC   LV+ EE
Subjt:  MNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE

A0A6J1BWV2 pentatricopeptide repeat-containing protein At1g06143 | e value: 1.0e-223 | % identity: 68.17
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        M S  T NALKQ TR ISN V S IS PLQ+P VP  K+TLLDRIKNC TI+ELDGVY SMIKTNA QDCFLVNQFISASLTF+ VDYPVLAFTQM+NPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFVHCGH  +ALQCYVHMLESKV PTSYTFSSLVKACT + AVELGR+IH HIWKNGL+SHVFVQTALIDFYSNL +L E+RK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TM+ V+S+CAHVGAL+LGK+IHHYVMSQG+N DVY+GSALVDMYAKCGSLDRSLLVFFKL+ KNLYCWNAVIEGLAVHGYAEKAL MFV MERE I+PNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        ++FIS+LSACTHAGLVEEGR RFLSMT DYGI PEVEHYGCMV+MLSKAGLLDEALELI+SM+  PNSIIWGALLNGCKLHGNLEIA++AVQQL+ILEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE
        NSGH+NLLVSM+AEE+HWMEVA+IRAMMKEQGVEKKYPGSSWIEL+GRIH FSASA+SHPDSD+IYFIL ELD QLKL G IPE S+C N LV+TEE
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE

A0A6J1HIB3 pentatricopeptide repeat-containing protein At1g06143 | e value: 2.8e-229 | % identity: 71.21
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNFVSSS    LQ  YV   KQTLLDRIKNCSTINELDG+Y SMIKTNATQDCFLVNQFISASLTFN VDYPVLAFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLESKVLP+SYTFSSLVKACT M A++LGRMIHCHIWKNGL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGAL+LGKEIHHY MS+G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRF SM RDYGI PEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AVQQL +LEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HWM+VAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASA+ HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

A0A6J1HUY6 pentatricopeptide repeat-containing protein At1g06143 | e value: 7.5e-230 | % identity: 71.21
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        MFS+  TNALKQ TRSISNFVSSS S  LQ PYVP  KQTLLDRIKNCSTINELDG+Y SMIK NATQDCFLVNQFISASLTFN VDYPVLAFTQMENPN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNAMIRGFV+CG+PFRA+QCYVHMLES+VLP+SYTFSSLVKACT M A++LGRMIHC IW +GL+  VFVQT+LID YSNLER G+ARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTVVSACAHVGAL+LGKEIHHY MS+G+NLDVY+GSALVDMYAKCGSLDRSLLVFFKL+DKNLYCWNAVIEGLAVHGYAEKALRMFVIMERE IMPNG
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTFISILSACTHAGLV EGRSRFLSM RDYGIHPEVEHYGCMVDMLSKAGLLDEALELI  ME EPNSIIWGALLNGCKLHGN EIA++AV++L ILEP 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV
        NSGHYNLLVSM+AEE+HW+EVAHIRAMMKE GVEKKYPGSSWIELEGRIHQFSASAD HPDSD+IYFILTELDGQLKLAG + E SV
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSV

SwissProt top hits (e value, % identity, alignment)
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 | e value: 7.0e-84 | % identity: 36.64
Query:  VDYPVLAFTQMENP-NVFVYNAMIRGFVHCGHPFRALQCYVHM-LESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN
        + Y    F+++E P NVF++N +IRG+   G+   A   Y  M +   V P ++T+  L+KA T M  V LG  IH  + ++G  S ++VQ +L+  Y+N
Subjt:  VDYPVLAFTQMENP-NVFVYNAMIRGFVHCGHPFRALQCYVHM-LESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN

Query:  LERLGEARKV---------------------------------------------TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYA
           +  A KV                                             T+ +++SACA +GAL LGK +H Y++  G+  +++  + L+D+YA
Subjt:  LERLGEARKV---------------------------------------------TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYA

Query:  KCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIME-REPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVD
        +CG ++ +  +F ++ DKN   W ++I GLAV+G+ ++A+ +F  ME  E ++P  +TF+ IL AC+H G+V+EG   F  M  +Y I P +EH+GCMVD
Subjt:  KCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIME-REPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVD

Query:  MLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIE
        +L++AG + +A E IKSM ++PN +IW  LL  C +HG+ ++AE A  Q++ LEP +SG Y LL +M+A E+ W +V  IR  M   GV KK PG S +E
Subjt:  MLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIE

Query:  LEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE
        +  R+H+F     SHP SD IY  L E+ G+L+  GY+P++S   N+ V  EE
Subjt:  LEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE

Q56X05 Pentatricopeptide repeat-containing protein At1g06143 | e value: 9.1e-132 | % identity: 44.52
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        M + A  ++L+  +  + +F +S   +P      P LK+     IK CST   L+    +MIKT+  QDC L+NQFI+A  +F  +D  V   TQM+ PN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNA+ +GFV C HP R+L+ YV ML   V P+SYT+SSLVKA +F  A   G  +  HIWK G   HV +QT LIDFYS   R+ EARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTV+SACAH+G L++GKE+H Y +  G  LDVY+GSALVDMY+KCGSL+R+LLVFF L  KNL+CWN++IEGLA HG+A++AL+MF  ME E + PN 
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTF+S+ +ACTHAGLV+EGR  + SM  DY I   VEHYG MV + SKAGL+ EALELI +ME EPN++IWGALL+GC++H NL IAE A  +LM+LEPM
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE
        NSG+Y LLVSM+AE+  W +VA IR  M+E G+EK  PG+S I ++ R H F+A+  SH  SDE+  +L E+  Q+ LAGY+ E
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE

Q9C6T2 Pentatricopeptide repeat-containing protein At1g31920 | e value: 1.3e-82 | % identity: 34.48
Query:  KQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNC--------VDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLE
        +Q  L  +K C  I+E   V+   IK +     F  + F ++S+   C        ++Y    F  +++P  F +N MIRG+V+      AL  Y  M++
Subjt:  KQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNC--------VDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLE

Query:  SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN----------LERLGEARKVTMSTVVS-------------------
            P ++T+  L+KACT ++++  G+ IH  ++K GL++ VFVQ +LI+ Y             E+L      + S++VS                   
Subjt:  SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN----------LERLGEARKVTMSTVVS-------------------

Query:  -----------------ACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVI
                         ACA+ GAL+LG  IH +++     L++ V ++LVDMY KCG LD++L +F K+  +N   ++A+I GLA+HG  E ALRMF  
Subjt:  -----------------ACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVI

Query:  MEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEA
        M +E + P+ V ++S+L+AC+H+GLV+EGR  F  M ++  + P  EHYGC+VD+L +AGLL+EALE I+S+ +E N +IW   L+ C++  N+E+ + A
Subjt:  MEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEA

Query:  VQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELS
         Q+L+ L   N G Y L+ +++++ + W +VA  R  +  +G+ K+ PG S +EL+G+ H+F +   SHP   EIY +L +++ QLK  GY P+L+
Subjt:  VQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELS

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 | e value: 3.1e-87 | % identity: 34.67
Query:  IKNCSTINELDGVYVSMIKTNATQDCFLVNQFIS---ASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSS
        ++ CS   EL  ++  M+KT   QD + + +F+S   +S + + + Y  + F   + P+ F++N MIRGF     P R+L  Y  ML S     +YTF S
Subjt:  IKNCSTINELDGVYVSMIKTNATQDCFLVNQFIS---ASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSS

Query:  LVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYS-----------------------------------------------------------
        L+KAC+ + A E    IH  I K G ++ V+   +LI+ Y+                                                           
Subjt:  LVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYS-----------------------------------------------------------

Query:  -------NLERLG----------EARKVTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVI
               N E L           E   V+++  +SACA +GAL+ GK IH Y+    I +D  +G  L+DMYAKCG ++ +L VF  ++ K++  W A+I
Subjt:  -------NLERLG----------EARKVTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVI

Query:  EGLAVHGYAEKALRMFVIMEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWG
         G A HG+  +A+  F+ M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY + P +EHYGC+VD+L +AGLLDEA   I+ M L+PN++IWG
Subjt:  EGLAVHGYAEKALRMFVIMEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWG

Query:  ALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTEL
        ALL  C++H N+E+ EE  + L+ ++P + G Y    ++HA ++ W + A  R +MKEQGV  K PG S I LEG  H+F A   SHP+ ++I      +
Subjt:  ALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTEL

Query:  DGQLKLAGYIPELSVCCNILVYTEE
          +L+  GY+PEL      LV  +E
Subjt:  DGQLKLAGYIPELSVCCNILVYTEE

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300 | e value: 3.0e-82 | % identity: 34.09
Query:  TLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTF
        +LL  +     +     ++  +++ + + D FL +  I A      V      F+Q  + +V V+ AMI G++H G    +L+ +  +++ K+ P   T 
Subjt:  TLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTF

Query:  SSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK---------------------------------------------VT
         S++     + A++LGR +H  I K G D+   +  A+ID Y+   R+  A +                                             V+
Subjt:  SSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK---------------------------------------------VT

Query:  MSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMF-VIMEREPIMPNG
        +S  +SACA++ +   GK IH +++   +  DVY  S L+DMYAKCG+L  ++ VF  +++KN+  WN++I     HG  + +L +F  ++E+  I P+ 
Subjt:  MSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMF-VIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        +TF+ I+S+C H G V+EG   F SMT DYGI P+ EHY C+VD+  +AG L EA E +KSM   P++ +WG LL  C+LH N+E+AE A  +LM L+P 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE
        NSG+Y L+ + HA  R W  V  +R++MKE+ V+ K PG SWIE+  R H F +   +HP+S  IY +L  L G+L+L GYIP+
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE

Arabidopsis top hits (e value, % identity, alignment)
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 6.4e-133 | % identity: 44.52
Query:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN
        M + A  ++L+  +  + +F +S   +P      P LK+     IK CST   L+    +MIKT+  QDC L+NQFI+A  +F  +D  V   TQM+ PN
Subjt:  MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPN

Query:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------
        VFVYNA+ +GFV C HP R+L+ YV ML   V P+SYT+SSLVKA +F  A   G  +  HIWK G   HV +QT LIDFYS   R+ EARK        
Subjt:  VFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK--------

Query:  ---------------------------------------------------------------------------------------------------V
                                                                                                           V
Subjt:  ---------------------------------------------------------------------------------------------------V

Query:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG
        TMSTV+SACAH+G L++GKE+H Y +  G  LDVY+GSALVDMY+KCGSL+R+LLVFF L  KNL+CWN++IEGLA HG+A++AL+MF  ME E + PN 
Subjt:  TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        VTF+S+ +ACTHAGLV+EGR  + SM  DY I   VEHYG MV + SKAGL+ EALELI +ME EPN++IWGALL+GC++H NL IAE A  +LM+LEPM
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE
        NSG+Y LLVSM+AE+  W +VA IR  M+E G+EK  PG+S I ++ R H F+A+  SH  SDE+  +L E+  Q+ LAGY+ E
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE

AT1G31920.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 9.4e-84 | % identity: 34.48
Query:  KQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNC--------VDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLE
        +Q  L  +K C  I+E   V+   IK +     F  + F ++S+   C        ++Y    F  +++P  F +N MIRG+V+      AL  Y  M++
Subjt:  KQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNC--------VDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLE

Query:  SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN----------LERLGEARKVTMSTVVS-------------------
            P ++T+  L+KACT ++++  G+ IH  ++K GL++ VFVQ +LI+ Y             E+L      + S++VS                   
Subjt:  SKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN----------LERLGEARKVTMSTVVS-------------------

Query:  -----------------ACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVI
                         ACA+ GAL+LG  IH +++     L++ V ++LVDMY KCG LD++L +F K+  +N   ++A+I GLA+HG  E ALRMF  
Subjt:  -----------------ACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVI

Query:  MEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEA
        M +E + P+ V ++S+L+AC+H+GLV+EGR  F  M ++  + P  EHYGC+VD+L +AGLL+EALE I+S+ +E N +IW   L+ C++  N+E+ + A
Subjt:  MEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEA

Query:  VQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELS
         Q+L+ L   N G Y L+ +++++ + W +VA  R  +  +G+ K+ PG S +EL+G+ H+F +   SHP   EIY +L +++ QLK  GY P+L+
Subjt:  VQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELS

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 5.0e-85    %identity: 36.64
Query:  VDYPVLAFTQMENP-NVFVYNAMIRGFVHCGHPFRALQCYVHM-LESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN
        + Y    F+++E P NVF++N +IRG+   G+   A   Y  M +   V P ++T+  L+KA T M  V LG  IH  + ++G  S ++VQ +L+  Y+N
Subjt:  VDYPVLAFTQMENP-NVFVYNAMIRGFVHCGHPFRALQCYVHM-LESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSN

Query:  LERLGEARKV---------------------------------------------TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYA
           +  A KV                                             T+ +++SACA +GAL LGK +H Y++  G+  +++  + L+D+YA
Subjt:  LERLGEARKV---------------------------------------------TMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYA

Query:  KCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIME-REPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVD
        +CG ++ +  +F ++ DKN   W ++I GLAV+G+ ++A+ +F  ME  E ++P  +TF+ IL AC+H G+V+EG   F  M  +Y I P +EH+GCMVD
Subjt:  KCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIME-REPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVD

Query:  MLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIE
        +L++AG + +A E IKSM ++PN +IW  LL  C +HG+ ++AE A  Q++ LEP +SG Y LL +M+A E+ W +V  IR  M   GV KK PG S +E
Subjt:  MLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIE

Query:  LEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE
        +  R+H+F     SHP SD IY  L E+ G+L+  GY+P++S   N+ V  EE
Subjt:  LEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEE

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 2.1e-83    %identity: 34.09
Query:  TLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTF
        +LL  +     +     ++  +++ + + D FL +  I A      V      F+Q  + +V V+ AMI G++H G    +L+ +  +++ K+ P   T 
Subjt:  TLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTF

Query:  SSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK---------------------------------------------VT
         S++     + A++LGR +H  I K G D+   +  A+ID Y+   R+  A +                                             V+
Subjt:  SSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARK---------------------------------------------VT

Query:  MSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMF-VIMEREPIMPNG
        +S  +SACA++ +   GK IH +++   +  DVY  S L+DMYAKCG+L  ++ VF  +++KN+  WN++I     HG  + +L +F  ++E+  I P+ 
Subjt:  MSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMF-VIMEREPIMPNG

Query:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM
        +TF+ I+S+C H G V+EG   F SMT DYGI P+ EHY C+VD+  +AG L EA E +KSM   P++ +WG LL  C+LH N+E+AE A  +LM L+P 
Subjt:  VTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPM

Query:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE
        NSG+Y L+ + HA  R W  V  +R++MKE+ V+ K PG SWIE+  R H F +   +HP+S  IY +L  L G+L+L GYIP+
Subjt:  NSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPE

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 2.2e-88    %identity: 34.67
Query:  IKNCSTINELDGVYVSMIKTNATQDCFLVNQFIS---ASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSS
        ++ CS   EL  ++  M+KT   QD + + +F+S   +S + + + Y  + F   + P+ F++N MIRGF     P R+L  Y  ML S     +YTF S
Subjt:  IKNCSTINELDGVYVSMIKTNATQDCFLVNQFIS---ASLTFNCVDYPVLAFTQMENPNVFVYNAMIRGFVHCGHPFRALQCYVHMLESKVLPTSYTFSS

Query:  LVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYS-----------------------------------------------------------
        L+KAC+ + A E    IH  I K G ++ V+   +LI+ Y+                                                           
Subjt:  LVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYS-----------------------------------------------------------

Query:  -------NLERLG----------EARKVTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVI
               N E L           E   V+++  +SACA +GAL+ GK IH Y+    I +D  +G  L+DMYAKCG ++ +L VF  ++ K++  W A+I
Subjt:  -------NLERLG----------EARKVTMSTVVSACAHVGALDLGKEIHHYVMSQGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVI

Query:  EGLAVHGYAEKALRMFVIMEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWG
         G A HG+  +A+  F+ M++  I PN +TF ++L+AC++ GLVEEG+  F SM RDY + P +EHYGC+VD+L +AGLLDEA   I+ M L+PN++IWG
Subjt:  EGLAVHGYAEKALRMFVIMEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVEHYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWG

Query:  ALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTEL
        ALL  C++H N+E+ EE  + L+ ++P + G Y    ++HA ++ W + A  R +MKEQGV  K PG S I LEG  H+F A   SHP+ ++I      +
Subjt:  ALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEGRIHQFSASADSHPDSDEIYFILTEL

Query:  DGQLKLAGYIPELSVCCNILVYTEE
          +L+  GY+PEL      LV  +E
Subjt:  DGQLKLAGYIPELSVCCNILVYTEE


Sequences
CDS sequence
ATGTTTTCAGTTGCGACTACCAACGCTCTTAAACAGACAACAAGAAGCATCAGCAACTTCGTAAGTTCCTCAATCTCAAGCCCTCTTCAACGACCGTATGTTCCTTATTT
AAAGCAAACTCTACTCGATAGAATCAAGAACTGCTCCACAATAAACGAACTGGACGGTGTATATGTTTCAATGATCAAAACTAATGCAACCCAAGATTGCTTTCTGGTGA
ATCAGTTTATTAGCGCGTCTTTGACGTTTAACTGTGTGGATTATCCAGTTTTGGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAGGGGA
TTTGTACATTGTGGGCACCCATTTCGAGCTTTGCAATGTTATGTACATATGTTAGAATCGAAGGTTTTGCCAACTAGTTATACGTTTTCATCATTAGTTAAAGCTTGCAC
CTTCATGCGTGCAGTTGAGTTGGGACGGATGATTCATTGTCATATTTGGAAGAATGGGCTTGATTCACATGTGTTTGTTCAAACTGCTTTGATTGATTTTTACTCAAATT
TGGAGAGACTTGGTGAAGCGAGAAAGGTAACGATGTCTACTGTTGTTTCAGCTTGTGCCCATGTTGGAGCTCTTGATCTAGGAAAAGAGATACATCATTATGTAATGTCT
CAGGGGATTAATCTTGATGTTTATGTTGGATCTGCATTAGTTGATATGTATGCCAAGTGCGGGAGTTTAGATCGGTCACTTTTGGTTTTCTTCAAATTGAGGGATAAAAA
TTTATATTGCTGGAATGCAGTGATTGAAGGGCTTGCTGTTCATGGTTATGCAGAGAAGGCGCTAAGGATGTTCGTCATCATGGAGAGGGAGCCAATCATGCCCAATGGTG
TTACTTTCATTAGCATATTAAGCGCCTGCACACATGCAGGGTTGGTTGAAGAAGGCAGAAGTAGATTCTTGAGCATGACTCGTGATTACGGCATCCATCCTGAAGTCGAA
CACTATGGTTGCATGGTTGATATGTTGAGTAAAGCAGGATTGCTTGATGAGGCGTTAGAATTGATTAAAAGTATGGAACTTGAACCAAACTCTATCATTTGGGGAGCCTT
GTTAAATGGGTGCAAGCTTCATGGGAACTTGGAGATTGCTGAAGAAGCTGTCCAACAGTTGATGATTTTGGAGCCCATGAACAGTGGGCATTACAATCTTTTGGTTAGCA
TGCATGCTGAAGAAAGACATTGGATGGAGGTTGCCCATATCCGGGCAATGATGAAAGAACAAGGGGTAGAAAAGAAATATCCCGGGTCAAGTTGGATTGAATTGGAAGGG
AGAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACGAAATATACTTCATACTGACAGAGTTAGATGGACAGCTGAAACTAGCTGGTTACATACCCGA
GCTTTCAGTATGCTGTAATATTTTGGTTTATACGGAGGAATTTTGA
mRNA sequence
ATGTTTTCAGTTGCGACTACCAACGCTCTTAAACAGACAACAAGAAGCATCAGCAACTTCGTAAGTTCCTCAATCTCAAGCCCTCTTCAACGACCGTATGTTCCTTATTT
AAAGCAAACTCTACTCGATAGAATCAAGAACTGCTCCACAATAAACGAACTGGACGGTGTATATGTTTCAATGATCAAAACTAATGCAACCCAAGATTGCTTTCTGGTGA
ATCAGTTTATTAGCGCGTCTTTGACGTTTAACTGTGTGGATTATCCAGTTTTGGCCTTTACCCAGATGGAAAATCCTAATGTTTTTGTGTATAATGCGATGATTAGGGGA
TTTGTACATTGTGGGCACCCATTTCGAGCTTTGCAATGTTATGTACATATGTTAGAATCGAAGGTTTTGCCAACTAGTTATACGTTTTCATCATTAGTTAAAGCTTGCAC
CTTCATGCGTGCAGTTGAGTTGGGACGGATGATTCATTGTCATATTTGGAAGAATGGGCTTGATTCACATGTGTTTGTTCAAACTGCTTTGATTGATTTTTACTCAAATT
TGGAGAGACTTGGTGAAGCGAGAAAGGTAACGATGTCTACTGTTGTTTCAGCTTGTGCCCATGTTGGAGCTCTTGATCTAGGAAAAGAGATACATCATTATGTAATGTCT
CAGGGGATTAATCTTGATGTTTATGTTGGATCTGCATTAGTTGATATGTATGCCAAGTGCGGGAGTTTAGATCGGTCACTTTTGGTTTTCTTCAAATTGAGGGATAAAAA
TTTATATTGCTGGAATGCAGTGATTGAAGGGCTTGCTGTTCATGGTTATGCAGAGAAGGCGCTAAGGATGTTCGTCATCATGGAGAGGGAGCCAATCATGCCCAATGGTG
TTACTTTCATTAGCATATTAAGCGCCTGCACACATGCAGGGTTGGTTGAAGAAGGCAGAAGTAGATTCTTGAGCATGACTCGTGATTACGGCATCCATCCTGAAGTCGAA
CACTATGGTTGCATGGTTGATATGTTGAGTAAAGCAGGATTGCTTGATGAGGCGTTAGAATTGATTAAAAGTATGGAACTTGAACCAAACTCTATCATTTGGGGAGCCTT
GTTAAATGGGTGCAAGCTTCATGGGAACTTGGAGATTGCTGAAGAAGCTGTCCAACAGTTGATGATTTTGGAGCCCATGAACAGTGGGCATTACAATCTTTTGGTTAGCA
TGCATGCTGAAGAAAGACATTGGATGGAGGTTGCCCATATCCGGGCAATGATGAAAGAACAAGGGGTAGAAAAGAAATATCCCGGGTCAAGTTGGATTGAATTGGAAGGG
AGAATTCATCAGTTTTCAGCTTCAGCTGATTCTCACCCTGATTCTGACGAAATATACTTCATACTGACAGAGTTAGATGGACAGCTGAAACTAGCTGGTTACATACCCGA
GCTTTCAGTATGCTGTAATATTTTGGTTTATACGGAGGAATTTTGA
Protein sequence
MFSVATTNALKQTTRSISNFVSSSISSPLQRPYVPYLKQTLLDRIKNCSTINELDGVYVSMIKTNATQDCFLVNQFISASLTFNCVDYPVLAFTQMENPNVFVYNAMIRG
FVHCGHPFRALQCYVHMLESKVLPTSYTFSSLVKACTFMRAVELGRMIHCHIWKNGLDSHVFVQTALIDFYSNLERLGEARKVTMSTVVSACAHVGALDLGKEIHHYVMS
QGINLDVYVGSALVDMYAKCGSLDRSLLVFFKLRDKNLYCWNAVIEGLAVHGYAEKALRMFVIMEREPIMPNGVTFISILSACTHAGLVEEGRSRFLSMTRDYGIHPEVE
HYGCMVDMLSKAGLLDEALELIKSMELEPNSIIWGALLNGCKLHGNLEIAEEAVQQLMILEPMNSGHYNLLVSMHAEERHWMEVAHIRAMMKEQGVEKKYPGSSWIELEG
RIHQFSASADSHPDSDEIYFILTELDGQLKLAGYIPELSVCCNILVYTEEF