; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G08960 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G08960
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr04:22638224..22643976
RNA-Seq ExpressionClc04G08960
SyntenyClc04G08960
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603691.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.6e-26286.53Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGF IV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVDMCLELFQ+M+ MALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSLQAI+LFKAMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLD+ ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEV-DEVDDVLLPTS
         +VNHMRSVGCV EV DE++D LL TS
Subjt:  SIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_022949850.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata]4.9e-26186.15Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGF IV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVDMCLELFQ+M+ MALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        S+IAGYAQHGLSLQAI+LFKAMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+N I+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEVD-EVDDVLLPTS
         +VNHMRSVGCV EVD E++D LL TS
Subjt:  SIVNHMRSVGCVHEVD-EVDDVLLPTS

XP_022977771.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita maxima]1.7e-26186.15Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH+LF ISQSSALL RHGF IV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS VLDQERESGHWD QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVDMCLELFQ+M+ MALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSL+AI+LF+AMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYL+D ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEV-DEVDDVLLPTS
         +VNHMRSV CV EV DE++D LL  S
Subjt:  SIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_023544680.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pepo]1.1e-26085.96Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH LF IS+SSALL RHGF IV SIRLFS FK     TT+LPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS VLDQERESGHW  QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        +IAGFA EWQVDMCLELFQQMK MAL+PNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSLQAI+LF+AMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKT+PGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEV-DEVDDVLLPTS
         +VNHMRSVGCV EV DE+++ LL TS
Subjt:  SIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_038881286.1 pentatricopeptide repeat-containing protein At2g37320 isoform X1 [Benincasa hispida]4.6e-26787.64Show/hide
Query:  RPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTK
        R L ++F ISQSSALLYRHGF +V SIRLFSNFKPK  S+TNLPKPP+LL+LISPKGN A+E+RQTHLRLIQDFL+TDSDQ RSQTL+DGFDSHS+V +K
Subjt:  RPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTK

Query:  DSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAI
        DSS VL QERESGHWDLQLFAGRFKFDANDISS LSLCSSQ NLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQ+FEEMPVRNVVSWTAI
Subjt:  DSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAI

Query:  IAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNS
        I+GFAVEW VDMCL+LFQQMK MALQPNEFTF TILSACTGSGALG+GRSLHCQTFKMG DS LH+AN LISMYCKCGA+N ALYIFEAMEVKDIVSWNS
Subjt:  IAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNS

Query:  MIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIV
        MIAGYAQHGLSLQAI+L+  MRKQKQVEADAITFLGVLSSCRHAGLVEEG+YYFNLMVELG+KPELDHY+CVIDLLGRAGLLKEA+NFI++MPISPNSIV
Subjt:  MIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIV

Query:  WGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDS
        WGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQ+ANLYAKAGYL+D ARLRKMMKDKGLKT PGYSWIEIQNKVYRFKAEDKSNPVMVEI GLMD 
Subjt:  WGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDS

Query:  IVNHMRSVGCVHEV-DEVDDVLLPTS
        I+NHMR +G  HEV DEVDD+ L TS
Subjt:  IVNHMRSVGCVHEV-DEVDDVLLPTS

TrEMBL top hitse value%identityAlignment
A0A0A0KX36 Uncharacterized protein1.2e-25786.38Show/hide
Query:  LHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDS
        LH+LF ISQSSA  YRHGF ++ SIR FSNFK   H TTNLPKPPRLL+LISPKG+V+ ESRQTHLRLIQDFL+TDS Q RSQTL  G DS S+ L+KDS
Subjt:  LHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDS

Query:  SFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA
        SFVLDQE ESGHWD+Q FAGRFKF+ANDISS LSLC+SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAY++F+EMPVRNVVSWTAIIA
Subjt:  SFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA

Query:  GFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI
        GFAVEWQV+MCLELFQ+MK MALQPNEFTF TIL+ACTGSGALGVGRSLHCQT KMG  S LH+AN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMI
Subjt:  GFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI

Query:  AGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG
        AGYAQHGLSL+AI+LFKAMRKQKQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPI+PNSIVWG
Subjt:  AGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG

Query:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIV
        SLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL NLYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGL+D +V
Subjt:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIV

Query:  NHMRSVGCVHEVDE
        NHMR VGC HE+++
Subjt:  NHMRSVGCVHEVDE

A0A1S3BGK7 pentatricopeptide repeat-containing protein At2g373201.2e-26086.83Show/hide
Query:  LHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDS
        LH+LFAISQSSA  YRHGF +V S+R FSNFK   H TTNLPKP RLL+LISPKG+V+ ESRQTHLRLIQDFL+TD DQ RSQTL+ GFDS SV L+KDS
Subjt:  LHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDS

Query:  SFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA
        SFVLDQE ESGHWD+Q FAGRFKF ANDISS LSLC+SQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMF+EMPVRNVVSWTAIIA
Subjt:  SFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA

Query:  GFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI
        GFAVEWQV+MCLELF++MK MALQPNEFTF TIL+ACTGSGALGVGRSLHCQTFKMG  S LHIAN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMI
Subjt:  GFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI

Query:  AGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG
        AGYAQHGLS +AI+LFKAMRKQKQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMP+SPNSI+WG
Subjt:  AGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG

Query:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIV
        SLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL  LYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGLMD +V
Subjt:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIV

Query:  NHMRSVGCVHEV-DEVDDVLLPTS
        NHMR VG  HE+ DEVDDVLL TS
Subjt:  NHMRSVGCVHEV-DEVDDVLLPTS

A0A5A7U1F7 Pentatricopeptide repeat-containing protein2.9e-25986.78Show/hide
Query:  LLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSF
        +LFAISQSSA  YRHGF +V S+R FSNFK   H TTNLPKP RLL+LISPKG+V+ ESRQTHLRLIQDFL+TD DQ RSQTL+ GFDS SV L+KDSSF
Subjt:  LLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSF

Query:  VLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGF
        VLDQE ESGHWD+Q FAGRFKF ANDISS LSLC+SQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMF+EMPVRNVVSWTAIIAGF
Subjt:  VLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGF

Query:  AVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAG
        AVEWQV+MCLELF++MK MALQPNEFTF TIL+ACTGSGALGVGRSLHCQTFKMG  S LHIAN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMIAG
Subjt:  AVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAG

Query:  YAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSL
        YAQHGLS +AI+LFKAMRKQKQVEADAITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMP+SPNSI+WGSL
Subjt:  YAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSL

Query:  LSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNH
        LSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL  LYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGLMD +VNH
Subjt:  LSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNH

Query:  MRSVGCVHEV-DEVDDVLLPTS
        MR VG  HE+ DEVDDVLL TS
Subjt:  MRSVGCVHEV-DEVDDVLLPTS

A0A6J1GDY6 pentatricopeptide repeat-containing protein At2g373202.4e-26186.15Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGF IV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS V DQERESGHW  QLFAGRF+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVDMCLELFQ+M+ MALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        S+IAGYAQHGLSLQAI+LFKAMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+N I+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEVD-EVDDVLLPTS
         +VNHMRSVGCV EVD E++D LL TS
Subjt:  SIVNHMRSVGCVHEVD-EVDDVLLPTS

A0A6J1IKW6 pentatricopeptide repeat-containing protein At2g373208.2e-26286.15Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT
        +R LH+LF ISQSSALL RHGF IV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQ RSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLT

Query:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        KDSS VLDQERESGHWD QLFAGRF+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVDMCLELFQ+M+ MALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSL+AI+LF+AMRKQ+QVEAD ITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYL+D ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  SIVNHMRSVGCVHEV-DEVDDVLLPTS
         +VNHMRSV CV EV DE++D LL  S
Subjt:  SIVNHMRSVGCVHEV-DEVDDVLLPTS

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.7e-7837.89Show/hide
Query:  SSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSA
        ++  ++R G   HSV IR+GF + +YV +SL+ LY  CG++++AY++F++MP +++V+W ++I GFA   + +  L L+ +M    ++P+ FT  ++LSA
Subjt:  SSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSA

Query:  CTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVL
        C   GAL +G+ +H    K+GL   LH +NVL+ +Y +CG +  A  +F+ M  K+ VSW S+I G A +G   +AI LFK M   + +    ITF+G+L
Subjt:  CTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVL

Query:  SSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQ
         +C H G+V+EG  YF  M  E  ++P ++H+ C++DLL RAG +K+A  +IK MP+ PN ++W +LL AC +HG+  +   A    L L+PN +  ++ 
Subjt:  SSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQ

Query:  LANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV
        L+N+YA      DV ++RK M   G+K  PG+S +E+ N+V+ F   DKS+P    I+  +  +   +RS G V ++  V
Subjt:  LANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV

Q7XJN6 Pentatricopeptide repeat-containing protein At2g407204.7e-8139.79Show/hide
Query:  KFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMA
        K D++ ++S  + C+    LR G+Q H   I+TG + NV+VGSSL+ LY KCG    A ++F  M   N+V+W ++I+ ++     ++ ++LF  M    
Subjt:  KFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMA

Query:  LQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQ
        + P+  + T++L A + + +L  G+SLH  T ++G+ S  H+ N LI MY KCG    A  IF+ M+ K +++WN MI GY  HG  + A++LF  M+K 
Subjt:  LQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQ

Query:  KQVEADAITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAES
         +   D +TFL ++S+C H+G VEEG+  F  M  + G++P ++HY+ ++DLLGRAGLL+EA +FIK MPI  +S +W  LLSA R H NV +G+ +AE 
Subjt:  KQVEADAITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAES

Query:  RLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM
         L ++P   ST++QL NLY +AG  ++ A+L  +MK+KGL   PG SWIE+ ++   F +   S+P+  EIF +++ + ++M
Subjt:  RLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM

Q9LTF4 Putative pentatricopeptide repeat-containing protein At5g526302.6e-7937.5Show/hide
Query:  QLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELF
        ++ AG  + D + + S    C+       G   H ++++TG+ A+V+VGSSLV +Y KCGE+  A +MF+EMP RNVV+W+ ++ G+A   + +  L LF
Subjt:  QLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELF

Query:  QQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL
        ++     L  N+++F++++S C  S  L +GR +H  + K   DS   + + L+S+Y KCG    A  +F  + VK++  WN+M+  YAQH  + + I L
Subjt:  QQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL

Query:  FKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG
        FK M K   ++ + ITFL VL++C HAGLV+EGRYYF+ M E  ++P   HY+ ++D+LGRAG L+EA   I  MPI P   VWG+LL++C +H N  + 
Subjt:  FKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG

Query:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV
          AA+    L P  +  H+ L+N YA  G  +D A+ RK+++D+G K   G SW+E +NKV+ F A ++ +    EI+  +  +   M   G + +   V
Subjt:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV

Q9SY02 Pentatricopeptide repeat-containing protein At4g027506.8e-8041.32Show/hide
Query:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD
        NV   +++++ Y +CG++S A  +F++MP R+ VSW A+IAG++        L LF QM+    + N  +F++ LS C    AL +G+ LH +  K G +
Subjt:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD

Query:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-EL
        +   + N L+ MYCKCG+I  A  +F+ M  KDIVSWN+MIAGY++HG    A+  F++M K++ ++ D  T + VLS+C H GLV++GR YF  M  + 
Subjt:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-EL

Query:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD
        G+ P   HY+C++DLLGRAGLL++A N +K MP  P++ +WG+LL A R+HGN  +   AA+    ++P  +  ++ L+NLYA +G   DV +LR  M+D
Subjt:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD

Query:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCV-------HEVDE
        KG+K  PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G V       H+V+E
Subjt:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCV-------HEVDE

Q9ZUT4 Pentatricopeptide repeat-containing protein At2g373201.4e-14655.9Show/hide
Query:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGG
        R+L++IS K    S +RQ H   +Q+F +TDS +FR Q +++ FD   +  TK+    + +E         +    + FDA  +SS +  C   R+ R G
Subjt:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGG

Query:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGV
          +H +A++ GFI++VY+GSSLV LY   GE+ NAY++FEEMP RNVVSWTA+I+GFA EW+VD+CL+L+ +M+     PN++TFT +LSACTGSGALG 
Subjt:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLV
        GRS+HCQT  MGL S LHI+N LISMYCKCG +  A  IF+    KD+VSWNSMIAGYAQHGL++QAI LF+ M  +   + DAIT+LGVLSSCRHAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY
        +EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   I+ MP+ PNS++WGSLL +CR+HG+VW G++AAE RL+L+P+CA+TH+QLANLYA  GY
Subjt:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY

Query:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM
          + A +RK+MKDKGLKT PG SWIEI N V+ FKAED SN  M+EI  ++  +++HM
Subjt:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM

Arabidopsis top hitse value%identityAlignment
AT2G37320.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-14755.9Show/hide
Query:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGG
        R+L++IS K    S +RQ H   +Q+F +TDS +FR Q +++ FD   +  TK+    + +E         +    + FDA  +SS +  C   R+ R G
Subjt:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGG

Query:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGV
          +H +A++ GFI++VY+GSSLV LY   GE+ NAY++FEEMP RNVVSWTA+I+GFA EW+VD+CL+L+ +M+     PN++TFT +LSACTGSGALG 
Subjt:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLV
        GRS+HCQT  MGL S LHI+N LISMYCKCG +  A  IF+    KD+VSWNSMIAGYAQHGL++QAI LF+ M  +   + DAIT+LGVLSSCRHAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY
        +EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   I+ MP+ PNS++WGSLL +CR+HG+VW G++AAE RL+L+P+CA+TH+QLANLYA  GY
Subjt:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY

Query:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM
          + A +RK+MKDKGLKT PG SWIEI N V+ FKAED SN  M+EI  ++  +++HM
Subjt:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM

AT2G40720.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-8239.79Show/hide
Query:  KFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMA
        K D++ ++S  + C+    LR G+Q H   I+TG + NV+VGSSL+ LY KCG    A ++F  M   N+V+W ++I+ ++     ++ ++LF  M    
Subjt:  KFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMA

Query:  LQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQ
        + P+  + T++L A + + +L  G+SLH  T ++G+ S  H+ N LI MY KCG    A  IF+ M+ K +++WN MI GY  HG  + A++LF  M+K 
Subjt:  LQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQ

Query:  KQVEADAITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAES
         +   D +TFL ++S+C H+G VEEG+  F  M  + G++P ++HY+ ++DLLGRAGLL+EA +FIK MPI  +S +W  LLSA R H NV +G+ +AE 
Subjt:  KQVEADAITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAES

Query:  RLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM
         L ++P   ST++QL NLY +AG  ++ A+L  +MK+KGL   PG SWIE+ ++   F +   S+P+  EIF +++ + ++M
Subjt:  RLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHM

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-8141.32Show/hide
Query:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD
        NV   +++++ Y +CG++S A  +F++MP R+ VSW A+IAG++        L LF QM+    + N  +F++ LS C    AL +G+ LH +  K G +
Subjt:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD

Query:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-EL
        +   + N L+ MYCKCG+I  A  +F+ M  KDIVSWN+MIAGY++HG    A+  F++M K++ ++ D  T + VLS+C H GLV++GR YF  M  + 
Subjt:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMV-EL

Query:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD
        G+ P   HY+C++DLLGRAGLL++A N +K MP  P++ +WG+LL A R+HGN  +   AA+    ++P  +  ++ L+NLYA +G   DV +LR  M+D
Subjt:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD

Query:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCV-------HEVDE
        KG+K  PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G V       H+V+E
Subjt:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCV-------HEVDE

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-7937.89Show/hide
Query:  SSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSA
        ++  ++R G   HSV IR+GF + +YV +SL+ LY  CG++++AY++F++MP +++V+W ++I GFA   + +  L L+ +M    ++P+ FT  ++LSA
Subjt:  SSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSA

Query:  CTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVL
        C   GAL +G+ +H    K+GL   LH +NVL+ +Y +CG +  A  +F+ M  K+ VSW S+I G A +G   +AI LFK M   + +    ITF+G+L
Subjt:  CTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVL

Query:  SSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQ
         +C H G+V+EG  YF  M  E  ++P ++H+ C++DLL RAG +K+A  +IK MP+ PN ++W +LL AC +HG+  +   A    L L+PN +  ++ 
Subjt:  SSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQ

Query:  LANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV
        L+N+YA      DV ++RK M   G+K  PG+S +E+ N+V+ F   DKS+P    I+  +  +   +RS G V ++  V
Subjt:  LANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV

AT5G52630.1 mitochondrial RNAediting factor 11.8e-8037.5Show/hide
Query:  QLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELF
        ++ AG  + D + + S    C+       G   H ++++TG+ A+V+VGSSLV +Y KCGE+  A +MF+EMP RNVV+W+ ++ G+A   + +  L LF
Subjt:  QLFAGRFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELF

Query:  QQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL
        ++     L  N+++F++++S C  S  L +GR +H  + K   DS   + + L+S+Y KCG    A  +F  + VK++  WN+M+  YAQH  + + I L
Subjt:  QQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL

Query:  FKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG
        FK M K   ++ + ITFL VL++C HAGLV+EGRYYF+ M E  ++P   HY+ ++D+LGRAG L+EA   I  MPI P   VWG+LL++C +H N  + 
Subjt:  FKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG

Query:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV
          AA+    L P  +  H+ L+N YA  G  +D A+ RK+++D+G K   G SW+E +NKV+ F A ++ +    EI+  +  +   M   G + +   V
Subjt:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGTAAGCATGTGTAACTCAAAAGTTGAAAATGAAAGAATGGTCATGAAGATGGAGATACCCAGTAAAGAACATGATTGTCAACCTTCATTTTCCAATGTTTATGA
ACTACGAGGAGAGCCTGCTATAGTAATTAATGGGGTGCCTGATATACCTACTTGTGACAATGCCCTTGCCCTTTGTAATTCTCTGAATGATGAAAAATTACTTGGAAGTA
CAGGCTTTGGTGAGTGGTTAGAAGGGAGAGTTGTTAACAAAATGTTTGGCGATCGTTATTACTATGGTGTCATAATCGAGTACGACAAAGTTACAGGATGGTATAGAGTG
GAGTATGAAGATGGAGATTTCGAAGATCTCGATTGGCATGCACTCGAACAAGTGCTTTTGCCGATGGACATTACAGTTCCATTAAAAGCCTTAGCATTGAAGACTCTGAA
GAGAAGCAGGAAGGCGCGGAAGAACAGGAAAAACAAGACGGGAAACAGGCAAGGTGAAACCAAAGAAATGGAAGGCAGGAGAAAGAAGAGTGTGGGAATTCGTCCTCTTC
ATTTGCTCTTTGCGATATCTCAATCTTCTGCACTTCTTTATAGACATGGCTTCCGCATAGTGGCCTCCATTAGGCTTTTCTCTAACTTCAAACCTAAGTTCCATTCGACC
ACGAACTTACCTAAACCTCCCAGACTCTTGGAGCTCATTTCTCCAAAGGGAAATGTCGCCTCTGAAAGTCGCCAAACTCATCTTCGGCTCATTCAGGACTTTTTACGAAC
AGATTCGGATCAATTTCGATCTCAAACCCTTACAGACGGTTTTGATTCTCATTCAGTTGTTTTAACCAAGGATTCGTCCTTTGTTCTTGATCAAGAACGTGAGTCTGGTC
ACTGGGATCTTCAGTTGTTCGCAGGAAGATTTAAATTTGATGCGAACGATATATCCAGCGGTTTGAGTTTGTGCAGTTCTCAGCGCAATCTTCGTGGTGGAATTCAGTAT
CATTCTGTGGCGATACGAACTGGGTTTATTGCCAATGTGTATGTAGGAAGTTCGTTGGTAAGTTTGTACGGGAAATGCGGGGAGTTGAGTAATGCATATCAAATGTTTGA
AGAAATGCCTGTGAGAAATGTTGTATCATGGACAGCCATTATTGCTGGGTTTGCTGTAGAATGGCAAGTTGATATGTGCTTGGAGCTTTTCCAACAGATGAAAATAATGG
CATTGCAACCAAATGAGTTTACTTTTACTACTATATTGAGCGCTTGCACTGGCAGTGGAGCCCTTGGAGTAGGAAGAAGCCTCCACTGTCAAACATTCAAAATGGGCCTT
GATTCTTGTCTCCATATTGCAAATGTTTTGATCTCAATGTACTGTAAATGTGGAGCTATTAACTTAGCATTATACATATTTGAAGCCATGGAAGTCAAAGACATTGTTTC
ATGGAATTCCATGATCGCAGGTTACGCCCAACACGGACTTTCTCTACAAGCCATCAATCTTTTTAAAGCAATGAGGAAGCAGAAGCAAGTGGAAGCCGATGCCATTACTT
TCCTTGGTGTTCTGTCCTCATGTAGACATGCAGGGCTTGTGGAAGAGGGCAGATACTACTTCAATCTTATGGTCGAGCTTGGTTTGAAACCGGAGTTGGATCATTATTCA
TGTGTTATCGATTTGCTTGGCCGAGCTGGACTACTCAAAGAGGCTGAAAACTTCATTAAGAAGATGCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGC
TTGCAGGCTTCATGGGAATGTTTGGATAGGATTGAAGGCTGCAGAGAGTAGATTGCTGCTGCAACCCAATTGCGCATCGACACACTTGCAATTGGCTAATCTGTATGCAA
AGGCAGGATACTTGGATGATGTTGCAAGGTTGAGGAAGATGATGAAAGACAAAGGACTGAAGACTGCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGA
TTCAAAGCAGAAGATAAGTCAAACCCTGTAATGGTTGAGATTTTTGGTCTTATGGATAGCATAGTGAATCACATGAGATCTGTAGGCTGTGTTCATGAAGTGGACGAAGT
TGATGATGTTTTACTACCAACATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGTAAGCATGTGTAACTCAAAAGTTGAAAATGAAAGAATGGTCATGAAGATGGAGATACCCAGTAAAGAACATGATTGTCAACCTTCATTTTCCAATGTTTATGA
ACTACGAGGAGAGCCTGCTATAGTAATTAATGGGGTGCCTGATATACCTACTTGTGACAATGCCCTTGCCCTTTGTAATTCTCTGAATGATGAAAAATTACTTGGAAGTA
CAGGCTTTGGTGAGTGGTTAGAAGGGAGAGTTGTTAACAAAATGTTTGGCGATCGTTATTACTATGGTGTCATAATCGAGTACGACAAAGTTACAGGATGGTATAGAGTG
GAGTATGAAGATGGAGATTTCGAAGATCTCGATTGGCATGCACTCGAACAAGTGCTTTTGCCGATGGACATTACAGTTCCATTAAAAGCCTTAGCATTGAAGACTCTGAA
GAGAAGCAGGAAGGCGCGGAAGAACAGGAAAAACAAGACGGGAAACAGGCAAGGTGAAACCAAAGAAATGGAAGGCAGGAGAAAGAAGAGTGTGGGAATTCGTCCTCTTC
ATTTGCTCTTTGCGATATCTCAATCTTCTGCACTTCTTTATAGACATGGCTTCCGCATAGTGGCCTCCATTAGGCTTTTCTCTAACTTCAAACCTAAGTTCCATTCGACC
ACGAACTTACCTAAACCTCCCAGACTCTTGGAGCTCATTTCTCCAAAGGGAAATGTCGCCTCTGAAAGTCGCCAAACTCATCTTCGGCTCATTCAGGACTTTTTACGAAC
AGATTCGGATCAATTTCGATCTCAAACCCTTACAGACGGTTTTGATTCTCATTCAGTTGTTTTAACCAAGGATTCGTCCTTTGTTCTTGATCAAGAACGTGAGTCTGGTC
ACTGGGATCTTCAGTTGTTCGCAGGAAGATTTAAATTTGATGCGAACGATATATCCAGCGGTTTGAGTTTGTGCAGTTCTCAGCGCAATCTTCGTGGTGGAATTCAGTAT
CATTCTGTGGCGATACGAACTGGGTTTATTGCCAATGTGTATGTAGGAAGTTCGTTGGTAAGTTTGTACGGGAAATGCGGGGAGTTGAGTAATGCATATCAAATGTTTGA
AGAAATGCCTGTGAGAAATGTTGTATCATGGACAGCCATTATTGCTGGGTTTGCTGTAGAATGGCAAGTTGATATGTGCTTGGAGCTTTTCCAACAGATGAAAATAATGG
CATTGCAACCAAATGAGTTTACTTTTACTACTATATTGAGCGCTTGCACTGGCAGTGGAGCCCTTGGAGTAGGAAGAAGCCTCCACTGTCAAACATTCAAAATGGGCCTT
GATTCTTGTCTCCATATTGCAAATGTTTTGATCTCAATGTACTGTAAATGTGGAGCTATTAACTTAGCATTATACATATTTGAAGCCATGGAAGTCAAAGACATTGTTTC
ATGGAATTCCATGATCGCAGGTTACGCCCAACACGGACTTTCTCTACAAGCCATCAATCTTTTTAAAGCAATGAGGAAGCAGAAGCAAGTGGAAGCCGATGCCATTACTT
TCCTTGGTGTTCTGTCCTCATGTAGACATGCAGGGCTTGTGGAAGAGGGCAGATACTACTTCAATCTTATGGTCGAGCTTGGTTTGAAACCGGAGTTGGATCATTATTCA
TGTGTTATCGATTTGCTTGGCCGAGCTGGACTACTCAAAGAGGCTGAAAACTTCATTAAGAAGATGCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGC
TTGCAGGCTTCATGGGAATGTTTGGATAGGATTGAAGGCTGCAGAGAGTAGATTGCTGCTGCAACCCAATTGCGCATCGACACACTTGCAATTGGCTAATCTGTATGCAA
AGGCAGGATACTTGGATGATGTTGCAAGGTTGAGGAAGATGATGAAAGACAAAGGACTGAAGACTGCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGA
TTCAAAGCAGAAGATAAGTCAAACCCTGTAATGGTTGAGATTTTTGGTCTTATGGATAGCATAGTGAATCACATGAGATCTGTAGGCTGTGTTCATGAAGTGGACGAAGT
TGATGATGTTTTACTACCAACATCCTGA
Protein sequenceShow/hide protein sequence
MLVSMCNSKVENERMVMKMEIPSKEHDCQPSFSNVYELRGEPAIVINGVPDIPTCDNALALCNSLNDEKLLGSTGFGEWLEGRVVNKMFGDRYYYGVIIEYDKVTGWYRV
EYEDGDFEDLDWHALEQVLLPMDITVPLKALALKTLKRSRKARKNRKNKTGNRQGETKEMEGRRKKSVGIRPLHLLFAISQSSALLYRHGFRIVASIRLFSNFKPKFHST
TNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQFRSQTLTDGFDSHSVVLTKDSSFVLDQERESGHWDLQLFAGRFKFDANDISSGLSLCSSQRNLRGGIQY
HSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDMCLELFQQMKIMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGL
DSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADAITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYS
CVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYR
FKAEDKSNPVMVEIFGLMDSIVNHMRSVGCVHEVDEVDDVLLPTS