; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC04G074300 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC04G074300
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCiama_Chr04:23412662..23419127
RNA-Seq ExpressionCaUC04G074300
SyntenyCaUC04G074300
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603691.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.3e-26586.91Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGFHIV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS V DQERESGHW  QLFAG F+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVD CLELFQ+M+RMALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSLQAI+LFKAMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLD+ ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEV-DEVDDVLLPTS
        G+VNHMRSVGCV EV DE++D LL TS
Subjt:  GIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_022949850.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita moschata]8.1e-26486.53Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGFHIV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS V DQERESGHW  QLFAG F+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVD CLELFQ+M+RMALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        S+IAGYAQHGLSLQAI+LFKAMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+N I+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEVD-EVDDVLLPTS
        G+VNHMRSVGCV EVD E++D LL TS
Subjt:  GIVNHMRSVGCVHEVD-EVDDVLLPTS

XP_022977771.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita maxima]2.8e-26486.53Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH+LF ISQSSALL RHGFHIV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS VLDQERESGHWD QLFAG F+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVD CLELFQ+M+RMALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSL+AI+LF+AMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYL+D ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEV-DEVDDVLLPTS
        G+VNHMRSV CV EV DE++D LL  S
Subjt:  GIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_023544680.1 pentatricopeptide repeat-containing protein At2g37320 [Cucurbita pepo subsp. pepo]1.8e-26386.34Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH LF IS+SSALL RHGFHIV SIRLFS FK     TT+LPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS VLDQERESGHW  QLFAG F+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        +IAGFA EWQVD CLELFQQMKRMAL+PNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSLQAI+LF+AMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKT+PGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEV-DEVDDVLLPTS
        G+VNHMRSVGCV EV DE+++ LL TS
Subjt:  GIVNHMRSVGCVHEV-DEVDDVLLPTS

XP_038881286.1 pentatricopeptide repeat-containing protein At2g37320 isoform X1 [Benincasa hispida]1.1e-26887.64Show/hide
Query:  RPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTK
        R L ++F ISQSSALLYRHGFH+V SIRLFSNFKPK  S+TNLPKPP+LL+LISPKGN A+E+RQTHLRLIQDFL+TDSDQCRSQTL+DGFDSHS+V +K
Subjt:  RPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTK

Query:  GSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAI
         SS VL QERESGHWDLQLFAG FKFDANDISS LSLCSSQ NLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQ+FEEMPVRNVVSWTAI
Subjt:  GSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAI

Query:  IAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNS
        I+GFAVEW VD CL+LFQQMKRMALQPNEFTF TILSACTGSGALG+GRSLHCQTFKMG DS LH+AN LISMYCKCGA+N ALYIFEAMEVKDIVSWNS
Subjt:  IAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNS

Query:  MIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIV
        MIAGYAQHGLSLQAI+L+  MRKQKQVEAD ITFLGVLSSCRHAGLVEEG+YYFNLMVELG+KPELDHY+CVIDLLGRAGLLKEA+NFI++MPISPNSIV
Subjt:  MIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIV

Query:  WGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDG
        WGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQ+ANLYAKAGYL+D ARLRKMMKDKGLKT PGYSWIEIQNKVYRFKAEDKSNPVMVEI GLMDG
Subjt:  WGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDG

Query:  IVNHMRSVGCVHEV-DEVDDVLLPTS
        I+NHMR +G  HEV DEVDD+ L TS
Subjt:  IVNHMRSVGCVHEV-DEVDDVLLPTS

TrEMBL top hitse value%identityAlignment
A0A0A0KX36 Uncharacterized protein2.5e-25886.19Show/hide
Query:  LHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGS
        LH+LF ISQSSA  YRHGF+++ SIR FSNFK   H TTNLPKPPRLL+LISPKG+V+ ESRQTHLRLIQDFL+TDS QCRSQTL  G DS S+ L+K S
Subjt:  LHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGS

Query:  SFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA
        SFVLDQE ESGHWD+Q FAG FKF+ANDISS LSLC+SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAY++F+EMPVRNVVSWTAIIA
Subjt:  SFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA

Query:  GFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI
        GFAVEWQV+ CLELFQ+MKRMALQPNEFTF TIL+ACTGSGALGVGRSLHCQT KMG  S LH+AN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMI
Subjt:  GFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI

Query:  AGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG
        AGYAQHGLSL+AI+LFKAMRKQKQVEAD ITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPI+PNSIVWG
Subjt:  AGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG

Query:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIV
        SLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL NLYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGL+DG+V
Subjt:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIV

Query:  NHMRSVGCVHEVDE
        NHMR VGC HE+++
Subjt:  NHMRSVGCVHEVDE

A0A1S3BGK7 pentatricopeptide repeat-containing protein At2g373201.5e-26086.45Show/hide
Query:  LHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGS
        LH+LFAISQSSA  YRHGF++V S+R FSNFK   H TTNLPKP RLL+LISPKG+V+ ESRQTHLRLIQDFL+TD DQCRSQTL+ GFDS SV L+K S
Subjt:  LHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGS

Query:  SFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA
        SFVLDQE ESGHWD+Q FAG FKF ANDISS LSLC+SQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMF+EMPVRNVVSWTAIIA
Subjt:  SFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIA

Query:  GFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI
        GFAVEWQV+ CLELF++MKRMALQPNEFTF TIL+ACTGSGALGVGRSLHCQTFKMG  S LHIAN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMI
Subjt:  GFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMI

Query:  AGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG
        AGYAQHGLS +AI+LFKAMRKQKQVEAD ITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMP+SPNSI+WG
Subjt:  AGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWG

Query:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIV
        SLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL  LYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGLMD +V
Subjt:  SLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIV

Query:  NHMRSVGCVHEV-DEVDDVLLPTS
        NHMR VG  HE+ DEVDDVLL TS
Subjt:  NHMRSVGCVHEV-DEVDDVLLPTS

A0A5A7U1F7 Pentatricopeptide repeat-containing protein3.8e-25986.4Show/hide
Query:  LLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSF
        +LFAISQSSA  YRHGF++V S+R FSNFK   H TTNLPKP RLL+LISPKG+V+ ESRQTHLRLIQDFL+TD DQCRSQTL+ GFDS SV L+K SSF
Subjt:  LLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSF

Query:  VLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGF
        VLDQE ESGHWD+Q FAG FKF ANDISS LSLC+SQRNLRGG+QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMF+EMPVRNVVSWTAIIAGF
Subjt:  VLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGF

Query:  AVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAG
        AVEWQV+ CLELF++MKRMALQPNEFTF TIL+ACTGSGALGVGRSLHCQTFKMG  S LHIAN LISMYCKCGA+N ALYIFEAMEVKD VSWNSMIAG
Subjt:  AVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAG

Query:  YAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSL
        YAQHGLS +AI+LFKAMRKQKQVEAD ITFLGVLSSCRHAG VEEGR+YFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+NFI+KMP+SPNSI+WGSL
Subjt:  YAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSL

Query:  LSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNH
        LSACRLHGNVWIGLKAAESRLLLQP+CASTHLQL  LYAKAGYLDD ARLRK+MKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNP+MVEIFGLMD +VNH
Subjt:  LSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNH

Query:  MRSVGCVHEV-DEVDDVLLPTS
        MR VG  HE+ DEVDDVLL TS
Subjt:  MRSVGCVHEV-DEVDDVLLPTS

A0A6J1GDY6 pentatricopeptide repeat-containing protein At2g373203.9e-26486.53Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH+LF IS SSALL RHGFHIV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFLRTDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS V DQERESGHW  QLFAG F+FDANDISS LSLC SQRN RGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVD CLELFQ+M+RMALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        S+IAGYAQHGLSLQAI+LFKAMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEA+N I+ MPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYLDD ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEVD-EVDDVLLPTS
        G+VNHMRSVGCV EVD E++D LL TS
Subjt:  GIVNHMRSVGCVHEVD-EVDDVLLPTS

A0A6J1IKW6 pentatricopeptide repeat-containing protein At2g373201.3e-26486.53Show/hide
Query:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT
        +R LH+LF ISQSSALL RHGFHIV SIRLFS FK     TTNLPKPPRLL+LISPKGN ASESRQTHLRLI+DFL+TDSDQCRSQTL+DGFDS SV L+
Subjt:  IRPLHLLFAISQSSALLYRHGFHIVASIRLFSNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLT

Query:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA
        K SS VLDQERESGHWD QLFAG F+FDANDISS LSLC SQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKC E+++A Q+F+EMPVRNVVSWTA
Subjt:  KGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTA

Query:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN
        IIAGFA EWQVD CLELFQ+M+RMALQPNEFTF TILSACTGSGALGVGRSLHCQTFKMG DS +HIAN LISMYCKCGA+N A+Y+FEAMEVKD VSWN
Subjt:  IIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWN

Query:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI
        SMIAGYAQHGLSL+AI+LF+AMRKQ+QVEADGITFLGVLSSCRH GLVEEGRYYFNLMVEL LKPELDHYSCVIDLLGRAGLLKEA+NFI+KMPISPNSI
Subjt:  SMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSI

Query:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD
        VWGSLLSACRLHGNVWIGLKAAESRLLLQP+CASTHLQLANLYA+AGYL+D ARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVM+EIFG+MD
Subjt:  VWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMD

Query:  GIVNHMRSVGCVHEV-DEVDDVLLPTS
        G+VNHMRSV CV EV DE++D LL  S
Subjt:  GIVNHMRSVGCVHEV-DEVDDVLLPTS

SwissProt top hitse value%identityAlignment
Q7XJN6 Pentatricopeptide repeat-containing protein At2g407201.6e-8139.84Show/hide
Query:  SFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKR
        S K D++ ++S  + C+    LR G+Q H   I+TG + NV+VGSSL+ LY KCG    A ++F  M   N+V+W ++I+ ++     +  ++LF  M  
Subjt:  SFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKR

Query:  MALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMR
          + P+  + T++L A + + +L  G+SLH  T ++G+ S  H+ N LI MY KCG    A  IF+ M+ K +++WN MI GY  HG  + A++LF  M+
Subjt:  MALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMR

Query:  KQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAA
        K  +   D +TFL ++S+C H+G VEEG+  F  M  + G++P ++HY+ ++DLLGRAGLL+EA +FIK MPI  +S +W  LLSA R H NV +G+ +A
Subjt:  KQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAA

Query:  ESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM
        E  L ++P   ST++QL NLY +AG  ++ A+L  +MK+KGL   PG SWIE+ ++   F +   S+P+  EIF +++ + ++M
Subjt:  ESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM

Q9LTF4 Putative pentatricopeptide repeat-containing protein At5g526303.4e-7937.5Show/hide
Query:  QLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELF
        ++ AG+ + D + + S    C+       G   H ++++TG+ A+V+VGSSLV +Y KCGE+  A +MF+EMP RNVV+W+ ++ G+A   + +  L LF
Subjt:  QLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELF

Query:  QQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL
        ++     L  N+++F++++S C  S  L +GR +H  + K   DS   + + L+S+Y KCG    A  +F  + VK++  WN+M+  YAQH  + + I L
Subjt:  QQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL

Query:  FKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG
        FK M K   ++ + ITFL VL++C HAGLV+EGRYYF+ M E  ++P   HY+ ++D+LGRAG L+EA   I  MPI P   VWG+LL++C +H N  + 
Subjt:  FKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG

Query:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCVHEVDEV
          AA+    L P  +  H+ L+N YA  G  +D A+ RK+++D+G K   G SW+E +NKV+ F A ++ +    EI+  +  +   M   G + +   V
Subjt:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCVHEVDEV

Q9SY02 Pentatricopeptide repeat-containing protein At4g027502.8e-8141.6Show/hide
Query:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD
        NV   +++++ Y +CG++S A  +F++MP R+ VSW A+IAG++        L LF QM+R   + N  +F++ LS C    AL +G+ LH +  K G +
Subjt:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD

Query:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMV-EL
        +   + N L+ MYCKCG+I  A  +F+ M  KDIVSWN+MIAGY++HG    A+  F++M K++ ++ D  T + VLS+C H GLV++GR YF  M  + 
Subjt:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMV-EL

Query:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD
        G+ P   HY+C++DLLGRAGLL++A N +K MP  P++ +WG+LL A R+HGN  +   AA+    ++P  +  ++ L+NLYA +G   DV +LR  M+D
Subjt:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD

Query:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCV-------HEVDE
        KG+K  PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G V       H+V+E
Subjt:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCV-------HEVDE

Q9ZUT4 Pentatricopeptide repeat-containing protein At2g373207.1e-14655.68Show/hide
Query:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGG
        R+L++IS K    S +RQ H   +Q+F +TDS + R Q +++ FD        G S VL++          +    + FDA  +SS +  C   R+ R G
Subjt:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGG

Query:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGV
          +H +A++ GFI++VY+GSSLV LY   GE+ NAY++FEEMP RNVVSWTA+I+GFA EW+VD CL+L+ +M++    PN++TFT +LSACTGSGALG 
Subjt:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV
        GRS+HCQT  MGL S LHI+N LISMYCKCG +  A  IF+    KD+VSWNSMIAGYAQHGL++QAI LF+ M  +   + D IT+LGVLSSCRHAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY
        +EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   I+ MP+ PNS++WGSLL +CR+HG+VW G++AAE RL+L+P+CA+TH+QLANLYA  GY
Subjt:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY

Query:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM
          + A +RK+MKDKGLKT PG SWIEI N V+ FKAED SN  M+EI  ++  +++HM
Subjt:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276108.9e-8037.74Show/hide
Query:  QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGA-LGV
        + H+  ++T +  +  VG++L+  Y K G++  A ++F  +  +++V+W+A++AG+A   + +  +++F ++ +  ++PNEFTF++IL+ C  + A +G 
Subjt:  QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGA-LGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV
        G+  H    K  LDS L +++ L++MY K G I  A  +F+    KD+VSWNSMI+GYAQHG +++A+++FK M+K+K V+ DG+TF+GV ++C HAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAG
        EEG  YF++MV +  + P  +H SC++DL  RAG L++A   I+ MP    S +W ++L+ACR+H    +G  AAE  + ++P  ++ ++ L+N+YA++G
Subjt:  EEGRYYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAG

Query:  YLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVG
           + A++RK+M ++ +K  PGYSWIE++NK Y F A D+S+P+  +I+  ++ +   ++ +G
Subjt:  YLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVG

Arabidopsis top hitse value%identityAlignment
AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.3e-8137.74Show/hide
Query:  QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGA-LGV
        + H+  ++T +  +  VG++L+  Y K G++  A ++F  +  +++V+W+A++AG+A   + +  +++F ++ +  ++PNEFTF++IL+ C  + A +G 
Subjt:  QYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGA-LGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV
        G+  H    K  LDS L +++ L++MY K G I  A  +F+    KD+VSWNSMI+GYAQHG +++A+++FK M+K+K V+ DG+TF+GV ++C HAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAG
        EEG  YF++MV +  + P  +H SC++DL  RAG L++A   I+ MP    S +W ++L+ACR+H    +G  AAE  + ++P  ++ ++ L+N+YA++G
Subjt:  EEGRYYFNLMV-ELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAG

Query:  YLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVG
           + A++RK+M ++ +K  PGYSWIE++NK Y F A D+S+P+  +I+  ++ +   ++ +G
Subjt:  YLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVG

AT2G37320.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-14755.68Show/hide
Query:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGG
        R+L++IS K    S +RQ H   +Q+F +TDS + R Q +++ FD        G S VL++          +    + FDA  +SS +  C   R+ R G
Subjt:  RLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGLSLCSSQRNLRGG

Query:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGV
          +H +A++ GFI++VY+GSSLV LY   GE+ NAY++FEEMP RNVVSWTA+I+GFA EW+VD CL+L+ +M++    PN++TFT +LSACTGSGALG 
Subjt:  IQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGV

Query:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV
        GRS+HCQT  MGL S LHI+N LISMYCKCG +  A  IF+    KD+VSWNSMIAGYAQHGL++QAI LF+ M  +   + D IT+LGVLSSCRHAGLV
Subjt:  GRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLV

Query:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY
        +EGR +FNLM E GLKPEL+HYSC++DLLGR GLL+EA   I+ MP+ PNS++WGSLL +CR+HG+VW G++AAE RL+L+P+CA+TH+QLANLYA  GY
Subjt:  EEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGY

Query:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM
          + A +RK+MKDKGLKT PG SWIEI N V+ FKAED SN  M+EI  ++  +++HM
Subjt:  LDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM

AT2G40720.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-8239.84Show/hide
Query:  SFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKR
        S K D++ ++S  + C+    LR G+Q H   I+TG + NV+VGSSL+ LY KCG    A ++F  M   N+V+W ++I+ ++     +  ++LF  M  
Subjt:  SFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKR

Query:  MALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMR
          + P+  + T++L A + + +L  G+SLH  T ++G+ S  H+ N LI MY KCG    A  IF+ M+ K +++WN MI GY  HG  + A++LF  M+
Subjt:  MALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMR

Query:  KQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAA
        K  +   D +TFL ++S+C H+G VEEG+  F  M  + G++P ++HY+ ++DLLGRAGLL+EA +FIK MPI  +S +W  LLSA R H NV +G+ +A
Subjt:  KQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLM-VELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAA

Query:  ESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM
        E  L ++P   ST++QL NLY +AG  ++ A+L  +MK+KGL   PG SWIE+ ++   F +   S+P+  EIF +++ + ++M
Subjt:  ESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHM

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-8241.6Show/hide
Query:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD
        NV   +++++ Y +CG++S A  +F++MP R+ VSW A+IAG++        L LF QM+R   + N  +F++ LS C    AL +G+ LH +  K G +
Subjt:  NVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLD

Query:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMV-EL
        +   + N L+ MYCKCG+I  A  +F+ M  KDIVSWN+MIAGY++HG    A+  F++M K++ ++ D  T + VLS+C H GLV++GR YF  M  + 
Subjt:  SCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMV-EL

Query:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD
        G+ P   HY+C++DLLGRAGLL++A N +K MP  P++ +WG+LL A R+HGN  +   AA+    ++P  +  ++ L+NLYA +G   DV +LR  M+D
Subjt:  GLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKD

Query:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCV-------HEVDE
        KG+K  PGYSWIEIQNK + F   D+ +P   EIF  ++ +   M+  G V       H+V+E
Subjt:  KGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCV-------HEVDE

AT5G52630.1 mitochondrial RNAediting factor 12.4e-8037.5Show/hide
Query:  QLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELF
        ++ AG+ + D + + S    C+       G   H ++++TG+ A+V+VGSSLV +Y KCGE+  A +MF+EMP RNVV+W+ ++ G+A   + +  L LF
Subjt:  QLFAGSFKFDANDISSGLSLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELF

Query:  QQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL
        ++     L  N+++F++++S C  S  L +GR +H  + K   DS   + + L+S+Y KCG    A  +F  + VK++  WN+M+  YAQH  + + I L
Subjt:  QQMKRMALQPNEFTFTTILSACTGSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINL

Query:  FKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG
        FK M K   ++ + ITFL VL++C HAGLV+EGRYYF+ M E  ++P   HY+ ++D+LGRAG L+EA   I  MPI P   VWG+LL++C +H N  + 
Subjt:  FKAMRKQKQVEADGITFLGVLSSCRHAGLVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIG

Query:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCVHEVDEV
          AA+    L P  +  H+ L+N YA  G  +D A+ RK+++D+G K   G SW+E +NKV+ F A ++ +    EI+  +  +   M   G + +   V
Subjt:  LKAAESRLLLQPNCASTHLQLANLYAKAGYLDDVARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCVHEVDEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGTAAGCATGTGTAACTCAAAAGTTGAAAATGAAAGAATGGTCATGAAGATGGAGATACCCAGTAAAGAACATGATTGTCAACCTTCATTTTCCAATGTT
TATGAACTACGAGGAGAGCCTGCTATAGTAATTAATGGGGTGCCTGATATACCTACTTGTGACAATGCCCTTGCCCTTTGTAATTCTCTGAATGATGAAAAATTA
CTTGGAAGTACAGGCTTTGGTGAGTGGTTAGAAGGGAGAGTTGTTAACAAAATGTTTGGCGATCGGTATTACTATGGCGTCATAATCGAGTACGACAAAGTTACA
GGATGGTATAGAGTGGAGTATGAAGATGGAGATTTCGAAGATCTCGATTGGCATGCACTCGAACAAGTGCTTTTGCCGATGGACATTACAGTTCCATTAAAAGCC
TTAGCATTGAAGACTCTGAAGAGAAGCAGGAAGGCGCGGAAGAACAGGAAAAACAAGACGGGAAACAGGCGAGGTGAAACCAAAGAAATGGAAGGCAGGAGGAAG
AAGAGTGTGGGAATTCGTCCTCTTCATTTGCTCTTTGCGATATCTCAATCTTCTGCACTTCTTTATAGACATGGCTTCCACATAGTGGCCTCCATTAGGCTTTTC
TCTAACTTCAAACCTAAGTTCCATTCGACCACGAACTTACCTAAACCTCCCAGACTCTTGGAGCTCATTTCTCCAAAGGGAAATGTCGCCTCTGAAAGTCGCCAA
ACTCATCTTCGGCTCATTCAGGACTTTTTACGAACAGATTCGGATCAATGTCGATCTCAAACCCTTACAGACGGTTTTGATTCTCATTCAGTTGTTTTAACCAAG
GGTTCATCCTTTGTTCTTGATCAAGAACGTGAGTCTGGTCACTGGGATCTTCAGTTGTTCGCAGGAAGCTTTAAATTTGATGCGAACGATATATCCAGCGGTTTG
AGTTTGTGCAGTTCTCAGCGCAATCTTCGTGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTGCCAATGTGTATGTAGGAAGTTCGTTGGTA
AGTTTGTACGGGAAATGCGGGGAGTTGAGTAATGCATATCAAATGTTTGAAGAAATGCCTGTGAGAAATGTTGTGTCATGGACAGCCATTATTGCTGGGTTTGCT
GTAGAATGGCAAGTTGATACGTGCTTGGAGCTTTTCCAACAGATGAAAAGAATGGCATTGCAACCAAATGAGTTTACTTTTACTACTATATTGAGCGCTTGCACT
GGCAGTGGAGCCCTTGGAGTAGGAAGAAGCCTCCACTGTCAAACATTCAAAATGGGCCTTGATTCTTGTCTCCATATTGCAAATGTTTTGATCTCAATGTACTGT
AAATGTGGAGCTATTAACTTAGCATTATACATATTTGAAGCCATGGAAGTCAAAGACATTGTTTCATGGAATTCCATGATCGCAGGTTACGCCCAACACGGACTT
TCTCTACAAGCCATCAATCTTTTTAAAGCAATGAGGAAGCAGAAGCAAGTGGAAGCCGATGGCATCACTTTCCTTGGTGTTCTGTCCTCATGTAGACATGCAGGG
CTTGTGGAAGAGGGCAGATACTACTTCAATCTTATGGTCGAGCTTGGTTTGAAACCGGAGTTGGATCATTATTCATGTGTTATCGATTTGCTTGGCCGAGCTGGA
CTACTCAAAGAGGCTGAAAACTTCATTAAGAAGATGCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGCTTGCAGGCTTCATGGGAATGTTTGG
ATAGGATTGAAGGCTGCAGAGAGTAGATTGTTGCTGCAACCCAATTGCGCATCGACACACTTGCAATTGGCTAATCTGTATGCAAAGGCAGGGTACTTGGATGAT
GTTGCAAGGTTGAGGAAGATGATGAAAGACAAAGGACTGAAGACTGCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGATTCAAAGCAGAAGAT
AAGTCAAACCCTGTAATGGTTGAGATTTTTGGTCTTATGGATGGCATAGTGAATCACATGAGATCTGTAGGCTGTGTTCATGAAGTGGACGAAGTTGATGATGTT
TTACTACCAACATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGTAAGCATGTGTAACTCAAAAGTTGAAAATGAAAGAATGGTCATGAAGATGGAGATACCCAGTAAAGAACATGATTGTCAACCTTCATTTTCCAATGTT
TATGAACTACGAGGAGAGCCTGCTATAGTAATTAATGGGGTGCCTGATATACCTACTTGTGACAATGCCCTTGCCCTTTGTAATTCTCTGAATGATGAAAAATTA
CTTGGAAGTACAGGCTTTGGTGAGTGGTTAGAAGGGAGAGTTGTTAACAAAATGTTTGGCGATCGGTATTACTATGGCGTCATAATCGAGTACGACAAAGTTACA
GGATGGTATAGAGTGGAGTATGAAGATGGAGATTTCGAAGATCTCGATTGGCATGCACTCGAACAAGTGCTTTTGCCGATGGACATTACAGTTCCATTAAAAGCC
TTAGCATTGAAGACTCTGAAGAGAAGCAGGAAGGCGCGGAAGAACAGGAAAAACAAGACGGGAAACAGGCGAGGTGAAACCAAAGAAATGGAAGGCAGGAGGAAG
AAGAGTGTGGGAATTCGTCCTCTTCATTTGCTCTTTGCGATATCTCAATCTTCTGCACTTCTTTATAGACATGGCTTCCACATAGTGGCCTCCATTAGGCTTTTC
TCTAACTTCAAACCTAAGTTCCATTCGACCACGAACTTACCTAAACCTCCCAGACTCTTGGAGCTCATTTCTCCAAAGGGAAATGTCGCCTCTGAAAGTCGCCAA
ACTCATCTTCGGCTCATTCAGGACTTTTTACGAACAGATTCGGATCAATGTCGATCTCAAACCCTTACAGACGGTTTTGATTCTCATTCAGTTGTTTTAACCAAG
GGTTCATCCTTTGTTCTTGATCAAGAACGTGAGTCTGGTCACTGGGATCTTCAGTTGTTCGCAGGAAGCTTTAAATTTGATGCGAACGATATATCCAGCGGTTTG
AGTTTGTGCAGTTCTCAGCGCAATCTTCGTGGTGGAATTCAGTATCATTCTGTGGCGATACGAACTGGGTTTATTGCCAATGTGTATGTAGGAAGTTCGTTGGTA
AGTTTGTACGGGAAATGCGGGGAGTTGAGTAATGCATATCAAATGTTTGAAGAAATGCCTGTGAGAAATGTTGTGTCATGGACAGCCATTATTGCTGGGTTTGCT
GTAGAATGGCAAGTTGATACGTGCTTGGAGCTTTTCCAACAGATGAAAAGAATGGCATTGCAACCAAATGAGTTTACTTTTACTACTATATTGAGCGCTTGCACT
GGCAGTGGAGCCCTTGGAGTAGGAAGAAGCCTCCACTGTCAAACATTCAAAATGGGCCTTGATTCTTGTCTCCATATTGCAAATGTTTTGATCTCAATGTACTGT
AAATGTGGAGCTATTAACTTAGCATTATACATATTTGAAGCCATGGAAGTCAAAGACATTGTTTCATGGAATTCCATGATCGCAGGTTACGCCCAACACGGACTT
TCTCTACAAGCCATCAATCTTTTTAAAGCAATGAGGAAGCAGAAGCAAGTGGAAGCCGATGGCATCACTTTCCTTGGTGTTCTGTCCTCATGTAGACATGCAGGG
CTTGTGGAAGAGGGCAGATACTACTTCAATCTTATGGTCGAGCTTGGTTTGAAACCGGAGTTGGATCATTATTCATGTGTTATCGATTTGCTTGGCCGAGCTGGA
CTACTCAAAGAGGCTGAAAACTTCATTAAGAAGATGCCCATATCTCCCAATTCAATTGTTTGGGGATCACTTCTCTCTGCTTGCAGGCTTCATGGGAATGTTTGG
ATAGGATTGAAGGCTGCAGAGAGTAGATTGTTGCTGCAACCCAATTGCGCATCGACACACTTGCAATTGGCTAATCTGTATGCAAAGGCAGGGTACTTGGATGAT
GTTGCAAGGTTGAGGAAGATGATGAAAGACAAAGGACTGAAGACTGCTCCTGGATATAGCTGGATTGAGATTCAGAATAAAGTTTACAGATTCAAAGCAGAAGAT
AAGTCAAACCCTGTAATGGTTGAGATTTTTGGTCTTATGGATGGCATAGTGAATCACATGAGATCTGTAGGCTGTGTTCATGAAGTGGACGAAGTTGATGATGTT
TTACTACCAACATCCTGA
Protein sequenceShow/hide protein sequence
MLVSMCNSKVENERMVMKMEIPSKEHDCQPSFSNVYELRGEPAIVINGVPDIPTCDNALALCNSLNDEKLLGSTGFGEWLEGRVVNKMFGDRYYYGVIIEYDKVT
GWYRVEYEDGDFEDLDWHALEQVLLPMDITVPLKALALKTLKRSRKARKNRKNKTGNRRGETKEMEGRRKKSVGIRPLHLLFAISQSSALLYRHGFHIVASIRLF
SNFKPKFHSTTNLPKPPRLLELISPKGNVASESRQTHLRLIQDFLRTDSDQCRSQTLTDGFDSHSVVLTKGSSFVLDQERESGHWDLQLFAGSFKFDANDISSGL
SLCSSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYQMFEEMPVRNVVSWTAIIAGFAVEWQVDTCLELFQQMKRMALQPNEFTFTTILSACT
GSGALGVGRSLHCQTFKMGLDSCLHIANVLISMYCKCGAINLALYIFEAMEVKDIVSWNSMIAGYAQHGLSLQAINLFKAMRKQKQVEADGITFLGVLSSCRHAG
LVEEGRYYFNLMVELGLKPELDHYSCVIDLLGRAGLLKEAENFIKKMPISPNSIVWGSLLSACRLHGNVWIGLKAAESRLLLQPNCASTHLQLANLYAKAGYLDD
VARLRKMMKDKGLKTAPGYSWIEIQNKVYRFKAEDKSNPVMVEIFGLMDGIVNHMRSVGCVHEVDEVDDVLLPTS