; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05904 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05904
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr04:21039938..21042071
RNA-Seq ExpressionCarg05904
SyntenyCarg05904
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033227.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHYHHR
Subjt:  RFKKRHRHYHHR

XP_022964230.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucurbita moschata]0.0e+0095.75Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTKALE+RTCEE EAIRLV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDESIFQSEDVNQSE LEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCS THLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLF+FVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS             KSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMR+IGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHY HR
Subjt:  RFKKRHRHYHHR

XP_022964238.1 pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Cucurbita moschata]0.0e+0095.59Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTKALE+RTCEE EAIRLV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDESIFQSEDVNQSE LEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCS THLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLF+FVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS              SAHDASVYNAVIQGMCLRGKTDLAKKLYTKMR+IGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHY HR
Subjt:  RFKKRHRHYHHR

XP_022990659.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucurbita maxima]0.0e+0094.44Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFH IPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTK LEFRTCEE EAI LV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDE IFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYN MISVCGKENNWVEAERIWRLME NGCSATHLTYSLLVS YVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQN  KPANDTMQAIIGASSREGRWDFALRVFQ+MLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNT+LLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMA+KPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS             KSAHDASVYNAVIQGMCLRGKTDLAKK YTKM EIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHYHHR
Subjt:  RFKKRHRHYHHR

XP_023525691.1 pentatricopeptide repeat-containing protein At3g29290 [Cucurbita pepo subsp. pepo]0.0e+0094.93Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFH IPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTKALEFRTCEE +AIRLV+D+GVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDE IFQSEDVNQSEVLEGEGL SDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNG FDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFN LIN+LGKANEVTLAFSIYNRMK MGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMA+KPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS             KSAHDASVYNA IQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGL+NRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHYHHR
Subjt:  RFKKRHRHYHHR

TrEMBL top hitse value%identityAlignment
A0A1S3C9M1 pentatricopeptide repeat-containing protein At3g29290 isoform X16.2e-25874.88Show/hide
Query:  MRGLLIN--PTLILSNELNYQLHSCYPVSCAHKHFHGI----PELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRT-------CEEVEAIRL
        MRG+L N  PTLIL NE NYQ  S YP     KH        P LKSC+R  I + GN  SM  MS PRLN VV+S + ++FRT       C E EAI L
Subjt:  MRGLLIN--PTLILSNELNYQLHSCYPVSCAHKHFHGI----PELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRT-------CEEVEAIRL

Query:  VIDEGVEESSREWKLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSL
        VIDE   ESSREWKLPPWG++ +QDE+ FQSEDVN  ++LEGE L ++ KV+FLEETD+V+LSKRILILSRKNKVRSA+ELFRSM LAG+LP+ HA NSL
Subjt:  VIDEGVEESSREWKLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSL

Query:  LACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATH
        LACLLRNGLF DGLRIFEFMK N+LSTGHTYSL+LKAVA+ HGFLSALEMF+ WEHKY L QFDAIVYNTMIS+CGK+NNWVEAER WRLME NGC+ATH
Subjt:  LACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATH

Query:  LTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMG
        +TYSLLVS +VRCNQNELAID YVKMVQ+  KP NDTMQAIIGASS+EG+WDFAL VFQDMLKCGL+PNSV+FNALIN+LGKA EVTLAFSIYN MKSMG
Subjt:  LTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMG

Query:  HSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVY
        HSPDVYTW ALLGALYKANRYNDAI LF FVKREEKAQLNIHIYNTIL+ CSKLGLW+RALQILWEME  SGLL+S +SYNIV++ACE A+KPEIAL+VY
Subjt:  HSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVY

Query:  ERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTL
        ERM+HQK TPDTFT LSLIR CIWGSLWDEVELLL+             KS  D SVYN VIQGMCLRGKTDLAKKLYTKMRE  IQ DGKTRALMLQ L
Subjt:  ERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTL

Query:  PKDRAGLKNRLASRFKKRHRHYHHR
        PKD A LKNR AS FKKR R YHHR
Subjt:  PKDRAGLKNRLASRFKKRHRHYHHR

A0A6J1HK85 pentatricopeptide repeat-containing protein At3g29290 isoform X20.0e+0095.59Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTKALE+RTCEE EAIRLV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDESIFQSEDVNQSE LEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCS THLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLF+FVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS              SAHDASVYNAVIQGMCLRGKTDLAKKLYTKMR+IGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHY HR
Subjt:  RFKKRHRHYHHR

A0A6J1HK91 pentatricopeptide repeat-containing protein At3g29290 isoform X10.0e+0095.75Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTKALE+RTCEE EAIRLV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDESIFQSEDVNQSE LEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCS THLTYSLLVSMYVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLF+FVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS             KSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMR+IGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHY HR
Subjt:  RFKKRHRHYHHR

A0A6J1JQP6 pentatricopeptide repeat-containing protein At3g29290 isoform X20.0e+0094.28Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFH IPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTK LEFRTCEE EAI LV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDE IFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYN MISVCGKENNWVEAERIWRLME NGCSATHLTYSLLVS YVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQN  KPANDTMQAIIGASSREGRWDFALRVFQ+MLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNT+LLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMA+KPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS              SAHDASVYNAVIQGMCLRGKTDLAKK YTKM EIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHYHHR
Subjt:  RFKKRHRHYHHR

A0A6J1JSM4 pentatricopeptide repeat-containing protein At3g29290 isoform X10.0e+0094.44Show/hide
Query:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW
        MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFH IPELKSCIRRRITHGGN ASMSSMSIPRLNFVVRSTK LEFRTCEE EAI LV+DEGVEESSREW
Subjt:  MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREW

Query:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
        K PPWGEVKNQDE IFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG
Subjt:  KLPPWGEVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDG

Query:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC
        LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEH+YDLKQFDAIVYN MISVCGKENNWVEAERIWRLME NGCSATHLTYSLLVS YVRC
Subjt:  LRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRC

Query:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
        NQNELAIDIYVKMVQN  KPANDTMQAIIGASSREGRWDFALRVFQ+MLKCGLEPNSVAFNALIN+LGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG
Subjt:  NQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLG

Query:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF
        ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNT+LLSCSKLGLWDRALQILWEMEAASG LVSASSYNIVISACEMA+KPEIALRVYERMIHQKLTPDTF
Subjt:  ALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTF

Query:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
        TLLSLIRSCIWGSLWDEVELLLS             KSAHDASVYNAVIQGMCLRGKTDLAKK YTKM EIGIQPDGKTRALMLQTLPKDRAGLKNRLAS
Subjt:  TLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLAS

Query:  RFKKRHRHYHHR
        RFKKRHRHYHHR
Subjt:  RFKKRHRHYHHR

SwissProt top hitse value%identityAlignment
Q84J46 Pentatricopeptide repeat-containing protein At3g292901.1e-13752.48Show/hide
Query:  EVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEF
        +V ++ +S F  E+V     LE +      +++FLEE +E  LSKR+  LSR +KVRSA+ELF SM   GL P+ HA NS L+CLLRNG       +FEF
Subjt:  EVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEF

Query:  MKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNEL
        M+  +  TGHTYSL+LKAVA+  G  SAL MFR  E +   +  FD ++YNT IS+CG+ NN  E ERIWR+M+ +G   T +TYSLLVS++VRC ++EL
Subjt:  MKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNEL

Query:  AIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKA
        A+D+Y +MV N +    D M A+I A ++E +WD AL++FQ MLK G++PN VA N LINSLGKA +V L F +Y+ +KS+GH PD YTW ALL ALYKA
Subjt:  AIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKA

Query:  NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSL
        NRY D ++LF+ ++ E    LN ++YNT ++SC KLG W++A+++L+EME  SGL VS SSYN+VISACE ++K ++AL VYE M  +   P+TFT LSL
Subjt:  NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSL

Query:  IRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK
        +RSCIWGSLWDEVE              +L+K   D S+YNA I GMCLR +   AK+LY KMRE+G++PDGKTRA+MLQ L K
Subjt:  IRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK

Q9ASZ8 Pentatricopeptide repeat-containing protein At1g126201.7e-2624.1Show/hide
Query:  ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTW
        E D V  S  I  L  + +V  A+EL   M   G  P+    N+L+  L  NG   D + + + M          TY  +LK +  +     A+E+ R  
Subjt:  ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTW

Query:  EHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFA
        E +    + DA+ Y+ +I    K+ +   A  ++  ME  G  A  + Y+ L+  +    + +    +   M++  + P      A+I    +EG+   A
Subjt:  EHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFA

Query:  LRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL
          + ++M++ G+ P++V + +LI+   K N++  A  + + M S G  P++ T+  L+    KAN  +D + LF  +         +  YNT++    +L
Subjt:  LRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL

Query:  GLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHD
        G  + A ++  EM  +  +     SY I++       +PE AL ++E++   K+  D      +I      S  D+   L  +        + L+    D
Subjt:  GLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHD

Query:  ASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQ
           YN +I G+C +G    A  L+ KM E G  P+G T  ++++
Subjt:  ASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQ

Q9FRS4 Pentatricopeptide repeat-containing protein At1g086102.0e-2726.67Show/hide
Query:  GLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFM-KSNKLSTGHTYSLILKAVADTH
        GL SD  +    E DE   ++ +  L    K+  A +L   M     +P F + ++L+  L R    D  + I   M  S  +    TY++I+  +    
Subjt:  GLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFM-KSNKLSTGHTYSLILKAVADTH

Query:  GFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAII
           +AL +    +        D I YNT+I       N  +A R W+    NGC    +TY++LV +  R   +  AI++   M      P   T  +++
Subjt:  GFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAII

Query:  GASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIH
          + R G  +    V Q +L  GLE N+V +N L++SL           I N M    + P V T+  L+  L KA   + AI  F +   E+K   +I 
Subjt:  GASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIH

Query:  IYNTILLSCSKLGLWDRALQILWEME---AASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYL
         YNT+L + SK G+ D A+++L  ++      GL+    +YN VI         + AL +Y +M+   + PD  T  SLI      +L +E   +L    
Subjt:  IYNTILLSCSKLGLWDRALQILWEME---AASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYL

Query:  DVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPD
        +        + +    S Y  VIQG+C + + ++A ++   M   G +PD
Subjt:  DVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPD

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028602.1e-2924.48Show/hide
Query:  DEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFR
        D  +++  I +L ++ +V SA  +F  +   G     ++  SL++    +G + + + +F+ M+ +    T  TY++IL    K     +   S +E   
Subjt:  DEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFR

Query:  TWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWD
          + K D    DA  YNT+I+ C + +   EA +++  M+A G S   +TY+ L+ +Y + ++ + A+ +  +MV N   P+  T  ++I A +R+G  D
Subjt:  TWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWD

Query:  FALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCS
         A+ +   M + G +P+   +  L++   +A +V  A SI+  M++ G  P++ T+ A +       ++ + +++F+ +     +  +I  +NT+L    
Subjt:  FALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCS

Query:  KLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLD
        + G+      +  EM+ A G +    ++N +ISA       E A+ VY RM+   +TPD  T  +++ +   G +W++ E +L+   D
Subjt:  KLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLD

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic7.8e-3226.54Show/hide
Query:  ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTG-HTYSLILKAVA----DTHGFLSALEMFRTWEHKYDL
        I +L R+  +   +E+F  M   G+  S  +  +L+    RNG ++  L + + MK+ K+S    TY+ ++ A A    D  G L         E +++ 
Subjt:  ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTG-HTYSLILKAVA----DTHGFLSALEMFRTWEHKYDL

Query:  KQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQD
         Q D + YNT++S C       EAE ++R M   G      TYS LV  + +  + E   D+  +M      P   +   ++ A ++ G    A+ VF  
Subjt:  KQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQD

Query:  MLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA
        M   G  PN+  ++ L+N  G++        ++  MKS    PD  T+  L+    +   + + + LF  +  EE  + ++  Y  I+ +C K GL + A
Subjt:  MLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA

Query:  LQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNA
         +IL  M  A+ ++ S+ +Y  VI A   A   E AL  +  M      P   T  SL+ S   G L  E E +LS  +D  +          +   +NA
Subjt:  LQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNA

Query:  VIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALML
         I+     GK + A K Y  M +    PD +T   +L
Subjt:  VIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALML

Arabidopsis top hitse value%identityAlignment
AT1G08610.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-2826.67Show/hide
Query:  GLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFM-KSNKLSTGHTYSLILKAVADTH
        GL SD  +    E DE   ++ +  L    K+  A +L   M     +P F + ++L+  L R    D  + I   M  S  +    TY++I+  +    
Subjt:  GLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFM-KSNKLSTGHTYSLILKAVADTH

Query:  GFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAII
           +AL +    +        D I YNT+I       N  +A R W+    NGC    +TY++LV +  R   +  AI++   M      P   T  +++
Subjt:  GFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAII

Query:  GASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIH
          + R G  +    V Q +L  GLE N+V +N L++SL           I N M    + P V T+  L+  L KA   + AI  F +   E+K   +I 
Subjt:  GASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIH

Query:  IYNTILLSCSKLGLWDRALQILWEME---AASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYL
         YNT+L + SK G+ D A+++L  ++      GL+    +YN VI         + AL +Y +M+   + PD  T  SLI      +L +E   +L    
Subjt:  IYNTILLSCSKLGLWDRALQILWEME---AASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYL

Query:  DVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPD
        +        + +    S Y  VIQG+C + + ++A ++   M   G +PD
Subjt:  DVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPD

AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-2724.1Show/hide
Query:  ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTW
        E D V  S  I  L  + +V  A+EL   M   G  P+    N+L+  L  NG   D + + + M          TY  +LK +  +     A+E+ R  
Subjt:  ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTW

Query:  EHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFA
        E +    + DA+ Y+ +I    K+ +   A  ++  ME  G  A  + Y+ L+  +    + +    +   M++  + P      A+I    +EG+   A
Subjt:  EHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFA

Query:  LRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL
          + ++M++ G+ P++V + +LI+   K N++  A  + + M S G  P++ T+  L+    KAN  +D + LF  +         +  YNT++    +L
Subjt:  LRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL

Query:  GLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHD
        G  + A ++  EM  +  +     SY I++       +PE AL ++E++   K+  D      +I      S  D+   L  +        + L+    D
Subjt:  GLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHD

Query:  ASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQ
           YN +I G+C +G    A  L+ KM E G  P+G T  ++++
Subjt:  ASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQ

AT1G74850.1 plastid transcriptionally active 25.5e-3326.54Show/hide
Query:  ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTG-HTYSLILKAVA----DTHGFLSALEMFRTWEHKYDL
        I +L R+  +   +E+F  M   G+  S  +  +L+    RNG ++  L + + MK+ K+S    TY+ ++ A A    D  G L         E +++ 
Subjt:  ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTG-HTYSLILKAVA----DTHGFLSALEMFRTWEHKYDL

Query:  KQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQD
         Q D + YNT++S C       EAE ++R M   G      TYS LV  + +  + E   D+  +M      P   +   ++ A ++ G    A+ VF  
Subjt:  KQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQD

Query:  MLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA
        M   G  PN+  ++ L+N  G++        ++  MKS    PD  T+  L+    +   + + + LF  +  EE  + ++  Y  I+ +C K GL + A
Subjt:  MLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA

Query:  LQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNA
         +IL  M  A+ ++ S+ +Y  VI A   A   E AL  +  M      P   T  SL+ S   G L  E E +LS  +D  +          +   +NA
Subjt:  LQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNA

Query:  VIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALML
         I+     GK + A K Y  M +    PD +T   +L
Subjt:  VIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALML

AT3G29290.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-13952.48Show/hide
Query:  EVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEF
        +V ++ +S F  E+V     LE +      +++FLEE +E  LSKR+  LSR +KVRSA+ELF SM   GL P+ HA NS L+CLLRNG       +FEF
Subjt:  EVKNQDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEF

Query:  MKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNEL
        M+  +  TGHTYSL+LKAVA+  G  SAL MFR  E +   +  FD ++YNT IS+CG+ NN  E ERIWR+M+ +G   T +TYSLLVS++VRC ++EL
Subjt:  MKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHKYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNEL

Query:  AIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKA
        A+D+Y +MV N +    D M A+I A ++E +WD AL++FQ MLK G++PN VA N LINSLGKA +V L F +Y+ +KS+GH PD YTW ALL ALYKA
Subjt:  AIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKA

Query:  NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSL
        NRY D ++LF+ ++ E    LN ++YNT ++SC KLG W++A+++L+EME  SGL VS SSYN+VISACE ++K ++AL VYE M  +   P+TFT LSL
Subjt:  NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSL

Query:  IRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK
        +RSCIWGSLWDEVE              +L+K   D S+YNA I GMCLR +   AK+LY KMRE+G++PDGKTRA+MLQ L K
Subjt:  IRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-3024.48Show/hide
Query:  DEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFR
        D  +++  I +L ++ +V SA  +F  +   G     ++  SL++    +G + + + +F+ M+ +    T  TY++IL    K     +   S +E   
Subjt:  DEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLS-TGHTYSLIL----KAVADTHGFLSALEMFR

Query:  TWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWD
          + K D    DA  YNT+I+ C + +   EA +++  M+A G S   +TY+ L+ +Y + ++ + A+ +  +MV N   P+  T  ++I A +R+G  D
Subjt:  TWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWD

Query:  FALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCS
         A+ +   M + G +P+   +  L++   +A +V  A SI+  M++ G  P++ T+ A +       ++ + +++F+ +     +  +I  +NT+L    
Subjt:  FALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCS

Query:  KLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLD
        + G+      +  EM+ A G +    ++N +ISA       E A+ VY RM+   +TPD  T  +++ +   G +W++ E +L+   D
Subjt:  KLGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGATTGCTTATAAATCCCACTCTGATTTTATCAAATGAGTTGAATTACCAACTTCACTCGTGTTACCCTGTTAGTTGTGCACACAAGCATTTCCATGGTATTCC
AGAATTGAAATCATGTATAAGGCGTAGGATAACTCATGGGGGTAATGAAGCTTCAATGTCGTCGATGAGTATTCCACGATTGAATTTCGTGGTTCGGTCCACAAAAGCCC
TGGAATTTAGGACATGTGAAGAGGTTGAGGCTATTAGATTGGTCATTGATGAAGGAGTCGAAGAATCGTCTCGGGAGTGGAAATTGCCTCCCTGGGGAGAAGTGAAAAAT
CAGGATGAGTCAATCTTTCAATCTGAAGATGTAAACCAATCCGAAGTGTTAGAAGGGGAGGGTTTGGTAAGTGACAGAAAGGTGTATTTTCTTGAGGAAACTGATGAAGT
TATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAGAAGTGCAATGGAATTGTTCAGGTCCATGCATTTAGCAGGTCTTCTGCCAAGTTTTCATGCTT
CAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCTGTTTGATGATGGTTTACGAATCTTCGAGTTTATGAAGTCAAACAAGCTATCAACAGGGCACACTTATAGCCTT
ATACTCAAAGCAGTTGCTGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTTAGGACATGGGAGCACAAATATGACTTAAAACAGTTCGATGCAATTGTTTACAACAC
GATGATATCGGTCTGTGGAAAAGAGAATAACTGGGTTGAAGCTGAGAGAATATGGAGACTAATGGAGGCAAATGGCTGTAGTGCAACACATCTAACTTATTCTCTATTGG
TGAGCATGTACGTCCGCTGCAACCAGAACGAACTTGCGATCGACATTTATGTAAAGATGGTTCAAAATGATTTAAAACCAGCTAATGATACAATGCAAGCTATTATTGGC
GCATCTTCAAGGGAAGGGAGGTGGGATTTTGCTTTAAGAGTCTTTCAAGATATGTTGAAATGTGGACTCGAACCTAATTCCGTTGCATTCAACGCCTTGATCAATTCTCT
AGGAAAAGCTAATGAGGTCACTTTAGCATTCAGCATATACAATAGGATGAAATCTATGGGTCATTCACCTGATGTTTATACATGGAAGGCTCTACTCGGTGCTCTTTACA
AGGCAAATCGCTACAACGATGCTATTCGTCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCTCAATTGAATATACATATTTACAATACCATTCTATTGTCTTGTTCAAAG
CTTGGGTTATGGGATAGGGCTCTCCAAATTTTGTGGGAAATGGAGGCCGCCTCTGGTCTCTTAGTCTCGGCATCATCATATAACATCGTTATTAGTGCATGTGAGATGGC
TAAGAAGCCAGAAATTGCGTTGCGAGTTTACGAACGCATGATTCATCAGAAGCTCACTCCTGATACCTTCACTCTTTTGTCGCTTATCCGAAGCTGCATTTGGGGATCTT
TATGGGATGAAGTGGAACTACTTCTATCTAATTATTTGGACGTATACCTGATAGTTGTTGTTCTTCAGAAGTCTGCACACGACGCATCTGTATACAATGCTGTCATCCAA
GGAATGTGCTTAAGAGGCAAGACTGATTTAGCAAAAAAGCTTTACACGAAGATGCGCGAAATCGGTATCCAACCAGATGGAAAAACACGAGCTTTGATGCTTCAGACGTT
GCCGAAGGATCGTGCTGGACTGAAGAACAGGTTGGCTTCTCGTTTCAAGAAAAGGCACAGACATTATCACCACAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGATTGCTTATAAATCCCACTCTGATTTTATCAAATGAGTTGAATTACCAACTTCACTCGTGTTACCCTGTTAGTTGTGCACACAAGCATTTCCATGGTATTCC
AGAATTGAAATCATGTATAAGGCGTAGGATAACTCATGGGGGTAATGAAGCTTCAATGTCGTCGATGAGTATTCCACGATTGAATTTCGTGGTTCGGTCCACAAAAGCCC
TGGAATTTAGGACATGTGAAGAGGTTGAGGCTATTAGATTGGTCATTGATGAAGGAGTCGAAGAATCGTCTCGGGAGTGGAAATTGCCTCCCTGGGGAGAAGTGAAAAAT
CAGGATGAGTCAATCTTTCAATCTGAAGATGTAAACCAATCCGAAGTGTTAGAAGGGGAGGGTTTGGTAAGTGACAGAAAGGTGTATTTTCTTGAGGAAACTGATGAAGT
TATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAGAAGTGCAATGGAATTGTTCAGGTCCATGCATTTAGCAGGTCTTCTGCCAAGTTTTCATGCTT
CAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCTGTTTGATGATGGTTTACGAATCTTCGAGTTTATGAAGTCAAACAAGCTATCAACAGGGCACACTTATAGCCTT
ATACTCAAAGCAGTTGCTGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTTAGGACATGGGAGCACAAATATGACTTAAAACAGTTCGATGCAATTGTTTACAACAC
GATGATATCGGTCTGTGGAAAAGAGAATAACTGGGTTGAAGCTGAGAGAATATGGAGACTAATGGAGGCAAATGGCTGTAGTGCAACACATCTAACTTATTCTCTATTGG
TGAGCATGTACGTCCGCTGCAACCAGAACGAACTTGCGATCGACATTTATGTAAAGATGGTTCAAAATGATTTAAAACCAGCTAATGATACAATGCAAGCTATTATTGGC
GCATCTTCAAGGGAAGGGAGGTGGGATTTTGCTTTAAGAGTCTTTCAAGATATGTTGAAATGTGGACTCGAACCTAATTCCGTTGCATTCAACGCCTTGATCAATTCTCT
AGGAAAAGCTAATGAGGTCACTTTAGCATTCAGCATATACAATAGGATGAAATCTATGGGTCATTCACCTGATGTTTATACATGGAAGGCTCTACTCGGTGCTCTTTACA
AGGCAAATCGCTACAACGATGCTATTCGTCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCTCAATTGAATATACATATTTACAATACCATTCTATTGTCTTGTTCAAAG
CTTGGGTTATGGGATAGGGCTCTCCAAATTTTGTGGGAAATGGAGGCCGCCTCTGGTCTCTTAGTCTCGGCATCATCATATAACATCGTTATTAGTGCATGTGAGATGGC
TAAGAAGCCAGAAATTGCGTTGCGAGTTTACGAACGCATGATTCATCAGAAGCTCACTCCTGATACCTTCACTCTTTTGTCGCTTATCCGAAGCTGCATTTGGGGATCTT
TATGGGATGAAGTGGAACTACTTCTATCTAATTATTTGGACGTATACCTGATAGTTGTTGTTCTTCAGAAGTCTGCACACGACGCATCTGTATACAATGCTGTCATCCAA
GGAATGTGCTTAAGAGGCAAGACTGATTTAGCAAAAAAGCTTTACACGAAGATGCGCGAAATCGGTATCCAACCAGATGGAAAAACACGAGCTTTGATGCTTCAGACGTT
GCCGAAGGATCGTGCTGGACTGAAGAACAGGTTGGCTTCTCGTTTCAAGAAAAGGCACAGACATTATCACCACAGGTAA
Protein sequenceShow/hide protein sequence
MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHGIPELKSCIRRRITHGGNEASMSSMSIPRLNFVVRSTKALEFRTCEEVEAIRLVIDEGVEESSREWKLPPWGEVKN
QDESIFQSEDVNQSEVLEGEGLVSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSL
ILKAVADTHGFLSALEMFRTWEHKYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIG
ASSREGRWDFALRVFQDMLKCGLEPNSVAFNALINSLGKANEVTLAFSIYNRMKSMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSK
LGLWDRALQILWEMEAASGLLVSASSYNIVISACEMAKKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSNYLDVYLIVVVLQKSAHDASVYNAVIQ
GMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLKNRLASRFKKRHRHYHHR