; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002780 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002780
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationscaffold6:1120906..1131909
RNA-Seq ExpressionSpg002780
SyntenySpg002780
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439307.1 PREDICTED: uncharacterized protein At3g49140-like [Cucumis melo]5.0e-22277.84Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRFPAGANFC SSA+ YHR AW SED +SIG ASSFCRLLHSC SDVHWKRCQRLNSRSL GR+N RK+GIQASAE LGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEELVKSITE A+DVRPT+AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDE GNIYFQ+KNTEE M+NLTSENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ---------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEA
                                                                 DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE 
Subjt:  ---------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEA

Query:  ASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMI
        ASDDPIDWMEQPPATLVIQGVLRPAFSEE TVIQ+HLSSRH SNGDINEAQKLE+N E+H RINH GHESSSS DGLNL D LDE+IP +EASFYRLEMI
Subjt:  ASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMI

Query:  KVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS
        KVQLFTG+SHPS+VEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS
Subjt:  KVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS

Query:  AEKQLNNLLFPRIHSKPQKMKQTHQNEC
        AEKQLN+LLFPRIHSK QKMKQT+QNEC
Subjt:  AEKQLNNLLFPRIHSKPQKMKQTHQNEC

XP_022141265.1 uncharacterized protein At3g49140-like [Momordica charantia]1.5e-22379.05Show/hide
Query:  MIETALAVRFPAGANFCCSSAL-SYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQ
        MIETALAVRFPAGANFC SS   SYHR AWISEDVTSIG  SSFCRLLHSCASDVHWKRCQRLNSR L GRN  R++GIQASAE LGSASDPIKQN  LQ
Subjt:  MIETALAVRFPAGANFCCSSAL-SYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQ

Query:  YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV------------
        YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDE GNIYFQVKNTEETMQNLTSENNFV            
Subjt:  YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV------------

Query:  ------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASD
                                                              DW SVIEDEDD N SDETLGDWAKLETMRSSHPM+FANKLSE ASD
Subjt:  ------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASD

Query:  DPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ
        DPIDWMEQPPATLVIQGVLRPAFSEEH+VIQRHLSSRHSSNGDINEAQK EDN ENH  INH  HESSSS DGLNLADGLD NIP++EASFYRLEMIK+Q
Subjt:  DPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ

Query:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK
        LFTGH+HPSNVE+EDLMKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVING+DSLGFD+RVCSGTQVQTLRFAF TRATSEFSAEK
Subjt:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK

Query:  QLNNLLFPRIHSKPQKMKQTHQNEC
        QLN++LFPRIHSK QK KQTHQNEC
Subjt:  QLNNLLFPRIHSKPQKMKQTHQNEC

XP_022922835.1 uncharacterized protein At3g49140-like [Cucurbita moschata]1.1e-21877.95Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRF  GANFC SSALS HR AW SEDVT IG  +S CRL  SCASDV WKRCQRLNSRSL G+NN +K+GIQASAEHLGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE VKSITE AEDVRPT+AETTRTIIEVN KATLMFAGLINDEVQENIIWP+LPYVTDE GNIYFQVK+TEET+QNL SENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQ
                                                      DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE+ASDDPID ME+
Subjt:  ----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQ

Query:  PPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHP
        PPATL+IQG LRPAFSEEHTVIQRHLSSRHSSNGDI+EAQKLEDN EN  RINH+GHESSSS DGLN+ DGL ENIPVN+ASFYRLEMIKVQL TGH+HP
Subjt:  PPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHP

Query:  SNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFP
        SNVEIEDLMKAQPDAI HSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL++LLFP
Subjt:  SNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFP

Query:  RIHSKPQKMKQTHQNEC
        RIHSKP KMKQTHQNEC
Subjt:  RIHSKPQKMKQTHQNEC

XP_023552311.1 uncharacterized protein At3g49140-like [Cucurbita pepo subsp. pepo]9.7e-21877.8Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRF   ANFC SSALS HR AW SEDVT IG A+S CRL  SCASDV WKRCQRLNSRSL G+NN +K+GIQASAEHLGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE VKSITE AEDVRPT+AE TRTIIEVNSKATLMFAGLINDEVQENIIWP+LPYVTDE GNIYFQVK+TEET+QNL SENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  -----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME
                                                       DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE+ASDDPID ME
Subjt:  -----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME

Query:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH
        +PPATL+IQG LRPAFSEEHTVIQRHLSSRHSSNGDI+EAQKLEDN EN  RINH+GHESSSS DGLN+ DGL ENIPVN+ASFYRLEMIKVQL TGH+H
Subjt:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH

Query:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF
        PSNVEIEDLMKAQPDAI HSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL++LLF
Subjt:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF

Query:  PRIHSKPQKMKQTHQNEC
        PRIHSKP KMKQTHQNEC
Subjt:  PRIHSKPQKMKQTHQNEC

XP_038880477.1 uncharacterized protein At3g49140-like [Benincasa hispida]1.7e-22579.43Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRFPAGANFC SSALSYHR AW SEDV+SI  ASSFCRLLHSC SDVHWKRCQRLNS+SL GRNN +K+GIQASAEHLGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEELVKSITE AEDVRPT+AETTRTIIEVNSKATLMFAGLIN+EVQENIIWPELPYVTDE GNIYFQ+KNTEE MQNLTSENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDP
                                                            DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSEAASDDP
Subjt:  ----------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDP

Query:  IDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKL--EDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ
        IDWMEQPPATLVIQGVLRPAFSEE+TVIQ+HLSSRHS  GDINEAQKL  EDN ENH RINH GHESSSS DG NL D LDENIPV++ASFYRLEMIKVQ
Subjt:  IDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKL--EDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ

Query:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK
        LFTGH+HPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK
Subjt:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK

Query:  QLNNLLFPRIHSKPQKMKQTHQNEC
        QLN LLFPR+HSKPQKMKQTHQNEC
Subjt:  QLNNLLFPRIHSKPQKMKQTHQNEC

TrEMBL top hitse value%identityAlignment
A0A1S3AY35 uncharacterized protein At3g49140-like2.4e-22277.84Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRFPAGANFC SSA+ YHR AW SED +SIG ASSFCRLLHSC SDVHWKRCQRLNSRSL GR+N RK+GIQASAE LGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEELVKSITE A+DVRPT+AETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDE GNIYFQ+KNTEE M+NLTSENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ---------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEA
                                                                 DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE 
Subjt:  ---------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEA

Query:  ASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMI
        ASDDPIDWMEQPPATLVIQGVLRPAFSEE TVIQ+HLSSRH SNGDINEAQKLE+N E+H RINH GHESSSS DGLNL D LDE+IP +EASFYRLEMI
Subjt:  ASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMI

Query:  KVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS
        KVQLFTG+SHPS+VEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS
Subjt:  KVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFS

Query:  AEKQLNNLLFPRIHSKPQKMKQTHQNEC
        AEKQLN+LLFPRIHSK QKMKQT+QNEC
Subjt:  AEKQLNNLLFPRIHSKPQKMKQTHQNEC

A0A6J1CHK0 uncharacterized protein At3g49140-like7.5e-22479.05Show/hide
Query:  MIETALAVRFPAGANFCCSSAL-SYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQ
        MIETALAVRFPAGANFC SS   SYHR AWISEDVTSIG  SSFCRLLHSCASDVHWKRCQRLNSR L GRN  R++GIQASAE LGSASDPIKQN  LQ
Subjt:  MIETALAVRFPAGANFCCSSAL-SYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQ

Query:  YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV------------
        YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDE GNIYFQVKNTEETMQNLTSENNFV            
Subjt:  YHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV------------

Query:  ------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASD
                                                              DW SVIEDEDD N SDETLGDWAKLETMRSSHPM+FANKLSE ASD
Subjt:  ------------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASD

Query:  DPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ
        DPIDWMEQPPATLVIQGVLRPAFSEEH+VIQRHLSSRHSSNGDINEAQK EDN ENH  INH  HESSSS DGLNLADGLD NIP++EASFYRLEMIK+Q
Subjt:  DPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQ

Query:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK
        LFTGH+HPSNVE+EDLMKAQPDAIAHSAEKIISRLRAGGEKT QALKSLCWRCKGIQVEEAVING+DSLGFD+RVCSGTQVQTLRFAF TRATSEFSAEK
Subjt:  LFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEK

Query:  QLNNLLFPRIHSKPQKMKQTHQNEC
        QLN++LFPRIHSK QK KQTHQNEC
Subjt:  QLNNLLFPRIHSKPQKMKQTHQNEC

A0A6J1E9X9 uncharacterized protein At3g49140-like5.6e-21977.95Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRF  GANFC SSALS HR AW SEDVT IG  +S CRL  SCASDV WKRCQRLNSRSL G+NN +K+GIQASAEHLGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE VKSITE AEDVRPT+AETTRTIIEVN KATLMFAGLINDEVQENIIWP+LPYVTDE GNIYFQVK+TEET+QNL SENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQ
                                                      DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE+ASDDPID ME+
Subjt:  ----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQ

Query:  PPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHP
        PPATL+IQG LRPAFSEEHTVIQRHLSSRHSSNGDI+EAQKLEDN EN  RINH+GHESSSS DGLN+ DGL ENIPVN+ASFYRLEMIKVQL TGH+HP
Subjt:  PPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHP

Query:  SNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFP
        SNVEIEDLMKAQPDAI HSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL++LLFP
Subjt:  SNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFP

Query:  RIHSKPQKMKQTHQNEC
        RIHSKP KMKQTHQNEC
Subjt:  RIHSKPQKMKQTHQNEC

A0A6J1HTI8 uncharacterized protein At3g49140-like isoform X12.5e-20375.39Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRFPAGANFC SSALS+HR AW SEDVT+IG AS FCRLLHSCASDV WKRC+ LNS+S   RNN RK+GI ASAEHLGSASDP+KQN+  QY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEELVKS +ENAEDVRPTAAETTRTIIEVNSKATLMF GLINDEVQENIIWPELPYVTDE GNIYFQVKNTEE MQNLTSENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDP
                                                            DW SVIEDEDDPN SDET GDWAKLETMRSSHPMHFA KLSEAASDDP
Subjt:  ----------------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDP

Query:  IDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLF
        IDWMEQPPATLVIQG LRP   EE +VIQRHLSSRHSSN DINEAQKLEDN ENH RI++ GHESSSS       +GLD+NIP+NE SFYRLEM KVQLF
Subjt:  IDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLF

Query:  TGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL
        TGHSHPSNVEIEDLM+AQPDAIAHSAEKIISRLR GGEKTTQALKSLCWRCKGIQVEEAVINGIDS+GFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL
Subjt:  TGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL

Query:  NNLLFPRI
        N+LLF R+
Subjt:  NNLLFPRI

A0A6J1JB77 uncharacterized protein At3g49140-like4.6e-21376.65Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIETALAVRF  GANFC SSALS HR AW SEDVT IG A+S CRL  SCASDV WKRCQRLNSRSL G+NN +K+GIQASAE+LGSASDPIKQN+ LQY
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE VKSITE AEDVRPT+AETTRTIIEVNSKATLMFAGLINDEVQENIIWP+LPYVTDE GNIYFQVK+TEET+QNL SENNFV             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  -----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME
                                                       DW SVI+DEDD N SDETLGDWAKLETMRSSHPMHFANKLSE+ASDDPID ME
Subjt:  -----------------------------------------------DWASVIEDEDDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME

Query:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH
        +PPATL+IQG LRPAFSEEHTVIQRHLSSRHSSNGDI+EAQKLEDN EN  RINH+GHESSSS DGLN+ DGL ENIPV +ASFYRLEMIKVQL TGH+H
Subjt:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH

Query:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF
        PSN+EIEDLMKAQPDAI  +AEKIISRL+AGGEKT QALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQL++LLF
Subjt:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF

Query:  PRIHSKPQKMKQTH
        PRIHSKP KMKQTH
Subjt:  PRIHSKPQKMKQTH

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491404.7e-9842.13Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIE+ +AVR   G  FC S+AL  +R A  SE+  +    +S           +      + NS     R + RK+  QA+AE++ SASDP KQ    +Y
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE+  S+ +N  D R + AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD+ GN+YFQVK  E+ MQ++TSENN+V             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------DWASVIEDE-------------DDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME
                                          +W +++EDE             DD + SDE+LGDWA LETMRS HPM FA +++E AS+DP+DWM+
Subjt:  ----------------------------------DWASVIEDE-------------DDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME

Query:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH
        QP A L IQG+L     E+++ IQ+ L+  +S+     +A+ L D  E++ +      E  SS D              N  +FY+LEMI++QL T    
Subjt:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH

Query:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF
         + VE+ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGFD+R+C+G ++++LRFAF TRATSE +AE Q+  LLF
Subjt:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF

Query:  PRIHSKPQ
        P+ +   Q
Subjt:  PRIHSKPQ

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-9942.13Show/hide
Query:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY
        MIE+ +AVR   G  FC S+AL  +R A  SE+  +    +S           +      + NS     R + RK+  QA+AE++ SASDP KQ    +Y
Subjt:  MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQY

Query:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------
        HPSEE+  S+ +N  D R + AETTRTIIEVN+K TLM  G I D V ENI+WP++PY+TD+ GN+YFQVK  E+ MQ++TSENN+V             
Subjt:  HPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFV-------------

Query:  ----------------------------------DWASVIEDE-------------DDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME
                                          +W +++EDE             DD + SDE+LGDWA LETMRS HPM FA +++E AS+DP+DWM+
Subjt:  ----------------------------------DWASVIEDE-------------DDPNLSDETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWME

Query:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH
        QP A L IQG+L     E+++ IQ+ L+  +S+     +A+ L D  E++ +      E  SS D              N  +FY+LEMI++QL T    
Subjt:  QPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSH

Query:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF
         + VE+ED+ KAQPDAIAH++ +IISRL   G+K T+ALKSLCWR   IQ EE  + GIDSLGFD+R+C+G ++++LRFAF TRATSE +AE Q+  LLF
Subjt:  PSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF

Query:  PRIHSKPQ
        P+ +   Q
Subjt:  PRIHSKPQ

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-4231.89Show/hide
Query:  SDPIKQNKSLQYHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFVD
        SD +  +    YHP E+L  S  +  ++ + +A+E  RT +E NS A L+F G I+ E  ++  W E  YV D+ G+I+F++ + E  +++  + N    
Subjt:  SDPIKQNKSLQYHPSEELVKSITENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFVD

Query:  WASV-------IEDEDDPNLSD----------------------ETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEE
        +  +           ++ N+SD                      +   DW   +T    HP++FA  LS+A S D    M+ P   + I G LRPAF +E
Subjt:  WASV-------IEDEDDPNLSD----------------------ETLGDWAKLETMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEE

Query:  HTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAH
         + I+R   S              ED  +    +    +  +SS          DEN     +S YRLE++ ++L + +   S++ ++D   A+PD + H
Subjt:  HTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAH

Query:  SAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMK
        S   II R    G  ++ ALK+LC + KG+  EEA +  +DSLG DVRV +G QVQT RF F TRAT+E +AEK+++ LLFPR  S+ +K+K
Subjt:  SAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMK

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-9244.59Show/hide
Query:  RKSGIQASAEHLGSASDPIKQNKSLQYHPSEELVKSITE-NAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKN
        R++  QA AE+LGSASDP K      YHPSE++   + E N  D R +  ET RTIIEVN K TLM +GL+   V ENI+WP++PYVTD+ GNIYFQVK 
Subjt:  RKSGIQASAEHLGSASDPIKQNKSLQYHPSEELVKSITE-NAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKN

Query:  TEETMQN-LTSENNFV------------------------------------------------------DWASVIEDEDDPNL----SDETLGDWAKLE
         E+ MQ  +TS+NN+V                                                      +W +V+ED DD +     SDE+LGDWA LE
Subjt:  TEETMQN-LTSENNFV------------------------------------------------------DWASVIEDEDDPNL----SDETLGDWAKLE

Query:  TMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGL
        TMR  HPM+FA +++E AS DP++WM+QP A L IQG+L P   E+H+ IQ+H+S   S+  D N+ +  E++ E  + I            G N ++ L
Subjt:  TMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGL

Query:  DENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQ
              N   +Y+LE+I++QL T   H + VE+ED+ KAQPD IA +++ I++RL   G+K T+AL+SLCWR  GIQ EE  + GIDSLGFD+R+CSG Q
Subjt:  DENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQ

Query:  VQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMKQTHQNE
        ++TLRFAF  RATSE +AE QL  LLF    SKPQK KQT+Q E
Subjt:  VQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMKQTHQNE

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein1.8e-9244.59Show/hide
Query:  RKSGIQASAEHLGSASDPIKQNKSLQYHPSEELVKSITE-NAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKN
        R++  QA AE+LGSASDP K      YHPSE++   + E N  D R +  ET RTIIEVN K TLM +GL+   V ENI+WP++PYVTD+ GNIYFQVK 
Subjt:  RKSGIQASAEHLGSASDPIKQNKSLQYHPSEELVKSITE-NAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKN

Query:  TEETMQN-LTSENNFV------------------------------------------------------DWASVIEDEDDPNL----SDETLGDWAKLE
         E+ MQ  +TS+NN+V                                                      +W +V+ED DD +     SDE+LGDWA LE
Subjt:  TEETMQN-LTSENNFV------------------------------------------------------DWASVIEDEDDPNL----SDETLGDWAKLE

Query:  TMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGL
        TMR  HPM+FA +++E AS DP++WM+QP A L IQG+L P   E+H+ IQ+H+S   S+  D N+ +  E++ E  + I            G N ++ L
Subjt:  TMRSSHPMHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGL

Query:  DENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQ
              N   +Y+LE+I++QL T   H + VE+ED+ KAQPD IA +++ I++RL   G+K T+AL+SLCWR  GIQ EE  + GIDSLGFD+R+CSG Q
Subjt:  DENIPVNEASFYRLEMIKVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQ

Query:  VQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMKQTHQNE
        ++TLRFAF  RATSE +AE QL  LLF    SKPQK KQT+Q E
Subjt:  VQTLRFAFDTRATSEFSAEKQLNNLLFPRIHSKPQKMKQTHQNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGAAACTGCTCTGGCCGTTCGATTCCCCGCCGGCGCAAACTTCTGCTGCTCCTCCGCTTTGTCCTATCACCGTTTAGCTTGGATTTCGGAGGATGTAACTAGTAT
AGGCCAGGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCGCTTCCGACGTGCATTGGAAAAGATGCCAAAGACTTAATAGTAGGTCCCTCTTTGGAAGGAACAATCCTA
GAAAGAGTGGAATTCAAGCATCAGCAGAGCATTTAGGCTCTGCATCGGACCCAATTAAACAAAATAAGAGTCTGCAGTATCATCCATCCGAAGAGCTCGTGAAATCAATA
ACTGAGAATGCTGAGGATGTTAGACCTACTGCCGCAGAAACTACGAGAACAATCATTGAGGTAAATAGCAAAGCAACGTTAATGTTTGCGGGTTTGATTAACGATGAAGT
TCAGGAAAATATTATCTGGCCAGAACTACCTTACGTAACTGACGAACGGGGAAATATTTATTTTCAAGTGAAGAATACTGAAGAGACCATGCAAAACCTAACTTCAGAAA
ATAACTTTGTGGATTGGGCTTCGGTCATAGAGGATGAAGATGATCCAAATCTCTCTGACGAGACACTGGGAGATTGGGCAAAACTGGAGACAATGCGTTCTTCACATCCA
ATGCATTTTGCCAATAAGCTTTCAGAGGCTGCATCAGACGATCCTATCGATTGGATGGAGCAGCCCCCAGCAACTTTAGTGATTCAAGGTGTCTTAAGACCTGCATTCAG
TGAAGAGCATACTGTTATCCAGAGACATCTATCCAGCCGGCATTCTAGTAATGGTGACATAAACGAGGCTCAGAAACTTGAAGATAACCATGAGAACCATGATAGGATTA
ATCATCGTGGTCATGAATCTAGCTCATCCATGGATGGTTTAAACCTGGCGGATGGGTTGGATGAAAATATTCCAGTGAATGAGGCTTCATTTTACCGGCTGGAGATGATA
AAAGTTCAGCTGTTTACAGGACATTCGCATCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCAATTGCGCACTCAGCCGAAAAAATTATTTCTCG
TCTGAGAGCAGGGGGAGAAAAAACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATTAATGGTATAGATTCACTTGGAT
TTGACGTGAGGGTTTGCTCCGGAACTCAGGTCCAAACATTGCGGTTCGCATTTGATACGCGAGCAACCTCAGAGTTCAGTGCAGAAAAACAGCTGAACAACTTGCTTTTC
CCCAGAATTCATTCGAAACCTCAAAAAATGAAACAAACGCATCAAAATGAATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCGAAACTGCTCTGGCCGTTCGATTCCCCGCCGGCGCAAACTTCTGCTGCTCCTCCGCTTTGTCCTATCACCGTTTAGCTTGGATTTCGGAGGATGTAACTAGTAT
AGGCCAGGCTAGCAGCTTCTGCCGGCTTTTGCATTCTTGCGCTTCCGACGTGCATTGGAAAAGATGCCAAAGACTTAATAGTAGGTCCCTCTTTGGAAGGAACAATCCTA
GAAAGAGTGGAATTCAAGCATCAGCAGAGCATTTAGGCTCTGCATCGGACCCAATTAAACAAAATAAGAGTCTGCAGTATCATCCATCCGAAGAGCTCGTGAAATCAATA
ACTGAGAATGCTGAGGATGTTAGACCTACTGCCGCAGAAACTACGAGAACAATCATTGAGGTAAATAGCAAAGCAACGTTAATGTTTGCGGGTTTGATTAACGATGAAGT
TCAGGAAAATATTATCTGGCCAGAACTACCTTACGTAACTGACGAACGGGGAAATATTTATTTTCAAGTGAAGAATACTGAAGAGACCATGCAAAACCTAACTTCAGAAA
ATAACTTTGTGGATTGGGCTTCGGTCATAGAGGATGAAGATGATCCAAATCTCTCTGACGAGACACTGGGAGATTGGGCAAAACTGGAGACAATGCGTTCTTCACATCCA
ATGCATTTTGCCAATAAGCTTTCAGAGGCTGCATCAGACGATCCTATCGATTGGATGGAGCAGCCCCCAGCAACTTTAGTGATTCAAGGTGTCTTAAGACCTGCATTCAG
TGAAGAGCATACTGTTATCCAGAGACATCTATCCAGCCGGCATTCTAGTAATGGTGACATAAACGAGGCTCAGAAACTTGAAGATAACCATGAGAACCATGATAGGATTA
ATCATCGTGGTCATGAATCTAGCTCATCCATGGATGGTTTAAACCTGGCGGATGGGTTGGATGAAAATATTCCAGTGAATGAGGCTTCATTTTACCGGCTGGAGATGATA
AAAGTTCAGCTGTTTACAGGACATTCGCATCCAAGCAACGTTGAAATAGAAGATCTCATGAAAGCTCAACCTGATGCAATTGCGCACTCAGCCGAAAAAATTATTTCTCG
TCTGAGAGCAGGGGGAGAAAAAACCACACAAGCGCTCAAGTCTCTCTGTTGGAGATGCAAGGGCATTCAGGTTGAGGAGGCAGTAATTAATGGTATAGATTCACTTGGAT
TTGACGTGAGGGTTTGCTCCGGAACTCAGGTCCAAACATTGCGGTTCGCATTTGATACGCGAGCAACCTCAGAGTTCAGTGCAGAAAAACAGCTGAACAACTTGCTTTTC
CCCAGAATTCATTCGAAACCTCAAAAAATGAAACAAACGCATCAAAATGAATGTTAA
Protein sequenceShow/hide protein sequence
MIETALAVRFPAGANFCCSSALSYHRLAWISEDVTSIGQASSFCRLLHSCASDVHWKRCQRLNSRSLFGRNNPRKSGIQASAEHLGSASDPIKQNKSLQYHPSEELVKSI
TENAEDVRPTAAETTRTIIEVNSKATLMFAGLINDEVQENIIWPELPYVTDERGNIYFQVKNTEETMQNLTSENNFVDWASVIEDEDDPNLSDETLGDWAKLETMRSSHP
MHFANKLSEAASDDPIDWMEQPPATLVIQGVLRPAFSEEHTVIQRHLSSRHSSNGDINEAQKLEDNHENHDRINHRGHESSSSMDGLNLADGLDENIPVNEASFYRLEMI
KVQLFTGHSHPSNVEIEDLMKAQPDAIAHSAEKIISRLRAGGEKTTQALKSLCWRCKGIQVEEAVINGIDSLGFDVRVCSGTQVQTLRFAFDTRATSEFSAEKQLNNLLF
PRIHSKPQKMKQTHQNEC