; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G040410 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G040410
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchrH02:19214612..19221075
RNA-Seq ExpressionChy2G040410
SyntenyChy2G040410
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011660274.1 pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus]0.098.24Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQI KPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSA QLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKN YVSNAVLELHARCGNIE AQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLH IYDIIKLHKH HH
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYSS
        + NED ELLYSS
Subjt:  NENEDGELLYSS

XP_022924284.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita moschata]0.085.77Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLFD I KPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHF KSGFASD+FA+TALLDMY KLG+L+SA QLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIE A++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KS EIYALLH +Y IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENE-DGELLYSS
        N+NE + ELLYSS
Subjt:  NENE-DGELLYSS

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima]0.086.91Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD I KPSV+LYNKFIQTFSSIGHPHRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SA QLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN YVSNA+LE+HARCGNIE A++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IY IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYSS
        NE E+ ELLYSS
Subjt:  NENEDGELLYSS

XP_023527365.1 pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo]0.085.96Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD I KPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SA QLFDE PVRDIPTWNS+IAGYARSG+M AAL+LF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIE A++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQML+++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH +Y IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENE-DGELLYSS
        N+NE + ELLYSS
Subjt:  NENE-DGELLYSS

XP_038877076.1 pentatricopeptide repeat-containing protein At5g08510-like [Benincasa hispida]0.089.08Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        M QLKQIHAYSLRNGLD+TKFLIEKLLQ P+LPYACTLFD I +PSV+LYNKFIQTFSSIG PHRCWLLY QMC QGCSPNQ+SFTFLF ACASL N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA TALLDMYAKLGMLRSA QLFDEMPVRDIPTWNS+IAGYARSG MEAA +LF+KMPVR+V+SWT LISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNE-YVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI
        +F+ LENEKG KPNEV+IASVLPAC+QLGALDIGKRIEAYARNNGFFKN  YVSNA+LE+HARCGNI  A++VFDEIGSKRNLCSWNTMIMGLAVHGRC 
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNE-YVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI

Query:  DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHG
        DALQLYDQMLIR+MRPDDVTFVGLLLACTHGGMVAEGRQ+FESMES FQ+APKLEHYGCLVDLLGRAGELQEAY LIQNMPMAPDSVIWG LLGACSFHG
Subjt:  DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHG

Query:  NVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAH
        NVELGEVAAESLF LEPWNPGNYVILSNIYA AGDWSGVARLRKMMKGG ITKRAGYSYIEVGDGIHEFIVEDRSHL+SGEIYALLH IYDIIKLHK+ H
Subjt:  NVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAH

Query:  HNENEDGELLYSS
        HN+NED ELLYSS
Subjt:  HNENEDGELLYSS

TrEMBL top hitse value%identityAlignment
A0A0A0LY28 Uncharacterized protein1.1e-30098.24Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQI KPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSA QLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKN YVSNAVLELHARCGNIE AQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLH IYDIIKLHKH HH
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYSS
        + NED ELLYSS
Subjt:  NENEDGELLYSS

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X22.8e-25984.92Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD I KPSV+LYNKFIQ++SS G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLRSA QLFDEMPVRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN YVSNA+LE+HARCGNIE A+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY IIKL K A H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENE
        ++NE
Subjt:  NENE

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X12.3e-26184.54Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD I KPSV+LYNKFIQ++SS G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLRSA QLFDEMPVRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN YVSNA+LE+HARCGNIE A+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY IIKL K A H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYS
        ++NE  ELL+S
Subjt:  NENEDGELLYS

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X17.8e-26285.94Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLFD I KPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHF KSGFASD+FA+TALLDMY KLG+L+SA QLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIE A++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KS EIYALLH +Y IIKLH   + 
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYSS
        NE E+ ELLYSS
Subjt:  NENEDGELLYSS

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X12.6e-26586.91Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD I KPSV+LYNKFIQTFSSIGHPHRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SA QLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN YVSNA+LE+HARCGNIE A++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IY IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHH

Query:  NENEDGELLYSS
        NE E+ ELLYSS
Subjt:  NENEDGELLYSS

SwissProt top hitse value%identityAlignment
Q9FNN7 Pentatricopeptide repeat-containing protein At5g085105.3e-16755.33Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +     PH   +LY  +   G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L  A ++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++QNG Y++AL+
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N YV NA +E++++CG I+VA+++F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHK
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL K
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHK

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.1e-10635.51Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-------LPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        +  L+ IHA  ++ GL +T + + KL++          LPYA ++F  I +P++ ++N   +  +    P     LY  M S G  PN Y+F F+  +CA
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-------LPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
               GQ +H H  K G   D++  T+L+ MY + G L  A ++FD+ P RD+ ++ +LI GYA  G++E A +LF+++PV++V+SW A+ISGYA+ G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMF------------------------------------------------------------------------------------------I
         Y +ALE+F                                                                                          +
Subjt:  KYAKALEMF------------------------------------------------------------------------------------------I

Query:  GLENE----------KGTKPNEVSIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMG
         L  E           G  PN+V++ S+LPAC+ LGA+DIG+ I  Y   R  G      +  ++++++A+CG+IE A QVF+ I  K +L SWN MI G
Subjt:  GLENE----------KGTKPNEVSIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTL
         A+HGR   +  L+ +M    ++PDD+TFVGLL AC+H GM+  GR +F +M   +++ PKLEHYGC++DLLG +G  +EA  +I  M M PD VIW +L
Subjt:  LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTL

Query:  LGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNI
        L AC  HGNVELGE  AE+L K+EP NPG+YV+LSNIYA AG W+ VA+ R ++    + K  G S IE+   +HEFI+ D+ H ++ EIY +L  +
Subjt:  LGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNI

Q9LS72 Pentatricopeptide repeat-containing protein At3g292301.7e-10937.21Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SD++   AL+D Y+                                 K G LR A +LFDEMP RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN++ A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG

Query:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL

Query:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYD
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L ++ +
Subjt:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYD

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205403.7e-11241.84Show/hide
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+S P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I  A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic1.8e-10640.16Show/hide
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+ S   +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
        +      G+++H+H  K G   D +  T L+D+YAK G + SA ++FD MP R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQ+G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N  V   +++++++CG++E A  VF++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L  I + I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDII

Query:  KLH
        K H
Subjt:  KLH

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-10735.51Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-------LPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        +  L+ IHA  ++ GL +T + + KL++          LPYA ++F  I +P++ ++N   +  +    P     LY  M S G  PN Y+F F+  +CA
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-------LPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
               GQ +H H  K G   D++  T+L+ MY + G L  A ++FD+ P RD+ ++ +LI GYA  G++E A +LF+++PV++V+SW A+ISGYA+ G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMF------------------------------------------------------------------------------------------I
         Y +ALE+F                                                                                          +
Subjt:  KYAKALEMF------------------------------------------------------------------------------------------I

Query:  GLENE----------KGTKPNEVSIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMG
         L  E           G  PN+V++ S+LPAC+ LGA+DIG+ I  Y   R  G      +  ++++++A+CG+IE A QVF+ I  K +L SWN MI G
Subjt:  GLENE----------KGTKPNEVSIASVLPACSQLGALDIGKRIEAY--ARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTL
         A+HGR   +  L+ +M    ++PDD+TFVGLL AC+H GM+  GR +F +M   +++ PKLEHYGC++DLLG +G  +EA  +I  M M PD VIW +L
Subjt:  LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTL

Query:  LGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNI
        L AC  HGNVELGE  AE+L K+EP NPG+YV+LSNIYA AG W+ VA+ R ++    + K  G S IE+   +HEFI+ D+ H ++ EIY +L  +
Subjt:  LGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNI

AT2G20540.1 mitochondrial editing factor 212.7e-11341.84Show/hide
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+S P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I  A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-11037.21Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SD++   AL+D Y+                                 K G LR A +LFDEMP RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN++ A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG

Query:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL

Query:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYD
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L ++ +
Subjt:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYD

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-10740.16Show/hide
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+ S   +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
        +      G+++H+H  K G   D +  T L+D+YAK G + SA ++FD MP R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQ+G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N  V   +++++++CG++E A  VF++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L  I + I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDII

Query:  KLH
        K H
Subjt:  KLH

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-16855.33Show/hide
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +     PH   +LY  +   G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L  A ++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++QNG Y++AL+
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N YV NA +E++++CG I+VA+++F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHK
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL K
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCAC
CCTGTTTGACCAAATTTCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCATCGATGCTGGTTGCTTTACTGTCAAATGT
GTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAG
TCAGGATTTGCTTCTGATATGTTCGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCATGCCAACTGTTTGATGAAATGCCTGTTCGAGA
TATACCCACCTGGAATTCATTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAG
CTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGT
GTTCTTCCGGCCTGTTCTCAGCTTGGGGCATTGGATATAGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGAGTATGTGAGCAATGCGGTACT
GGAATTGCATGCTAGGTGTGGGAACATCGAGGTAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTG
TGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGA
GGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGG
AGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGG
GTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGA
TTAAGGAAGATGATGAAAGGAGGACATATAACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGAATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAA
GAGTGGTGAAATATATGCTTTACTTCATAATATTTATGACATTATTAAACTTCATAAGCATGCACATCATAATGAAAACGAAGATGGAGAACTACTCTATTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCAC
CCTGTTTGACCAAATTTCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCATCGATGCTGGTTGCTTTACTGTCAAATGT
GTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAG
TCAGGATTTGCTTCTGATATGTTCGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCATGCCAACTGTTTGATGAAATGCCTGTTCGAGA
TATACCCACCTGGAATTCATTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAG
CTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGT
GTTCTTCCGGCCTGTTCTCAGCTTGGGGCATTGGATATAGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGAGTATGTGAGCAATGCGGTACT
GGAATTGCATGCTAGGTGTGGGAACATCGAGGTAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTG
TGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGA
GGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGG
AGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGG
GTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGA
TTAAGGAAGATGATGAAAGGAGGACATATAACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGAATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAA
GAGTGGTGAAATATATGCTTTACTTCATAATATTTATGACATTATTAAACTTCATAAGCATGCACATCATAATGAAAACGAAGATGGAGAACTACTCTATTCTTCTTAA
Protein sequenceShow/hide protein sequence
MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQISKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCK
SGFASDMFAMTALLDMYAKLGMLRSACQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIAS
VLPACSQLGALDIGKRIEAYARNNGFFKNEYVSNAVLELHARCGNIEVAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHG
GMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVAR
LRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHNIYDIIKLHKHAHHNENEDGELLYSS