; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015621 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015621
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00004836:330520..336755
RNA-Seq ExpressionSgr015621
SyntenySgr015621
Gene Ontology termsGO:0009416 - response to light stimulus (biological process)
GO:0009451 - RNA modification (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0015689 - molybdate ion transport (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0015098 - molybdate ion transmembrane transporter activity (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR031563 - Molybdate transporter 1/2
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575573.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.06Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++S++LTY+DRSNPQISNN+N + LSFSKNL TH QT RK+QEISV G A+SNSTINQTQNLELQ+LCL G LEQAMKR+ESMLELRIEVE+D Y
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y  V SSKSRLGVRLGNALLSMFVRFGNL DAWYVFG+M+ERDVFSWNVL+GGYAKAGCFDEALNLYHRMLWAE+RPDVYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCG + DIARGKEIHAHVIRFGFE DVDV NALITMYVKCGD+SNAR LFDKMP RDRISWNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T++ SACELL ++RLGREIHG V++SEFGDDVS+NNSLIQMYSSLGHL EAEKVFSRM SKDVVSWTAM ASYD HKLP KAVETYK M LEGV+ D 
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACACL +LDLGI+LH I+IKTG ISHV+VSN+LIDMYSKCKCV+KALEVF  ISGKNVISWTSLILGLRINNRSFEAL FFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLI++LSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR VPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFD+MLESE+NPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFI+LLCACSRSGMVMEGLEYFN MKNKY+LTPNLKH+ACV+DLLGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHR+V+LGELAA+ +FEKDDK
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYY+LLCN YAECGNW+KVSKVR++MRERGL+VD GCSWVEV GKVHAFLSGDNFHAQSKEINGVL+GFY KMKE G  DLKSSF S+VESSRAD+FC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS---------------------------DFYTI----------------LIV
        GHSE+QAIAFGLINTAPGMPIWVTKN+YMC SCH+ VKFIS +VRR+IS                           D + +                L+ 
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS---------------------------DFYTI----------------LIV

Query:  PWRR-------HLYSGRSLP---------------------------------------GCSDRRKFVAGRFPAMEESVPDRTTTTTASATSSIVSNRWF
        PW               SLP                                            R++  GR P MEE  PD++ T           NRWF
Subjt:  PWRR-------HLYSGRSLP---------------------------------------GCSDRRKFVAGRFPAMEESVPDRTTTTTASATSSIVSNRWF

Query:  LRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHLTIPQIAAAGLSTAAVLL
         RLPATL PK SVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHLT+PQIAAAGLSTAAVLL
Subjt:  LRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHLTIPQIAAAGLSTAAVLL

Query:  VLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSKSGGPRSWLGFDGLLVALASCLFLIFTTGAGDSHVRAEREPSSSDPPNGSST
        +LGATGLMS LYRYLPLPVVRGIQLSQGLSFAF+AIKYIRY+QDLVTSKSGG RSW+GFDGLLVAL SCLFLI TTGAGDSH     EPSSS+  N S +
Subjt:  VLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSKSGGPRSWLGFDGLLVALASCLFLIFTTGAGDSHVRAEREPSSSDPPNGSST

Query:  RSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCKLSADLFPDREASAMTVSI
        RSG R+RRLR+LS IPAALIVFLFGLL+CFLRDLSVLKYLKFGPSKLH L ITWEDWKIGF+RAAIPQIPLSILNSVIAVCKLS DLFPDREASAM+VSI
Subjt:  RSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCKLSADLFPDREASAMTVSI

Query:  SVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMASRDMKSKEESFVMLVCAA
        SVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLN+FPIGVLGVLLLFAGIELAMASRDM SKEESFVMLVCAA
Subjt:  SVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMASRDMKSKEESFVMLVCAA

Query:  VSLTGSSAALGFGVGILLSLLMKLREFDCS--SGFGFQKMKPKSDADEE
        VSLTGSSAALGFGVGI+L LL+KLREFDCS  S FGF KMKPKSDA+EE
Subjt:  VSLTGSSAALGFGVGILLSLLMKLREFDCS--SGFGFQKMKPKSDADEE

XP_004136076.2 pentatricopeptide repeat-containing protein At1g15510, chloroplastic [Cucumis sativus]0.0e+0087.75Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTYLDRSNPQ+ NNHNP+ LSFSKNL TH  T RK QEISV G A+S+S I+QTQNLEL+ELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKS L VRLGNALLSMFVRFGNL+DAWYVFG+M+ERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAE+RP+VYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVL+TC GV DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDISNAR LFDKMP RDRISWNAMISGYFENG GLEGL LF MMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+TTV SACELL +ERLGR +HGYV++SEFG D+S+NNSLIQMYSSLG L+EAE VFSRMESKDVVSWTAM AS   HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACAC+ +LDLGI+LHEIAIKTGL+SHVIVSNSLIDMYSKCKCVD ALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYA+QGQ KLAVELFDKMLE E++PDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACS+SGMV EGLEYFN+MKNKY+LTPNLKHYACV+D+LGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHRNVELGE+AA R+FEKD+K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYA CGNWDKVSKVRS+MRERGLS D GCSWVE+KGKVHAFLSGDN H+QSKEINGVL+GF  KMKE GF +LKSSF SE+ESSRADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKNLYMCHSCHN VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

XP_008461062.1 PREDICTED: pentatricopeptide repeat-containing protein At1g15510, chloroplastic [Cucumis melo]0.0e+0088.57Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTY+DRSNPQ+ NNHN + LSFSKNL TH +T RK QEISV G A+SNSTINQTQNLEL+ELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKS L VRLGNALLSMFVRFGNL+DAWYVFG+M+ERDVFSWNVLVGGY KAGCFDEALNLYHRMLWAE+RP+VYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCGGV DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDIS AR LFDKMP RDRI+WNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGR IHGYV++ EFG DVS+NNSLI+MYSS+GHL+EAEKVFSRME KDVVSWTAM AS   HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACA L +LDLGI+LHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYA+QGQ KLAVELF+KMLE E+NPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACS+SGMV EGLEYFN+MKNKY+LTPNLKHYACV+D+LGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHRNVELGE+AATRIFEKDDK
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKV+KVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDN HA+SKEINGVL+GFY KMKE+GF +LKSSF SE+ESS+ADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKNLYMCHSCHN VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

XP_022153716.1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic [Momordica charantia]0.0e+0091.28Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFAK SHLLTYLDRSNPQISNNH P+ LSFSKNLLTH++TARK QEISV G ALSNSTINQT NLELQELCL GNLEQAMKRLESM ELRIEV+++AY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        +ALLRLCEWRRAPDEGSR+YALVSSSKSRL VRLGNALLSMFVRFGNLVDAWYVFG+M+ERDVFSWNVLVGGY+KAGCFDEALNLYHRMLWAE+RPDVYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCGGV DIARGKEIH HVIRFGFESDVDV NALITMYVKCGDIS+AR LFD+MP RDRISWNAMISGYFENGEGLEGLRLFF+MRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGREIHGYV+RSEFG DVSVNNSLIQMYSSLG+L+EAEKVFSRMESKD+VSWTAM ASYD HKLPC+AVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACACL +LDLGI+LHEIAIKTGLISHV+V+NSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKES++PNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGREVPALNQFNSQKKDVTAWNILLTGYAE+GQGKLAVELFDKMLESEVNPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACSRSGMV+EGLEYFNVMKNKY+LTPNLKH+AC++DLLGRAGRLDDAYDFIQDM +KPDAAIWGALLNACRIHRNVELGELA TRIFEKD+K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKVSKVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDNFHA SKEI+GVLEGFY KMKET FSDLKSSF+SEVE+SRADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPG PIWVTKNLYMCHSCHNFVKFISTVVRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

XP_038898908.1 pentatricopeptide repeat-containing protein At1g15510, chloroplastic [Benincasa hispida]0.0e+0090.22Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTYLDRSNPQISNNH+P+ LSFSKNL TH  T RK QEISV G A+SNS INQTQNLELQELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKSRLGV LGNALLSMF+RFGNL+DAWYVFG+M+ERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAE+RPDVYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCG V DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDISNAR LFDKMP RDRISWNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGR IHGYV++SEFG DVS+NNSLIQMYSSL HL+EAE+VFSRMESKD+VSWT M ASYD HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACACL +LDLGI+LHEIAIKTGLISHVIVSNSLIDMYSKCKCVD ALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELF+KMLESEVNPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACSRS MVMEGLEYFN+MKNKY LTPNLKHYACV+DLLGRAG+LD+AYDFIQDMPI+PDAAIWGALLNACRIHRNVELGELAATR+FEKD+K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKVSKVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDNFHAQSKEING+L+GFY KMKETGF DLK+SFMSE+ESSRADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAF LINTAPGMPIWVTKNLY CHSCH  VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

TrEMBL top hitse value%identityAlignment
A0A0A0K739 DYW_deaminase domain-containing protein0.0e+0087.75Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTYLDRSNPQ+ NNHNP+ LSFSKNL TH  T RK QEISV G A+S+S I+QTQNLEL+ELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKS L VRLGNALLSMFVRFGNL+DAWYVFG+M+ERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAE+RP+VYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVL+TC GV DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDISNAR LFDKMP RDRISWNAMISGYFENG GLEGL LF MMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+TTV SACELL +ERLGR +HGYV++SEFG D+S+NNSLIQMYSSLG L+EAE VFSRMESKDVVSWTAM AS   HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACAC+ +LDLGI+LHEIAIKTGL+SHVIVSNSLIDMYSKCKCVD ALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYA+QGQ KLAVELFDKMLE E++PDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACS+SGMV EGLEYFN+MKNKY+LTPNLKHYACV+D+LGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHRNVELGE+AA R+FEKD+K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYA CGNWDKVSKVRS+MRERGLS D GCSWVE+KGKVHAFLSGDN H+QSKEINGVL+GF  KMKE GF +LKSSF SE+ESSRADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKNLYMCHSCHN VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

A0A1S3CDV5 pentatricopeptide repeat-containing protein At1g15510, chloroplastic0.0e+0088.57Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTY+DRSNPQ+ NNHN + LSFSKNL TH +T RK QEISV G A+SNSTINQTQNLEL+ELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKS L VRLGNALLSMFVRFGNL+DAWYVFG+M+ERDVFSWNVLVGGY KAGCFDEALNLYHRMLWAE+RP+VYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCGGV DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDIS AR LFDKMP RDRI+WNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGR IHGYV++ EFG DVS+NNSLI+MYSS+GHL+EAEKVFSRME KDVVSWTAM AS   HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACA L +LDLGI+LHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYA+QGQ KLAVELF+KMLE E+NPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACS+SGMV EGLEYFN+MKNKY+LTPNLKHYACV+D+LGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHRNVELGE+AATRIFEKDDK
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKV+KVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDN HA+SKEINGVL+GFY KMKE+GF +LKSSF SE+ESS+ADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKNLYMCHSCHN VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

A0A5A7UVR8 Pentatricopeptide repeat-containing protein0.0e+0088.57Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++SHLLTY+DRSNPQ+ NNHN + LSFSKNL TH +T RK QEISV G A+SNSTINQTQNLEL+ELCL GNLEQAMKRLESMLELRIEVE+DAY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y LVSSSKS L VRLGNALLSMFVRFGNL+DAWYVFG+M+ERDVFSWNVLVGGY KAGCFDEALNLYHRMLWAE+RP+VYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCGGV DIARGKEIHAHVIRFGFESDVDV NALITMYVKCGDIS AR LFDKMP RDRI+WNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGR IHGYV++ EFG DVS+NNSLI+MYSS+GHL+EAEKVFSRME KDVVSWTAM AS   HKLP KAVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACA L +LDLGI+LHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNV+SWTSLILGLRINNRSFEALLFFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR+VPALNQFNSQKKDVTAWNILLTGYA+QGQ KLAVELF+KMLE E+NPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACS+SGMV EGLEYFN+MKNKY+LTPNLKHYACV+D+LGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHRNVELGE+AATRIFEKDDK
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKV+KVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDN HA+SKEINGVL+GFY KMKE+GF +LKSSF SE+ESS+ADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKNLYMCHSCHN VKFIST+VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

A0A6J1DLI6 pentatricopeptide repeat-containing protein At1g15510, chloroplastic0.0e+0091.28Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFAK SHLLTYLDRSNPQISNNH P+ LSFSKNLLTH++TARK QEISV G ALSNSTINQT NLELQELCL GNLEQAMKRLESM ELRIEV+++AY
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        +ALLRLCEWRRAPDEGSR+YALVSSSKSRL VRLGNALLSMFVRFGNLVDAWYVFG+M+ERDVFSWNVLVGGY+KAGCFDEALNLYHRMLWAE+RPDVYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCGGV DIARGKEIH HVIRFGFESDVDV NALITMYVKCGDIS+AR LFD+MP RDRISWNAMISGYFENGEGLEGLRLFF+MRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T+V SACELL +ERLGREIHGYV+RSEFG DVSVNNSLIQMYSSLG+L+EAEKVFSRMESKD+VSWTAM ASYD HKLPC+AVETYKMMELEG+LPDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACACL +LDLGI+LHEIAIKTGLISHV+V+NSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKES++PNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGREVPALNQFNSQKKDVTAWNILLTGYAE+GQGKLAVELFDKMLESEVNPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLCACSRSGMV+EGLEYFNVMKNKY+LTPNLKH+AC++DLLGRAGRLDDAYDFIQDM +KPDAAIWGALLNACRIHRNVELGELA TRIFEKD+K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYYILLCNLYAECGNWDKVSKVRS+MRERGLSVD GCSWVE+KGKVHAFLSGDNFHA SKEI+GVLEGFY KMKET FSDLKSSF+SEVE+SRADIFC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPG PIWVTKNLYMCHSCHNFVKFISTVVRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

A0A6J1GME6 pentatricopeptide repeat-containing protein At1g15510, chloroplastic0.0e+0085.51Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY
        MAVFA++S++LTY DRSNPQISNN+N + LSF KNL TH QT RK+QEISV G A+SNSTI QTQNLELQ+LCL G LEQA+KR+ESMLELRIEVE+D Y
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPG-ALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAY

Query:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT
        IALLRLCEWRRAPDEGSR+Y  V SSKSRLGVRLGNALLSMFVRFGNL DAWYVFG+M+ERDVFSWNVL+GGYAKAGCFDEALNLYHRMLWAE+RPDVYT
Subjt:  IALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FPSVLRTCG + DIARGKEIH HVIRFGFE DVDV NALITMYVKCGD+SNAR LFDKMP RDRISWNAMISGYFENG GLEGLRLFFMMRELSVDPDLI
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        T+T++ SACELL ++RLGREIHG  ++SEFGDDVS+NNSLIQMYSSLGHL EAEK+FSRM SKDVVSWTAM ASYD HKLP KAVETYK M LEGV+ D 
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        ITLVSVLSACACL +LDLGI+LH I+IKTG ISHV+VSN+LIDMYSKCKCV+KALEVF  ISGKNVISWTSLILGLRINNRSFEAL FFRQMKES+KPNS
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        VTLI++LSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMY+RCGR VPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFD+MLESE+NPDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFI+LLCACSRSGMVMEGLEYFN MKNKY+LTPNLKH+ACV+DLLGRAG+LDDAYDFIQDMPI+PDAAIWGALLNACRIHR+V+LGELAA+R+FEKDDK
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC
        SVGYY+LLCN YAECGNW+KVSKVR++MRERGL+VD GCSWVEV GKVHAFLSGDNFHAQSKEINGVL+GFY KMKE G  DLKS   SEVESSRAD+FC
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFC

Query:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        GHSE+QAIAFGLINTAPGMPIWVTKN+Y+C SCH+ VKFIS  VRR+IS
Subjt:  GHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339901.2e-14437.36Show/hide
Query:  IYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHR-MLWAELRPDVYTFPSVLRTCGGVPDIARG
        ++A +  SK    V +   L++++   GN+  A + F  +  RDV++WN+++ GY +AG   E +  +   ML + L PD  TFPSVL+ C  V D   G
Subjt:  IYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHR-MLWAELRPDVYTFPSVLRTCGGVPDIARG

Query:  KEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERL
         +IH   ++FGF  DV V  +LI +Y +   + NAR LFD+MP RD  SWNAMISGY ++G   E L L   +R +    D +T+ +++SAC    D   
Subjt:  KEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERL

Query:  GREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLD
        G  IH Y I+     ++ V+N LI +Y+  G L++ +KVF RM  +D++SW ++  +Y+ ++ P +A+  ++ M L  + PD +TL+S+ S  + L  + 
Subjt:  GREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLD

Query:  LGIKLHEIAIKTG-LISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKE--SLKPNSVTLISVLSACARIG
            +    ++ G  +  + + N+++ MY+K   VD A  VF  +   +VISW ++I G   N  + EA+  +  M+E   +  N  T +SVL AC++ G
Subjt:  LGIKLHEIAIKTG-LISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKE--SLKPNSVTLISVLSACARIG

Query:  ALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQF-NSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRS
        AL +G ++H   L+ G+  D F+  ++ DMY +CGR   AL+ F    + +   WN L+  +   G G+ AV LF +ML+  V PD ITF++LL ACS S
Subjt:  ALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQF-NSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRS

Query:  GMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYA
        G+V EG   F +M+  Y +TP+LKHY C++D+ GRAG+L+ A  FI+ M ++PDA+IWGALL+ACR+H NV+LG++A+  +FE + + VGY++LL N+YA
Subjt:  GMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYA

Query:  ECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRAD-IFCGHSEKQAIAFGL
          G W+ V ++RS+   +GL    G S +EV  KV  F +G+  H   +E+   L     K+K  G+       + +VE    + I   HSE+ AIAF L
Subjt:  ECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRAD-IFCGHSEKQAIAFGL

Query:  INTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI
        I T     I + KNL +C  CH+  KFIS +  R+I
Subjt:  INTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI

Q0WP36 Molybdate transporter 21.1e-16668.82Show/hide
Query:  TTTTASATSSIVSNRWFLRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHL
        TTTT           W  R    LR K  + +E+SGAVGDLGT+IPIVLTLTLVS+LDL TTLIFT  YNI TGLLF IPMPVQPMKSIAAVAV+ES HL
Subjt:  TTTTASATSSIVSNRWFLRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHL

Query:  TIPQIAAAGLSTAAVLLVLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSK-SGGPRSWLGFDGLLVALASCLFLIFTTGAGDSH
        T  QIAAAG STAA LL+LGATG MSFLY  +PLPVVRG+QLSQGL FAF+AIKY+R+N D  T K S  PR WLG DGL++ALA+ LF+I +TG+G+  
Subjt:  TIPQIAAAGLSTAAVLLVLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSK-SGGPRSWLGFDGLLVALASCLFLIFTTGAGDSH

Query:  VRAEREPSSSDPPNGSSTRSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCK
           +RE    D    SS  S  R RRLR+LS+IP+ALIVF  GL++CF+RD S+ K LKFGPSK H L I+W+DWKIGFLRAAIPQIPLS+LNSVIAVCK
Subjt:  VRAEREPSSSDPPNGSSTRSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCK

Query:  LSADLFPDREASAMTVSISVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMA
        LS DLF D+E SA TVSISVG+MN +GCWFGAMPVCHGAGGLAGQYRFG RSG SV+FLGIGKL++GL FGNSF ++L+QFPIG+LGVLLLFAGIELAMA
Subjt:  LSADLFPDREASAMTVSISVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMA

Query:  SRDMKSKEESFVMLVCAAVSLTGSSAALGFGVGILLSLLMKLREFDCSSGFGFQKMKPKSDADEE
        S+DM SKE+SF+MLVCAAVS+TGSSAALGFG G++L LL+KLR  DCSS   F +   +S  D E
Subjt:  SRDMKSKEESFVMLVCAAVSLTGSSAALGFGVGILLSLLMKLREFDCSSGFGFQKMKPKSDADEE

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.1e-15438.29Show/hide
Query:  LLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVN
        L+S+F R+G++ +A  VF  +  +    ++ ++ G+AK    D+AL  + RM + ++ P VY F  +L+ CG   ++  GKEIH  +++ GF  D+  + 
Subjt:  LLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVN

Query:  ALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVN
         L  MY KC  ++ AR +FD+MP RD +SWN +++GY +NG     L +   M E ++ P  IT+ +V+ A   L    +G+EIHGY +RS F   V+++
Subjt:  ALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVN

Query:  NSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIV
         +L+ MY+  G L+ A ++F  M  ++VVSW +M  +Y  ++ P +A+  ++ M  EGV P +++++  L ACA L  L+ G  +H+++++ GL  +V V
Subjt:  NSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIV

Query:  SNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMK-ESLKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGF
         NSLI MY KCK VD A  +F  +  + ++SW ++ILG   N R  +AL +F QM+  ++KP++ T +SV++A A +      K IH   +R+ +  + F
Subjt:  SNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMK-ESLKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGF

Query:  LPNAILDMYIRCGREVPALNQFN-SQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPN
        +  A++DMY +CG  + A   F+   ++ VT WN ++ GY   G GK A+ELF++M +  + P+ +TF+S++ ACS SG+V  GL+ F +MK  Y +  +
Subjt:  LPNAILDMYIRCGREVPALNQFN-SQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPN

Query:  LKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSV
        + HY  ++DLLGRAGRL++A+DFI  MP+KP   ++GA+L AC+IH+NV   E AA R+FE +    GY++LL N+Y     W+KV +VR  M  +GL  
Subjt:  LKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSV

Query:  DTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHN
          GCS VE+K +VH+F SG   H  SK+I   LE     +KE G+    +  +      +  +   HSEK AI+FGL+NT  G  I V KNL +C  CHN
Subjt:  DTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHN

Query:  FVKFISTVVRRDI
          K+IS V  R+I
Subjt:  FVKFISTVVRRDI

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic0.0e+0065.76Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPGALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYI
        MA  A++ H      +SN   S  +  R ++F  N        RK+Q +SV  + S+ST     N +L  LC +G LE+AMK L SM ELR+ V++D ++
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPGALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYI

Query:  ALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLW-AELRPDVYT
        AL+RLCEW+RA +EGS++Y++  SS S LGV LGNA L+MFVRFGNLVDAWYVFG+M+ER++FSWNVLVGGYAK G FDEA+ LYHRMLW   ++PDVYT
Subjt:  ALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLW-AELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FP VLRTCGG+PD+ARGKE+H HV+R+G+E D+DVVNALITMYVKCGD+ +AR LFD+MP RD ISWNAMISGYFENG   EGL LFF MR LSVDPDL+
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        TLT+V+SACELL D RLGR+IH YVI + F  D+SV NSL QMY + G  +EAEK+FSRME KD+VSWT M + Y+ + LP KA++TY+MM+ + V PDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        IT+ +VLSACA L  LD G++LH++AIK  LIS+VIV+N+LI+MYSKCKC+DKAL++F NI  KNVISWTS+I GLR+NNR FEAL+F RQMK +L+PN+
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        +TL + L+ACARIGALM GKEIHAH LRTGVG D FLPNA+LDMY+RCGR   A +QFNSQKKDVT+WNILLTGY+E+GQG + VELFD+M++S V PDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLC CS+S MV +GL YF+ M++ Y +TPNLKHYACV+DLLGRAG L +A+ FIQ MP+ PD A+WGALLNACRIH  ++LGEL+A  IFE D K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIF
        SVGYYILLCNLYA+CG W +V+KVR MM+E GL+VD GCSWVEVKGKVHAFLS D +H Q+KEIN VLEGFY+KM E G + + +SS M E E SR +IF
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIF

Query:  CGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        CGHSE++AIAFGLINT PGMPIWVTKNL MC +CH+ VKFIS  VRR+IS
Subjt:  CGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.1e-15836.59Show/hide
Query:  NLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYIALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFS
        N +L+  C  GNLE A+K L   +  + +++     ++L+LC   ++  +G  +   +  +   +   LG+ L  M+   G+L +A  VF  +       
Subjt:  NLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYIALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFS

Query:  WNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRI
        WN+L+   AK+G F  ++ L+ +M+ + +  D YTF  V ++   +  +  G+++H  +++ GF     V N+L+  Y+K   + +AR +FD+M  RD I
Subjt:  WNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRI

Query:  SWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDV
        SWN++I+GY  NG   +GL +F  M    ++ DL T+ +V + C       LGR +H   +++ F  +    N+L+ MYS  G L  A+ VF  M  + V
Subjt:  SWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDV

Query:  VSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKN
        VS+T+M A Y    L  +AV+ ++ ME EG+ PD  T+ +VL+ CA  R LD G ++HE   +  L   + VSN+L+DMY+KC  + +A  VF  +  K+
Subjt:  VSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKN

Query:  VISWTSLILGLRINNRSFEALLFFRQMKES--LKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNS-QK
        +ISW ++I G   N  + EAL  F  + E     P+  T+  VL ACA + A  +G+EIH + +R G   D  + N+++DMY +CG  + A   F+    
Subjt:  VISWTSLILGLRINNRSFEALLFFRQMKES--LKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNS-QK

Query:  KDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDM
        KD+ +W +++ GY   G GK A+ LF++M ++ +  DEI+F+SLL ACS SG+V EG  +FN+M+++  + P ++HYAC++D+L R G L  AY FI++M
Subjt:  KDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDM

Query:  PIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSK
        PI PDA IWGALL  CRIH +V+L E  A ++FE + ++ GYY+L+ N+YAE   W++V ++R  + +RGL  + GCSW+E+KG+V+ F++GD+ + +++
Subjt:  PIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSK

Query:  EINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI
         I   L     +M E G+S L K + +   E  + +  CGHSEK A+A G+I++  G  I VTKNL +C  CH   KF+S + RR+I
Subjt:  EINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-15538.29Show/hide
Query:  LLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVN
        L+S+F R+G++ +A  VF  +  +    ++ ++ G+AK    D+AL  + RM + ++ P VY F  +L+ CG   ++  GKEIH  +++ GF  D+  + 
Subjt:  LLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVN

Query:  ALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVN
         L  MY KC  ++ AR +FD+MP RD +SWN +++GY +NG     L +   M E ++ P  IT+ +V+ A   L    +G+EIHGY +RS F   V+++
Subjt:  ALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVN

Query:  NSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIV
         +L+ MY+  G L+ A ++F  M  ++VVSW +M  +Y  ++ P +A+  ++ M  EGV P +++++  L ACA L  L+ G  +H+++++ GL  +V V
Subjt:  NSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIV

Query:  SNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMK-ESLKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGF
         NSLI MY KCK VD A  +F  +  + ++SW ++ILG   N R  +AL +F QM+  ++KP++ T +SV++A A +      K IH   +R+ +  + F
Subjt:  SNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMK-ESLKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGF

Query:  LPNAILDMYIRCGREVPALNQFN-SQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPN
        +  A++DMY +CG  + A   F+   ++ VT WN ++ GY   G GK A+ELF++M +  + P+ +TF+S++ ACS SG+V  GL+ F +MK  Y +  +
Subjt:  LPNAILDMYIRCGREVPALNQFN-SQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPN

Query:  LKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSV
        + HY  ++DLLGRAGRL++A+DFI  MP+KP   ++GA+L AC+IH+NV   E AA R+FE +    GY++LL N+Y     W+KV +VR  M  +GL  
Subjt:  LKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSV

Query:  DTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHN
          GCS VE+K +VH+F SG   H  SK+I   LE     +KE G+    +  +      +  +   HSEK AI+FGL+NT  G  I V KNL +C  CHN
Subjt:  DTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHN

Query:  FVKFISTVVRRDI
          K+IS V  R+I
Subjt:  FVKFISTVVRRDI

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein0.0e+0065.76Show/hide
Query:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPGALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYI
        MA  A++ H      +SN   S  +  R ++F  N        RK+Q +SV  + S+ST     N +L  LC +G LE+AMK L SM ELR+ V++D ++
Subjt:  MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPGALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYI

Query:  ALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLW-AELRPDVYT
        AL+RLCEW+RA +EGS++Y++  SS S LGV LGNA L+MFVRFGNLVDAWYVFG+M+ER++FSWNVLVGGYAK G FDEA+ LYHRMLW   ++PDVYT
Subjt:  ALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLW-AELRPDVYT

Query:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI
        FP VLRTCGG+PD+ARGKE+H HV+R+G+E D+DVVNALITMYVKCGD+ +AR LFD+MP RD ISWNAMISGYFENG   EGL LFF MR LSVDPDL+
Subjt:  FPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLI

Query:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE
        TLT+V+SACELL D RLGR+IH YVI + F  D+SV NSL QMY + G  +EAEK+FSRME KD+VSWT M + Y+ + LP KA++TY+MM+ + V PDE
Subjt:  TLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDE

Query:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS
        IT+ +VLSACA L  LD G++LH++AIK  LIS+VIV+N+LI+MYSKCKC+DKAL++F NI  KNVISWTS+I GLR+NNR FEAL+F RQMK +L+PN+
Subjt:  ITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNS

Query:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE
        +TL + L+ACARIGALM GKEIHAH LRTGVG D FLPNA+LDMY+RCGR   A +QFNSQKKDVT+WNILLTGY+E+GQG + VELFD+M++S V PDE
Subjt:  VTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDE

Query:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK
        ITFISLLC CS+S MV +GL YF+ M++ Y +TPNLKHYACV+DLLGRAG L +A+ FIQ MP+ PD A+WGALLNACRIH  ++LGEL+A  IFE D K
Subjt:  ITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDK

Query:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIF
        SVGYYILLCNLYA+CG W +V+KVR MM+E GL+VD GCSWVEVKGKVHAFLS D +H Q+KEIN VLEGFY+KM E G + + +SS M E E SR +IF
Subjt:  SVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIF

Query:  CGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS
        CGHSE++AIAFGLINT PGMPIWVTKNL MC +CH+ VKFIS  VRR+IS
Subjt:  CGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDIS

AT1G80310.1 sulfate transmembrane transporters7.7e-16868.82Show/hide
Query:  TTTTASATSSIVSNRWFLRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHL
        TTTT           W  R    LR K  + +E+SGAVGDLGT+IPIVLTLTLVS+LDL TTLIFT  YNI TGLLF IPMPVQPMKSIAAVAV+ES HL
Subjt:  TTTTASATSSIVSNRWFLRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAESTHL

Query:  TIPQIAAAGLSTAAVLLVLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSK-SGGPRSWLGFDGLLVALASCLFLIFTTGAGDSH
        T  QIAAAG STAA LL+LGATG MSFLY  +PLPVVRG+QLSQGL FAF+AIKY+R+N D  T K S  PR WLG DGL++ALA+ LF+I +TG+G+  
Subjt:  TIPQIAAAGLSTAAVLLVLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSK-SGGPRSWLGFDGLLVALASCLFLIFTTGAGDSH

Query:  VRAEREPSSSDPPNGSSTRSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCK
           +RE    D    SS  S  R RRLR+LS+IP+ALIVF  GL++CF+RD S+ K LKFGPSK H L I+W+DWKIGFLRAAIPQIPLS+LNSVIAVCK
Subjt:  VRAEREPSSSDPPNGSSTRSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCK

Query:  LSADLFPDREASAMTVSISVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMA
        LS DLF D+E SA TVSISVG+MN +GCWFGAMPVCHGAGGLAGQYRFG RSG SV+FLGIGKL++GL FGNSF ++L+QFPIG+LGVLLLFAGIELAMA
Subjt:  LSADLFPDREASAMTVSISVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMA

Query:  SRDMKSKEESFVMLVCAAVSLTGSSAALGFGVGILLSLLMKLREFDCSSGFGFQKMKPKSDADEE
        S+DM SKE+SF+MLVCAAVS+TGSSAALGFG G++L LL+KLR  DCSS   F +   +S  D E
Subjt:  SRDMKSKEESFVMLVCAAVSLTGSSAALGFGVGILLSLLMKLREFDCSSGFGFQKMKPKSDADEE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein7.7e-16036.59Show/hide
Query:  NLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYIALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFS
        N +L+  C  GNLE A+K L   +  + +++     ++L+LC   ++  +G  +   +  +   +   LG+ L  M+   G+L +A  VF  +       
Subjt:  NLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYIALLRLCEWRRAPDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFS

Query:  WNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRI
        WN+L+   AK+G F  ++ L+ +M+ + +  D YTF  V ++   +  +  G+++H  +++ GF     V N+L+  Y+K   + +AR +FD+M  RD I
Subjt:  WNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRI

Query:  SWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDV
        SWN++I+GY  NG   +GL +F  M    ++ DL T+ +V + C       LGR +H   +++ F  +    N+L+ MYS  G L  A+ VF  M  + V
Subjt:  SWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDV

Query:  VSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKN
        VS+T+M A Y    L  +AV+ ++ ME EG+ PD  T+ +VL+ CA  R LD G ++HE   +  L   + VSN+L+DMY+KC  + +A  VF  +  K+
Subjt:  VSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKN

Query:  VISWTSLILGLRINNRSFEALLFFRQMKES--LKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNS-QK
        +ISW ++I G   N  + EAL  F  + E     P+  T+  VL ACA + A  +G+EIH + +R G   D  + N+++DMY +CG  + A   F+    
Subjt:  VISWTSLILGLRINNRSFEALLFFRQMKES--LKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQFNS-QK

Query:  KDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDM
        KD+ +W +++ GY   G GK A+ LF++M ++ +  DEI+F+SLL ACS SG+V EG  +FN+M+++  + P ++HYAC++D+L R G L  AY FI++M
Subjt:  KDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDM

Query:  PIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSK
        PI PDA IWGALL  CRIH +V+L E  A ++FE + ++ GYY+L+ N+YAE   W++V ++R  + +RGL  + GCSW+E+KG+V+ F++GD+ + +++
Subjt:  PIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSK

Query:  EINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI
         I   L     +M E G+S L K + +   E  + +  CGHSEK A+A G+I++  G  I VTKNL +C  CH   KF+S + RR+I
Subjt:  EINGVLEGFYDKMKETGFSDL-KSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.3e-14637.36Show/hide
Query:  IYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHR-MLWAELRPDVYTFPSVLRTCGGVPDIARG
        ++A +  SK    V +   L++++   GN+  A + F  +  RDV++WN+++ GY +AG   E +  +   ML + L PD  TFPSVL+ C  V D   G
Subjt:  IYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHR-MLWAELRPDVYTFPSVLRTCGGVPDIARG

Query:  KEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERL
         +IH   ++FGF  DV V  +LI +Y +   + NAR LFD+MP RD  SWNAMISGY ++G   E L L   +R +    D +T+ +++SAC    D   
Subjt:  KEIHAHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERL

Query:  GREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLD
        G  IH Y I+     ++ V+N LI +Y+  G L++ +KVF RM  +D++SW ++  +Y+ ++ P +A+  ++ M L  + PD +TL+S+ S  + L  + 
Subjt:  GREIHGYVIRSEFGDDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLD

Query:  LGIKLHEIAIKTG-LISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKE--SLKPNSVTLISVLSACARIG
            +    ++ G  +  + + N+++ MY+K   VD A  VF  +   +VISW ++I G   N  + EA+  +  M+E   +  N  T +SVL AC++ G
Subjt:  LGIKLHEIAIKTG-LISHVIVSNSLIDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKE--SLKPNSVTLISVLSACARIG

Query:  ALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQF-NSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRS
        AL +G ++H   L+ G+  D F+  ++ DMY +CGR   AL+ F    + +   WN L+  +   G G+ AV LF +ML+  V PD ITF++LL ACS S
Subjt:  ALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGREVPALNQF-NSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRS

Query:  GMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYA
        G+V EG   F +M+  Y +TP+LKHY C++D+ GRAG+L+ A  FI+ M ++PDA+IWGALL+ACR+H NV+LG++A+  +FE + + VGY++LL N+YA
Subjt:  GMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQDMPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYA

Query:  ECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRAD-IFCGHSEKQAIAFGL
          G W+ V ++RS+   +GL    G S +EV  KV  F +G+  H   +E+   L     K+K  G+       + +VE    + I   HSE+ AIAF L
Subjt:  ECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGFYDKMKETGFSDLKSSFMSEVESSRAD-IFCGHSEKQAIAFGL

Query:  INTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI
        I T     I + KNL +C  CH+  KFIS +  R+I
Subjt:  INTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTTTGCTAAAGCCTCTCATCTTCTTACCTATCTTGACCGTTCCAACCCTCAAATCTCCAACAATCACAACCCCAGAATCCTATCTTTCTCCAAAAATCTGCT
AACCCACTCGCAGACGGCAAGAAAAAATCAAGAAATCTCAGTGCCTGGTGCGCTCTCTAATTCGACTATCAACCAAACCCAGAACTTAGAATTACAAGAGCTATGCCTTC
ATGGGAACCTGGAACAAGCCATGAAACGTTTGGAATCGATGCTAGAGCTGCGGATTGAGGTGGAAGACGATGCTTATATTGCTTTGTTGAGGCTGTGTGAGTGGAGAAGG
GCACCCGATGAGGGGTCTCGCATCTACGCACTTGTATCGAGTTCGAAATCTCGTTTGGGTGTTCGGCTTGGTAATGCGCTGTTGAGCATGTTCGTGAGGTTTGGCAATTT
GGTTGATGCCTGGTATGTGTTTGGTAGAATGGCTGAAAGGGATGTGTTTTCTTGGAATGTGTTGGTTGGTGGGTATGCTAAAGCGGGCTGTTTTGACGAGGCTTTGAATC
TGTATCATAGAATGTTGTGGGCTGAACTTAGGCCTGATGTATATACTTTTCCTTCTGTTTTAAGAACTTGTGGTGGTGTTCCTGATATAGCAAGAGGCAAGGAGATTCAT
GCGCATGTCATAAGATTTGGATTTGAGTCGGATGTGGATGTGGTTAATGCTTTAATCACCATGTACGTGAAATGTGGTGACATAAGTAATGCAAGGACACTGTTTGATAA
AATGCCAAATAGAGATCGGATCTCATGGAATGCAATGATTTCGGGGTACTTCGAAAATGGGGAGGGATTGGAAGGATTGAGATTGTTTTTCATGATGCGTGAGCTTTCAG
TGGATCCAGATTTGATTACTCTGACTACTGTAGTATCCGCTTGTGAGCTTCTCAGTGACGAGAGATTAGGGAGAGAAATCCATGGATATGTAATTAGATCAGAGTTTGGG
GATGATGTTTCAGTGAATAATTCTTTGATTCAGATGTATTCAAGTCTTGGGCATTTGAAGGAAGCAGAGAAAGTCTTTTCTCGAATGGAATCAAAAGATGTGGTATCGTG
GACGGCAATGACAGCAAGCTACGACGGTCACAAGCTGCCTTGTAAGGCTGTGGAAACTTATAAAATGATGGAGTTGGAGGGCGTACTGCCGGATGAGATTACTTTAGTTA
GCGTATTATCTGCTTGTGCTTGTTTAAGGTATTTAGATTTGGGTATAAAGCTTCATGAGATTGCCATCAAGACAGGCCTCATATCACATGTCATAGTCTCAAACTCTCTC
ATTGATATGTACTCAAAGTGTAAATGCGTCGACAAGGCCTTAGAAGTTTTCCGCAATATCTCAGGAAAGAATGTTATATCTTGGACTTCACTCATCCTTGGGCTACGGAT
TAATAATAGAAGTTTTGAGGCTCTGTTATTCTTTCGTCAGATGAAGGAATCATTGAAACCAAATTCTGTAACACTGATTTCAGTCCTATCAGCATGTGCTAGAATAGGAG
CTTTAATGCGTGGAAAAGAGATTCATGCACATGCTCTACGTACTGGTGTTGGGTTTGATGGCTTTTTACCTAATGCTATTCTAGACATGTATATAAGATGTGGGAGGGAA
GTGCCTGCACTGAACCAATTTAACTCTCAAAAGAAAGATGTTACAGCGTGGAATATACTGCTTACGGGTTATGCCGAGCAGGGGCAGGGCAAACTTGCTGTTGAGCTATT
TGACAAGATGCTAGAGTCAGAGGTAAATCCAGATGAGATTACCTTTATTTCGCTATTATGTGCCTGCAGCAGGTCGGGTATGGTTATGGAAGGTTTGGAGTACTTCAATG
TAATGAAAAACAAGTATCATCTGACTCCTAATCTGAAACATTACGCGTGTGTCATTGATCTACTTGGTCGTGCTGGGCGATTAGATGATGCTTATGATTTTATACAGGAC
ATGCCAATTAAGCCAGATGCAGCCATATGGGGAGCATTGCTAAATGCTTGCAGAATTCACAGAAATGTTGAGCTTGGAGAACTTGCTGCGACACGTATATTTGAAAAGGA
TGATAAAAGCGTCGGGTATTATATTCTTCTCTGCAATCTCTATGCTGAATGCGGTAACTGGGACAAGGTTTCAAAAGTGAGATCAATGATGAGAGAGCGAGGACTATCTG
TAGATACTGGTTGCAGCTGGGTCGAAGTAAAGGGAAAAGTTCATGCTTTCCTCAGCGGCGATAATTTTCACGCTCAGTCAAAGGAAATAAATGGGGTTCTGGAAGGATTC
TATGACAAGATGAAGGAAACCGGTTTCAGCGATTTGAAGAGCAGTTTCATGAGTGAGGTTGAATCTTCCAGAGCTGACATTTTTTGTGGGCATAGTGAAAAGCAGGCCAT
TGCATTCGGGCTGATCAATACTGCCCCAGGGATGCCTATCTGGGTGACAAAGAATTTGTACATGTGCCATAGTTGTCACAACTTTGTAAAATTCATCTCGACAGTCGTAC
GTAGAGACATATCGGATTTCTACACAATCTTAATCGTTCCGTGGCGACGACATCTATACAGCGGAAGATCACTTCCTGGTTGCTCGGACAGACGGAAGTTCGTCGCCGGA
CGTTTTCCAGCTATGGAGGAGTCCGTTCCCGACCGAACCACCACCACCACCGCCTCCGCCACCTCTTCGATTGTCTCGAACCGGTGGTTCCTCCGCCTTCCGGCCACCCT
CCGCCCCAAGACTTCCGTCCTTGCAGAGATATCAGGAGCAGTCGGCGACCTTGGAACTTACATCCCCATCGTTCTCACTCTCACTCTGGTCTCTCACCTCGACCTCGGAA
CCACCCTTATCTTCACCGCTCTCTACAACATCGTCACCGGCCTCCTCTTCGGCATTCCGATGCCCGTTCAGCCCATGAAGTCCATCGCCGCAGTTGCCGTCGCCGAGTCT
ACCCATCTCACTATCCCTCAAATCGCCGCCGCCGGCCTCTCCACTGCTGCCGTCCTCCTTGTCCTCGGCGCCACCGGCCTTATGTCTTTCCTCTACCGCTACCTTCCGCT
GCCTGTCGTCCGTGGCATCCAGCTCTCGCAGGGCCTCTCCTTCGCCTTCTCCGCCATTAAATACATCCGGTACAATCAAGATTTGGTCACCTCTAAATCCGGCGGACCTC
GTTCCTGGCTTGGTTTCGACGGCCTACTTGTTGCTCTTGCTTCTTGTTTGTTTCTAATCTTCACCACCGGAGCCGGTGATTCCCATGTAAGAGCCGAGAGAGAACCGTCG
TCTTCCGATCCCCCTAACGGTTCCTCAACTCGTTCTGGCCGTCGTGTCAGAAGGTTGCGGATCTTATCTACAATTCCTGCTGCTCTTATTGTGTTCTTGTTCGGGTTATT
GATCTGTTTCCTTCGCGATCTTTCCGTTTTGAAGTACCTCAAATTTGGCCCTTCCAAATTGCACTTTCTGAGCATCACATGGGAAGATTGGAAAATAGGGTTTCTGCGAG
CTGCGATCCCGCAAATCCCGTTATCCATTTTGAACTCAGTGATCGCAGTATGCAAATTGTCCGCCGATTTGTTCCCAGATCGTGAAGCCTCCGCCATGACCGTCTCCATC
AGTGTAGGTATCATGAACTTCGTCGGTTGCTGGTTCGGCGCAATGCCGGTCTGCCACGGCGCCGGTGGACTTGCCGGGCAGTACCGGTTCGGCGGCAGAAGTGGTGCATC
AGTAGTGTTTTTGGGGATCGGGAAGTTGGTTTTGGGATTGGCGTTTGGGAATTCTTTTGCGCAGGTTCTGAATCAATTCCCAATTGGGGTTCTTGGGGTTCTTCTGCTAT
TTGCTGGGATTGAATTAGCCATGGCTTCCAGAGACATGAAGAGTAAAGAAGAGTCATTTGTGATGTTGGTCTGTGCTGCTGTGTCACTCACAGGGTCAAGTGCTGCTTTA
GGATTTGGAGTTGGGATTCTGCTCTCTCTGCTTATGAAACTGAGAGAATTTGACTGTTCTTCTGGTTTTGGGTTTCAGAAGATGAAACCCAAATCTGATGCCGACGAAGA
AACACACTTGTTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTTTTTGCTAAAGCCTCTCATCTTCTTACCTATCTTGACCGTTCCAACCCTCAAATCTCCAACAATCACAACCCCAGAATCCTATCTTTCTCCAAAAATCTGCT
AACCCACTCGCAGACGGCAAGAAAAAATCAAGAAATCTCAGTGCCTGGTGCGCTCTCTAATTCGACTATCAACCAAACCCAGAACTTAGAATTACAAGAGCTATGCCTTC
ATGGGAACCTGGAACAAGCCATGAAACGTTTGGAATCGATGCTAGAGCTGCGGATTGAGGTGGAAGACGATGCTTATATTGCTTTGTTGAGGCTGTGTGAGTGGAGAAGG
GCACCCGATGAGGGGTCTCGCATCTACGCACTTGTATCGAGTTCGAAATCTCGTTTGGGTGTTCGGCTTGGTAATGCGCTGTTGAGCATGTTCGTGAGGTTTGGCAATTT
GGTTGATGCCTGGTATGTGTTTGGTAGAATGGCTGAAAGGGATGTGTTTTCTTGGAATGTGTTGGTTGGTGGGTATGCTAAAGCGGGCTGTTTTGACGAGGCTTTGAATC
TGTATCATAGAATGTTGTGGGCTGAACTTAGGCCTGATGTATATACTTTTCCTTCTGTTTTAAGAACTTGTGGTGGTGTTCCTGATATAGCAAGAGGCAAGGAGATTCAT
GCGCATGTCATAAGATTTGGATTTGAGTCGGATGTGGATGTGGTTAATGCTTTAATCACCATGTACGTGAAATGTGGTGACATAAGTAATGCAAGGACACTGTTTGATAA
AATGCCAAATAGAGATCGGATCTCATGGAATGCAATGATTTCGGGGTACTTCGAAAATGGGGAGGGATTGGAAGGATTGAGATTGTTTTTCATGATGCGTGAGCTTTCAG
TGGATCCAGATTTGATTACTCTGACTACTGTAGTATCCGCTTGTGAGCTTCTCAGTGACGAGAGATTAGGGAGAGAAATCCATGGATATGTAATTAGATCAGAGTTTGGG
GATGATGTTTCAGTGAATAATTCTTTGATTCAGATGTATTCAAGTCTTGGGCATTTGAAGGAAGCAGAGAAAGTCTTTTCTCGAATGGAATCAAAAGATGTGGTATCGTG
GACGGCAATGACAGCAAGCTACGACGGTCACAAGCTGCCTTGTAAGGCTGTGGAAACTTATAAAATGATGGAGTTGGAGGGCGTACTGCCGGATGAGATTACTTTAGTTA
GCGTATTATCTGCTTGTGCTTGTTTAAGGTATTTAGATTTGGGTATAAAGCTTCATGAGATTGCCATCAAGACAGGCCTCATATCACATGTCATAGTCTCAAACTCTCTC
ATTGATATGTACTCAAAGTGTAAATGCGTCGACAAGGCCTTAGAAGTTTTCCGCAATATCTCAGGAAAGAATGTTATATCTTGGACTTCACTCATCCTTGGGCTACGGAT
TAATAATAGAAGTTTTGAGGCTCTGTTATTCTTTCGTCAGATGAAGGAATCATTGAAACCAAATTCTGTAACACTGATTTCAGTCCTATCAGCATGTGCTAGAATAGGAG
CTTTAATGCGTGGAAAAGAGATTCATGCACATGCTCTACGTACTGGTGTTGGGTTTGATGGCTTTTTACCTAATGCTATTCTAGACATGTATATAAGATGTGGGAGGGAA
GTGCCTGCACTGAACCAATTTAACTCTCAAAAGAAAGATGTTACAGCGTGGAATATACTGCTTACGGGTTATGCCGAGCAGGGGCAGGGCAAACTTGCTGTTGAGCTATT
TGACAAGATGCTAGAGTCAGAGGTAAATCCAGATGAGATTACCTTTATTTCGCTATTATGTGCCTGCAGCAGGTCGGGTATGGTTATGGAAGGTTTGGAGTACTTCAATG
TAATGAAAAACAAGTATCATCTGACTCCTAATCTGAAACATTACGCGTGTGTCATTGATCTACTTGGTCGTGCTGGGCGATTAGATGATGCTTATGATTTTATACAGGAC
ATGCCAATTAAGCCAGATGCAGCCATATGGGGAGCATTGCTAAATGCTTGCAGAATTCACAGAAATGTTGAGCTTGGAGAACTTGCTGCGACACGTATATTTGAAAAGGA
TGATAAAAGCGTCGGGTATTATATTCTTCTCTGCAATCTCTATGCTGAATGCGGTAACTGGGACAAGGTTTCAAAAGTGAGATCAATGATGAGAGAGCGAGGACTATCTG
TAGATACTGGTTGCAGCTGGGTCGAAGTAAAGGGAAAAGTTCATGCTTTCCTCAGCGGCGATAATTTTCACGCTCAGTCAAAGGAAATAAATGGGGTTCTGGAAGGATTC
TATGACAAGATGAAGGAAACCGGTTTCAGCGATTTGAAGAGCAGTTTCATGAGTGAGGTTGAATCTTCCAGAGCTGACATTTTTTGTGGGCATAGTGAAAAGCAGGCCAT
TGCATTCGGGCTGATCAATACTGCCCCAGGGATGCCTATCTGGGTGACAAAGAATTTGTACATGTGCCATAGTTGTCACAACTTTGTAAAATTCATCTCGACAGTCGTAC
GTAGAGACATATCGGATTTCTACACAATCTTAATCGTTCCGTGGCGACGACATCTATACAGCGGAAGATCACTTCCTGGTTGCTCGGACAGACGGAAGTTCGTCGCCGGA
CGTTTTCCAGCTATGGAGGAGTCCGTTCCCGACCGAACCACCACCACCACCGCCTCCGCCACCTCTTCGATTGTCTCGAACCGGTGGTTCCTCCGCCTTCCGGCCACCCT
CCGCCCCAAGACTTCCGTCCTTGCAGAGATATCAGGAGCAGTCGGCGACCTTGGAACTTACATCCCCATCGTTCTCACTCTCACTCTGGTCTCTCACCTCGACCTCGGAA
CCACCCTTATCTTCACCGCTCTCTACAACATCGTCACCGGCCTCCTCTTCGGCATTCCGATGCCCGTTCAGCCCATGAAGTCCATCGCCGCAGTTGCCGTCGCCGAGTCT
ACCCATCTCACTATCCCTCAAATCGCCGCCGCCGGCCTCTCCACTGCTGCCGTCCTCCTTGTCCTCGGCGCCACCGGCCTTATGTCTTTCCTCTACCGCTACCTTCCGCT
GCCTGTCGTCCGTGGCATCCAGCTCTCGCAGGGCCTCTCCTTCGCCTTCTCCGCCATTAAATACATCCGGTACAATCAAGATTTGGTCACCTCTAAATCCGGCGGACCTC
GTTCCTGGCTTGGTTTCGACGGCCTACTTGTTGCTCTTGCTTCTTGTTTGTTTCTAATCTTCACCACCGGAGCCGGTGATTCCCATGTAAGAGCCGAGAGAGAACCGTCG
TCTTCCGATCCCCCTAACGGTTCCTCAACTCGTTCTGGCCGTCGTGTCAGAAGGTTGCGGATCTTATCTACAATTCCTGCTGCTCTTATTGTGTTCTTGTTCGGGTTATT
GATCTGTTTCCTTCGCGATCTTTCCGTTTTGAAGTACCTCAAATTTGGCCCTTCCAAATTGCACTTTCTGAGCATCACATGGGAAGATTGGAAAATAGGGTTTCTGCGAG
CTGCGATCCCGCAAATCCCGTTATCCATTTTGAACTCAGTGATCGCAGTATGCAAATTGTCCGCCGATTTGTTCCCAGATCGTGAAGCCTCCGCCATGACCGTCTCCATC
AGTGTAGGTATCATGAACTTCGTCGGTTGCTGGTTCGGCGCAATGCCGGTCTGCCACGGCGCCGGTGGACTTGCCGGGCAGTACCGGTTCGGCGGCAGAAGTGGTGCATC
AGTAGTGTTTTTGGGGATCGGGAAGTTGGTTTTGGGATTGGCGTTTGGGAATTCTTTTGCGCAGGTTCTGAATCAATTCCCAATTGGGGTTCTTGGGGTTCTTCTGCTAT
TTGCTGGGATTGAATTAGCCATGGCTTCCAGAGACATGAAGAGTAAAGAAGAGTCATTTGTGATGTTGGTCTGTGCTGCTGTGTCACTCACAGGGTCAAGTGCTGCTTTA
GGATTTGGAGTTGGGATTCTGCTCTCTCTGCTTATGAAACTGAGAGAATTTGACTGTTCTTCTGGTTTTGGGTTTCAGAAGATGAAACCCAAATCTGATGCCGACGAAGA
AACACACTTGTTAGCTTGA
Protein sequenceShow/hide protein sequence
MAVFAKASHLLTYLDRSNPQISNNHNPRILSFSKNLLTHSQTARKNQEISVPGALSNSTINQTQNLELQELCLHGNLEQAMKRLESMLELRIEVEDDAYIALLRLCEWRR
APDEGSRIYALVSSSKSRLGVRLGNALLSMFVRFGNLVDAWYVFGRMAERDVFSWNVLVGGYAKAGCFDEALNLYHRMLWAELRPDVYTFPSVLRTCGGVPDIARGKEIH
AHVIRFGFESDVDVVNALITMYVKCGDISNARTLFDKMPNRDRISWNAMISGYFENGEGLEGLRLFFMMRELSVDPDLITLTTVVSACELLSDERLGREIHGYVIRSEFG
DDVSVNNSLIQMYSSLGHLKEAEKVFSRMESKDVVSWTAMTASYDGHKLPCKAVETYKMMELEGVLPDEITLVSVLSACACLRYLDLGIKLHEIAIKTGLISHVIVSNSL
IDMYSKCKCVDKALEVFRNISGKNVISWTSLILGLRINNRSFEALLFFRQMKESLKPNSVTLISVLSACARIGALMRGKEIHAHALRTGVGFDGFLPNAILDMYIRCGRE
VPALNQFNSQKKDVTAWNILLTGYAEQGQGKLAVELFDKMLESEVNPDEITFISLLCACSRSGMVMEGLEYFNVMKNKYHLTPNLKHYACVIDLLGRAGRLDDAYDFIQD
MPIKPDAAIWGALLNACRIHRNVELGELAATRIFEKDDKSVGYYILLCNLYAECGNWDKVSKVRSMMRERGLSVDTGCSWVEVKGKVHAFLSGDNFHAQSKEINGVLEGF
YDKMKETGFSDLKSSFMSEVESSRADIFCGHSEKQAIAFGLINTAPGMPIWVTKNLYMCHSCHNFVKFISTVVRRDISDFYTILIVPWRRHLYSGRSLPGCSDRRKFVAG
RFPAMEESVPDRTTTTTASATSSIVSNRWFLRLPATLRPKTSVLAEISGAVGDLGTYIPIVLTLTLVSHLDLGTTLIFTALYNIVTGLLFGIPMPVQPMKSIAAVAVAES
THLTIPQIAAAGLSTAAVLLVLGATGLMSFLYRYLPLPVVRGIQLSQGLSFAFSAIKYIRYNQDLVTSKSGGPRSWLGFDGLLVALASCLFLIFTTGAGDSHVRAEREPS
SSDPPNGSSTRSGRRVRRLRILSTIPAALIVFLFGLLICFLRDLSVLKYLKFGPSKLHFLSITWEDWKIGFLRAAIPQIPLSILNSVIAVCKLSADLFPDREASAMTVSI
SVGIMNFVGCWFGAMPVCHGAGGLAGQYRFGGRSGASVVFLGIGKLVLGLAFGNSFAQVLNQFPIGVLGVLLLFAGIELAMASRDMKSKEESFVMLVCAAVSLTGSSAAL
GFGVGILLSLLMKLREFDCSSGFGFQKMKPKSDADEETHLLA