; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007554 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007554
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr10:6972332..6974578
RNA-Seq ExpressionHG10007554
SyntenyHG10007554
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588214.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.82Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ IL STF+ ATPLLRAN ISF  H  YSLPKFHS     +  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVV+SLVSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLIS 
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREGL+EEAF+LLN IS KGME GLLTYNAIINGLCKIGK++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG LD A  YFR+MKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEMVERG+FPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLF TMVRTNLKPDIVTYNTLIDGFCKVG+M +AK+LWDDM+RKDI+PNH++YG VINGFCSSG+L EA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQGL+LD ITYN+ILNG+CA+GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

XP_022933790.1 pentatricopeptide repeat-containing protein At5g01110 [Cucurbita moschata]0.0e+0083.96Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ I+ ST T ATPLLR+N ISF  H  YSLPKFHS     +  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVV+SLVSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLISA
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREGL+EEAF+LLN IS KGME GLLTYNAIINGLCKIGK++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG LD A  YFR+MKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEMVERGVFPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLFG MVRTNLKPDIVTYNTLIDGFCKVG+M +AK+LWDDM+RKDI+PNH+SYG VINGFCSSG+LSEA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQ L+LD ITYN+ILNG+CA+GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

XP_022973817.1 pentatricopeptide repeat-containing protein At5g01110 [Cucurbita maxima]0.0e+0084.63Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ IL STFT ATPLLRAN ISF  H IYSLPKFHS     S  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVES+VSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRIL+SKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLISA
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREG +EEAF+LLN IS KGME GLLTYNAIINGLCKI K++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG+LD A  YFRDMK+IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEMVERGVFPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLFGTMVRTNLKPDIVTYNTLIDGFCKVG+M RAK+LWDDM+RKDI+PNH+SYG VINGFCSSG+LSEA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQGL+LD ITYN+ILNG+CA+GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

XP_023530884.1 pentatricopeptide repeat-containing protein At5g01110 [Cucurbita pepo subsp. pepo]0.0e+0084.22Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ IL  TFT ATPLLRANFISF  H  YSLPKFHS     +  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQ CILRMVRKSGVSRVEVVES+VSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLISA
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREGL+EEAF+LLN IS KGME GLLTYNAIINGLCKIGK++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG LD A  YFR+MKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEM+ERGVFPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLFG MVRTNLKPDIVTYNTLIDGFCKVG+M +AK+LWDDM+RKDI+PNH+SYG VINGFCSSG+LSEA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQGL+LD ITYN+ILNG+CA GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

XP_038879208.1 pentatricopeptide repeat-containing protein At5g01110 [Benincasa hispida]0.0e+0089.14Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPP-------SLSPSDSFLVEKTLFSLKQNNVSYLSNS
        MA HRLPLPKSTFRSG+L STF SATP +         HTIYSLPKF SNTPLRS Q+H TPE P       S SPSDSFLV+K LF+LKQNNVSYLSNS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPP-------SLSPSDSFLVEKTLFSLKQNNVSYLSNS

Query:  LFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLI
        LFRLNPSLLL+VLCRCRENLHLGLKFI L+SSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESL+ST  NFGSIGLVSDLL+
Subjt:  LFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLI

Query:  RTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVT
        RTYVQA+ LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGGIELNVYTLNIMVNALCKD KIENVNLFLSDMEEKGVF DIVT
Subjt:  RTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVT

Query:  YNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVL
        YNTLI+AYCREGL+EEAFQLLN IS KGME GLLTYNAIINGLCKIGK++RAKD+LNEM+QLGLRPDA +YN LLVEICRRDNILEAQEIFD+MSRH VL
Subjt:  YNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVL

Query:  PDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMV
        PDLVSFSSL+GVLARNGHLD+AFM+FR+MKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GCFMDVVAYNTILNGLCKKKMF DADMLFNEMV
Subjt:  PDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMV

Query:  ERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHL
        ERGVFPDFYTFTTLIHGYCK GNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVG M RAKELWDDM+RKDILPNHISYGIVINGFCSSGHLSEAWHL
Subjt:  ERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHL

Query:  CDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQ
        CDQMVE+GIKPNL+TCNTLIKGYC+SGDMP AYEYLS+MISNGIIPDSISYNTLIDGYLKE+NLEKAFVLINEMEKQGLQLD ITYNVILNGFCA+GRMQ
Subjt:  CDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQ

Query:  EAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        EAEQVL+KMIENG+NPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  EAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

TrEMBL top hitse value%identityAlignment
A0A1S3BNF5 pentatricopeptide repeat-containing protein At5g011100.0e+0083.16Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEP------PSLSPSDSFLVEKTLFSLKQNNVSYLSNSL
        MA HRLPLPKSTFRSGIL+S FTS+TPLL  NF SF+LHTIYS     SNT LR  Q+  TP P       S+SPSDSFL+EK LFSLKQNNVSYL +SL
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEP------PSLSPSDSFLVEKTLFSLKQNNVSYLSNSL

Query:  FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIR
         RL+PSLLLQVL RCRE+LHLGLKFIGL+S   PNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRK GVSRV+VVESL+STC NFGSIGLV DLL+R
Subjt:  FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIR

Query:  TYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTY
        TYVQAK LREGSEAF+ILR KGVSVSINACN LL GLV+ GWVDLA EI GEVVRGGIELNVYTLNIMVNALCKD K ENV  FLSDMEEKGVFADIVTY
Subjt:  TYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTY

Query:  NTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLP
        NTLI AYCREGL+EEAFQLLN  S +GME G+LTYNAI+ GLCK+GK+DRAK +L EMLQLGL P+A +YN+LLVEICRRDNILEAQEIFDEMSRH VLP
Subjt:  NTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLP

Query:  DLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVE
        DLVSFSSLIGVLARNGHL +A MYFR+M+  GLVPDNVIYTILIDGFCRNGA+SDALKMRDEMLARG FMDVV YNT LNG CKKKM  DADMLF EMVE
Subjt:  DLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVE

Query:  RGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLC
        RG+ PDFYTFTTLI GYCK GNMDKALNLF TMVR+NLKPDIVTYNTLIDGFCK GEM RAKELWDDM+RKDILPNHISYGIVINGFCSSG L +A HLC
Subjt:  RGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLC

Query:  DQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQE
        DQMVE+GI+PNL+TCNTLIKGYC+SGDMP AYEYLSKMISNGIIPDSISYNTLIDGYLKE NLEKAFVLINEMEK+GLQL+ ITYN ILNGFCA GRMQE
Subjt:  DQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQE

Query:  AEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        AE VLRKMIE G+NPDRATYS LINGHVSQD+MK+AF FHDEMLQRGLVPDD F
Subjt:  AEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

A0A6J1DJZ9 pentatricopeptide repeat-containing protein At5g011100.0e+0080.93Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLL--RANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLN
        MAA+RLP    TFR+  L  TFT   PLL  RA F S RLH+I+SL K H++         G  +   +S S+SFLVEK LF LKQNNVS LS SLF LN
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLL--RANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLN

Query:  PSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQ
         S L++VL  CREN+ LG++FIGL+SSNCPNFKHSSLSLS M+HFLVRG RLSEAQA ILRMVRKSGVSRVEVVESLVSTC +FGSI LV DLL+RTYVQ
Subjt:  PSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQ

Query:  AKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLI
        A+ LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGGI+LN YTLNIMVNALCKD KIENVNLFLSDME+KGVF DIVTYNTLI
Subjt:  AKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLI

Query:  SAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVS
        SAYCREGL+EEAF+LLNL+S KGME G+LTYNAIINGLCK GK+  AK +LNEMLQLGLRPD T+YN LLVE CRRD+ILEA+EIFDEMSRH V  DL+S
Subjt:  SAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVS

Query:  FSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVF
        FSS+IGVL+RNGHLDRA + FRDMKS+GLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC MDVVAYNTILNGLCKKKM+VDA+MLFNEMVERGVF
Subjt:  FSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVF

Query:  PDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMV
        PDFYTFTTLI+GYCK GNMDKALNLFGTM+ TNLKPDIVTYNTLIDGFCKVGE+ RAKELWDDM RKDILPNHISYGIVINGFC+SGHLSEA  LC+QMV
Subjt:  PDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMV

Query:  EQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQV
        EQGIK NLITCNTLIKG+C+SGDM  AY  LSKMISNGI PDSISYNTLIDGY+KE N+EKA VL+NEMEKQG+QLD ITYN ILNGFCA+GRM+EAEQV
Subjt:  EQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQV

Query:  LRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        LRKMIENG+NPDRAT+SSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  LRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

A0A6J1F5T7 pentatricopeptide repeat-containing protein At5g011100.0e+0083.96Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ I+ ST T ATPLLR+N ISF  H  YSLPKFHS     +  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVV+SLVSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRILRSKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLISA
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREGL+EEAF+LLN IS KGME GLLTYNAIINGLCKIGK++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG LD A  YFR+MKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEMVERGVFPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLFG MVRTNLKPDIVTYNTLIDGFCKVG+M +AK+LWDDM+RKDI+PNH+SYG VINGFCSSG+LSEA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQ L+LD ITYN+ILNG+CA+GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

A0A6J1I8K2 pentatricopeptide repeat-containing protein At5g011100.0e+0084.63Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS
        MAAHRLPLPK TFR+ IL STFT ATPLLRAN ISF  H IYSLPKFHS     S  S+  P   S+S S+SFLVEK LFSLKQNNVS LSNSLFRLNPS
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPS

Query:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK
         L++VL  CRENLHLGLKFI L+SS+CPN KHSS+SLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVES+VSTCGNFGSIGLVSDLL+RTYVQA+
Subjt:  LLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAK

Query:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA
         LREGSEAFRIL+SKGVSVSINACNSLL GLVKIGWVDLA EI GEVVRGG ELNVYTLNIMVNALCKD +I NVNLFLSDME+KGVF DIVTYNTLISA
Subjt:  NLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISA

Query:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS
        YCREG +EEAF+LLN IS KGME GLLTYNAIINGLCKI K++RAKD+LN+M QLGL+PDA +YN LLVEICRRDNI EA+EIFDEMSRH VLPDL+SFS
Subjt:  YCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFS

Query:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD
        SLI VLARNG+LD A  YFRDMK+IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLA+GC +DVVAYNTILNGLCKKKM VDADMLFNEMVERGVFPD
Subjt:  SLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPD

Query:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ
        FYTFTTLIHGYCK GNMD+ALNLFGTMVRTNLKPDIVTYNTLIDGFCKVG+M RAK+LWDDM+RKDI+PNH+SYG VINGFCSSG+LSEA HLCDQMVE+
Subjt:  FYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQ

Query:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR
        GIKPNL+T NTLIKGYC+S DM  A+E LSKMISNGIIPD ISYNTLIDGYLK+ NL KAFV+INEMEKQGL+LD ITYN+ILNG+CA+GRM EAEQVLR
Subjt:  GIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLR

Query:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        KMIENGVNPDRATYSSLINGHVSQD+MKDAF FHDEMLQRGLVPDD F
Subjt:  KMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

A0A7N2LDD7 Uncharacterized protein1.5e-29869.56Show/hide
Query:  MAAHRLPLPKSTFRSGILTSTFTSAT-PLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNP
        MA  RL   K   R      TFT+ T P   A   S    ++ S PK  SN  LR+ Q+   P   + S SDSFLVEK LFSLKQ N S L N LFRLNP
Subjt:  MAAHRLPLPKSTFRSGILTSTFTSAT-PLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNP

Query:  SLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQA
         ++++VLCRCRENL LG KF+ L+  NCPNFKHSS SLSAMVH LVR RRLS+AQ  ILRMVRKSGVSR E+VESLVS C N     LV DLLIRTYVQA
Subjt:  SLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQA

Query:  KNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLIS
        + LREGSEAFR+LRSKG  VSINACNSLL GLVK+GWV LA ++ GEVV  GI+LNVYTLNIMVNALCKD KI+NV +FL  MEEKGVF DIVTYNTLI+
Subjt:  KNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLIS

Query:  AYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSF
        AYCREGL++EAF+++N + GKG++ GL+TYNAIINGLCK GK+ RAK++L+E+L  GL PD T+YN LLVE CR+DNILEA+ IF+EM    V+PDLVSF
Subjt:  AYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSF

Query:  SSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFP
        SSLIGV  RNGHLD+A ++FR+MK  GLVPDNVIYTILIDG+CRNG + +ALKMRDEML RGC MDVV YNTILNGLC++KM  DAD  FNEM+ERGVFP
Subjt:  SSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFP

Query:  DFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVE
        DFYTFTTLIHGYCK GNM++ALNLF TM + NLKPDIVTYNTLIDGFCKVGEM +AKELW DM  + I PNHISYGIVINGFCS GH+SEA+   D+MVE
Subjt:  DFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVE

Query:  QGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVL
        +GIKP+L+TCNT+IKG+C+SG+   A E+L KM+S GI+PDSI+YNTLI+GY++E N++KAFVL+N+MEKQGL  D ITYNVILNGFC +GRM+EAE +L
Subjt:  QGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVL

Query:  RKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
        +KMIE GVNPDR TY+SLINGHVSQD++K+AF FHDEMLQRG VPDD F
Subjt:  RKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial1.1e-9630.53Show/hide
Query:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV
        F + T  R F  +    P   S  D+  V +    +K      L  SL     +     L+ VL + + +  L L F     S     + S+L SL  ++
Subjt:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV

Query:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-
        H  V  + L  AQ+ I     +  +    S V+  + LV T  ++GS   V D+  +  V    LRE    F  + + G+ +S+++CN  L  L K  + 
Subjt:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-

Query:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL
           A+ +  E    G+  NV + NI+++ +C+  +I+  +  L  ME KG   D+++Y+T+++ YCR G +++ ++L+ ++  KG++     Y +II  L
Subjt:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL

Query:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI
        C+I K   A++  +EM++ G+ PD   Y  L+   C+R +I  A + F EM   D+ PD+++++++I    + G +  A   F +M   GL PD+V +T 
Subjt:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI

Query:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI
        LI+G+C+ G + DA ++ + M+  GC  +VV Y T+++GLCK+     A+ L +EM + G+ P+ +T+ ++++G CK GN+++A+ L G      L  D 
Subjt:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI

Query:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG
        VTY TL+D +CK GEM +A+E+  +M+ K + P  +++ +++NGFC  G L +   L + M+ +GI PN  T N+L+K YC   ++  A      M S G
Subjt:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG

Query:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY
        + PD  +Y  L+ G+ K  N+++A+ L  EM+ +G  +   TY+V++ GF  R +  EA +V  +M   G+  D+  +
Subjt:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial3.8e-8931.55Show/hide
Query:  NSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMES
        N L   + +    DL L  C  +   GIE ++YT+ IM+N  C+  K+      L    + G   D +T++TL++ +C EG + EA  L++ +       
Subjt:  NSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMES

Query:  GLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKS
         L+T + +INGLC  G+   A  +++ M++ G +PD  +Y  +L  +C+  N   A ++F +M   ++   +V +S +I  L ++G  D A   F +M+ 
Subjt:  GLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKS

Query:  IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLF
         G+  D V Y+ LI G C +G   D  KM  EM+ R    DVV ++ +++   K+   ++A  L+NEM+ RG+ PD  T+ +LI G+CK   + +A  +F
Subjt:  IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLF

Query:  GTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPN
          MV    +PDIVTY+ LI+ +CK   +     L+ ++  K ++PN I+Y  ++ GFC SG L+ A  L  +MV +G+ P+++T   L+ G C +G++  
Subjt:  GTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPN

Query:  AYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQ
        A E   KM  + +      YN +I G    + ++ A+ L   +  +G++ D +TYNV++ G C +G + EA+ + RKM E+G  PD  TY+ LI  H+  
Subjt:  AYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQ

Query:  DSMKDAFHFHDEMLQRGLVPDDT
          +  +    +EM   G   D +
Subjt:  DSMKDAFHFHDEMLQRGLVPDDT

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.3e-8928.72Show/hide
Query:  SPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVS
        SPSDS L +K L  LK++    L +      P     +L + + +  L LKF+   +   P+   +       +H L + +    AQ     +  K+   
Subjt:  SPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVS

Query:  RVE--VVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-VDLALEICGEVVRGGIELNVYTLNIMVN
             V +SL  T     S   V DL++++Y +   + +      + ++ G    + + N++L   ++    +  A  +  E++   +  NV+T NI++ 
Subjt:  RVE--VVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-VDLALEICGEVVRGGIELNVYTLNIMVN

Query:  ALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSY
          C    I+        ME KG   ++VTYNTLI  YC+   I++ F+LL  ++ KG+E  L++YN +INGLC+ G+      +L EM + G   D  +Y
Subjt:  ALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSY

Query:  NMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFM
        N L+   C+  N  +A  +  EM RH + P +++++SLI  + + G+++RA  +   M+  GL P+   YT L+DGF + G +++A ++  EM   G   
Subjt:  NMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFM

Query:  DVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVR
         VV YN ++NG C      DA  +  +M E+G+ PD  +++T++ G+C+  ++D+AL +   MV   +KPD +TY++LI GFC+      A +L+++M+R
Subjt:  DVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVR

Query:  KDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLI
          + P+  +Y  +IN +C  G L +A  L ++MVE+G+ P+++T + LI G  +      A   L K+     +P  ++Y+TLI+     +N+E   V+ 
Subjt:  KDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLI

Query:  NEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLV
                         ++ GFC +G M EA+QV   M+     PD   Y+ +I+GH     ++ A+  + EM++ G +
Subjt:  NEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLV

Q9FJE6 Putative pentatricopeptide repeat-containing protein At5g599005.0e-8929.57Show/hide
Query:  RLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRL----SEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDL
        RL    + ++L    ++  LGL+F   L  +   F HS+ S   ++H LV+        S  Q  +LR ++ S V    V+ S    C    S     DL
Subjt:  RLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRL----SEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDL

Query:  LIRTYVQAKNLREGSEAFRILRSK-GVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFAD
        LI+ YV+++ + +G   F+++ +K  +   +   ++LL GLVK     LA+E+  ++V  GI  +VY    ++ +LC+   +      ++ ME  G   +
Subjt:  LIRTYVQAKNLREGSEAFRILRSK-GVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFAD

Query:  IVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRH
        IV YN LI   C++  + EA  +   ++GK ++  ++TY  ++ GLCK+ +F+   ++++EML L   P   + + L+  + +R  I EA  +   +   
Subjt:  IVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRH

Query:  DVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFN
         V P+L  +++LI  L +      A + F  M  IGL P++V Y+ILID FCR G +  AL    EM+  G  + V  YN+++NG CK      A+    
Subjt:  DVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFN

Query:  EMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEA
        EM+ + + P   T+T+L+ GYC  G ++KAL L+  M    + P I T+ TL+ G  + G +  A +L+++M   ++ PN ++Y ++I G+C  G +S+A
Subjt:  EMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEA

Query:  WHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARG
        +    +M E+GI P+  +   LI G C +G    A  ++  +       + I Y  L+ G+ +E  LE+A  +  EM ++G+ LD + Y V+++G     
Subjt:  WHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARG

Query:  RMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDD
          +    +L++M + G+ PD   Y+S+I+        K+AF   D M+  G VP++
Subjt:  RMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDD

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011104.5e-23957.76Show/hide
Query:  SFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQ
        S  S  +    S S SDSFLVEK  FSLKQ N + + N L RLNP  +++VL RCR +L LG +F+  L  + PNFKH+SLSLSAM+H LVR  RLS+AQ
Subjt:  SFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQ

Query:  ACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIEL
        +C+LRM+R+SGVSR+E+V SL ST  N GS   V DLLIRTYVQA+ LRE  EAF +LRSKG +VSI+ACN+L+  LV+IGWV+LA  +  E+ R G+ +
Subjt:  ACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIEL

Query:  NVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQ
        NVYTLNIMVNALCKD K+E V  FLS ++EKGV+ DIVTYNTLISAY  +GL+EEAF+L+N + GKG   G+ TYN +INGLCK GK++RAK++  EML+
Subjt:  NVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQ

Query:  LGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMR
         GL PD+T+Y  LL+E C++ +++E +++F +M   DV+PDLV FSS++ +  R+G+LD+A MYF  +K  GL+PDNVIYTILI G+CR G IS A+ +R
Subjt:  LGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMR

Query:  DEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGR
        +EML +GC MDVV YNTIL+GLCK+KM  +AD LFNEM ER +FPD YT T LI G+CK+GN+  A+ LF  M    ++ D+VTYNTL+DGF KVG++  
Subjt:  DEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGR

Query:  AKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKE
        AKE+W DMV K+ILP  ISY I++N  CS GHL+EA+ + D+M+ + IKP ++ CN++IKGYC+SG+  +   +L KMIS G +PD ISYNTLI G+++E
Subjt:  AKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKE

Query:  NNLEKAFVLINEMEKQ--GLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
         N+ KAF L+ +ME++  GL  D  TYN IL+GFC + +M+EAE VLRKMIE GVNPDR+TY+ +ING VSQD++ +AF  HDEMLQRG  PDD F
Subjt:  NNLEKAFVLINEMEKQ--GLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.9e-9830.53Show/hide
Query:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV
        F + T  R F  +    P   S  D+  V +    +K      L  SL     +     L+ VL + + +  L L F     S     + S+L SL  ++
Subjt:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV

Query:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-
        H  V  + L  AQ+ I     +  +    S V+  + LV T  ++GS   V D+  +  V    LRE    F  + + G+ +S+++CN  L  L K  + 
Subjt:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-

Query:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL
           A+ +  E    G+  NV + NI+++ +C+  +I+  +  L  ME KG   D+++Y+T+++ YCR G +++ ++L+ ++  KG++     Y +II  L
Subjt:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL

Query:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI
        C+I K   A++  +EM++ G+ PD   Y  L+   C+R +I  A + F EM   D+ PD+++++++I    + G +  A   F +M   GL PD+V +T 
Subjt:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI

Query:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI
        LI+G+C+ G + DA ++ + M+  GC  +VV Y T+++GLCK+     A+ L +EM + G+ P+ +T+ ++++G CK GN+++A+ L G      L  D 
Subjt:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI

Query:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG
        VTY TL+D +CK GEM +A+E+  +M+ K + P  +++ +++NGFC  G L +   L + M+ +GI PN  T N+L+K YC   ++  A      M S G
Subjt:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG

Query:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY
        + PD  +Y  L+ G+ K  N+++A+ L  EM+ +G  +   TY+V++ GF  R +  EA +V  +M   G+  D+  +
Subjt:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein7.9e-9830.53Show/hide
Query:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV
        F + T  R F  +    P   S  D+  V +    +K      L  SL     +     L+ VL + + +  L L F     S     + S+L SL  ++
Subjt:  FHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSL----FRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSL-SLSAMV

Query:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-
        H  V  + L  AQ+ I     +  +    S V+  + LV T  ++GS   V D+  +  V    LRE    F  + + G+ +S+++CN  L  L K  + 
Subjt:  HFLVRGRRLSEAQACILRMVRKSGV----SRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-

Query:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL
           A+ +  E    G+  NV + NI+++ +C+  +I+  +  L  ME KG   D+++Y+T+++ YCR G +++ ++L+ ++  KG++     Y +II  L
Subjt:  VDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGL

Query:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI
        C+I K   A++  +EM++ G+ PD   Y  L+   C+R +I  A + F EM   D+ PD+++++++I    + G +  A   F +M   GL PD+V +T 
Subjt:  CKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTI

Query:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI
        LI+G+C+ G + DA ++ + M+  GC  +VV Y T+++GLCK+     A+ L +EM + G+ P+ +T+ ++++G CK GN+++A+ L G      L  D 
Subjt:  LIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDI

Query:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG
        VTY TL+D +CK GEM +A+E+  +M+ K + P  +++ +++NGFC  G L +   L + M+ +GI PN  T N+L+K YC   ++  A      M S G
Subjt:  VTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNG

Query:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY
        + PD  +Y  L+ G+ K  N+++A+ L  EM+ +G  +   TY+V++ GF  R +  EA +V  +M   G+  D+  +
Subjt:  IIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATY

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein2.7e-9031.55Show/hide
Query:  NSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMES
        N L   + +    DL L  C  +   GIE ++YT+ IM+N  C+  K+      L    + G   D +T++TL++ +C EG + EA  L++ +       
Subjt:  NSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMES

Query:  GLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKS
         L+T + +INGLC  G+   A  +++ M++ G +PD  +Y  +L  +C+  N   A ++F +M   ++   +V +S +I  L ++G  D A   F +M+ 
Subjt:  GLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKS

Query:  IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLF
         G+  D V Y+ LI G C +G   D  KM  EM+ R    DVV ++ +++   K+   ++A  L+NEM+ RG+ PD  T+ +LI G+CK   + +A  +F
Subjt:  IGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLF

Query:  GTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPN
          MV    +PDIVTY+ LI+ +CK   +     L+ ++  K ++PN I+Y  ++ GFC SG L+ A  L  +MV +G+ P+++T   L+ G C +G++  
Subjt:  GTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPN

Query:  AYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQ
        A E   KM  + +      YN +I G    + ++ A+ L   +  +G++ D +TYNV++ G C +G + EA+ + RKM E+G  PD  TY+ LI  H+  
Subjt:  AYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQ

Query:  DSMKDAFHFHDEMLQRGLVPDDT
          +  +    +EM   G   D +
Subjt:  DSMKDAFHFHDEMLQRGLVPDDT

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.2e-24057.76Show/hide
Query:  SFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQ
        S  S  +    S S SDSFLVEK  FSLKQ N + + N L RLNP  +++VL RCR +L LG +F+  L  + PNFKH+SLSLSAM+H LVR  RLS+AQ
Subjt:  SFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQ

Query:  ACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIEL
        +C+LRM+R+SGVSR+E+V SL ST  N GS   V DLLIRTYVQA+ LRE  EAF +LRSKG +VSI+ACN+L+  LV+IGWV+LA  +  E+ R G+ +
Subjt:  ACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGWVDLALEICGEVVRGGIEL

Query:  NVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQ
        NVYTLNIMVNALCKD K+E V  FLS ++EKGV+ DIVTYNTLISAY  +GL+EEAF+L+N + GKG   G+ TYN +INGLCK GK++RAK++  EML+
Subjt:  NVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQ

Query:  LGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMR
         GL PD+T+Y  LL+E C++ +++E +++F +M   DV+PDLV FSS++ +  R+G+LD+A MYF  +K  GL+PDNVIYTILI G+CR G IS A+ +R
Subjt:  LGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMR

Query:  DEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGR
        +EML +GC MDVV YNTIL+GLCK+KM  +AD LFNEM ER +FPD YT T LI G+CK+GN+  A+ LF  M    ++ D+VTYNTL+DGF KVG++  
Subjt:  DEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGR

Query:  AKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKE
        AKE+W DMV K+ILP  ISY I++N  CS GHL+EA+ + D+M+ + IKP ++ CN++IKGYC+SG+  +   +L KMIS G +PD ISYNTLI G+++E
Subjt:  AKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKE

Query:  NNLEKAFVLINEMEKQ--GLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF
         N+ KAF L+ +ME++  GL  D  TYN IL+GFC + +M+EAE VLRKMIE GVNPDR+TY+ +ING VSQD++ +AF  HDEMLQRG  PDD F
Subjt:  NNLEKAFVLINEMEKQ--GLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.3e-9128.72Show/hide
Query:  SPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVS
        SPSDS L +K L  LK++    L +      P     +L + + +  L LKF+   +   P+   +       +H L + +    AQ     +  K+   
Subjt:  SPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCRENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVS

Query:  RVE--VVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-VDLALEICGEVVRGGIELNVYTLNIMVN
             V +SL  T     S   V DL++++Y +   + +      + ++ G    + + N++L   ++    +  A  +  E++   +  NV+T NI++ 
Subjt:  RVE--VVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVSINACNSLLRGLVKIGW-VDLALEICGEVVRGGIELNVYTLNIMVN

Query:  ALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSY
          C    I+        ME KG   ++VTYNTLI  YC+   I++ F+LL  ++ KG+E  L++YN +INGLC+ G+      +L EM + G   D  +Y
Subjt:  ALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYNAIINGLCKIGKFDRAKDILNEMLQLGLRPDATSY

Query:  NMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFM
        N L+   C+  N  +A  +  EM RH + P +++++SLI  + + G+++RA  +   M+  GL P+   YT L+DGF + G +++A ++  EM   G   
Subjt:  NMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDGFCRNGAISDALKMRDEMLARGCFM

Query:  DVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVR
         VV YN ++NG C      DA  +  +M E+G+ PD  +++T++ G+C+  ++D+AL +   MV   +KPD +TY++LI GFC+      A +L+++M+R
Subjt:  DVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVGEMGRAKELWDDMVR

Query:  KDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLI
          + P+  +Y  +IN +C  G L +A  L ++MVE+G+ P+++T + LI G  +      A   L K+     +P  ++Y+TLI+     +N+E   V+ 
Subjt:  KDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKAFVLI

Query:  NEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLV
                         ++ GFC +G M EA+QV   M+     PD   Y+ +I+GH     ++ A+  + EM++ G +
Subjt:  NEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTCATCGACTTCCTCTCCCAAAATCCACCTTCAGAAGTGGAATTCTTACATCAACATTCACATCTGCAACTCCTTTACTTCGTGCTAATTTCATTTCTTTTAG
ACTTCACACTATCTATTCGCTCCCCAAATTTCATTCTAATACTCCCCTTCGATCCTTCCAATCCCATGGTACTCCAGAACCTCCTTCACTCTCTCCCTCGGATTCGTTTC
TAGTGGAAAAGACTTTGTTCAGTTTGAAGCAGAATAATGTAAGTTATTTGTCAAATTCTCTTTTCCGCCTGAACCCTTCACTTTTACTCCAAGTTCTCTGTAGGTGTCGT
GAAAATTTACATTTGGGCTTAAAATTTATTGGTTTACTTTCTTCTAATTGCCCGAATTTCAAGCATTCGTCACTTTCCTTGAGTGCAATGGTTCATTTTTTAGTGCGCGG
CAGGAGGCTCTCAGAAGCCCAAGCTTGCATTCTTAGGATGGTGAGGAAGAGCGGCGTCTCACGAGTTGAGGTCGTTGAATCCTTAGTCTCGACGTGTGGTAATTTTGGGT
CGATTGGTTTGGTTTCTGATTTGTTAATAAGGACTTATGTGCAAGCTAAAAATTTAAGAGAAGGGTCTGAAGCATTTCGAATTTTGAGGAGTAAAGGAGTTTCTGTTTCT
ATAAATGCTTGTAACAGTCTCCTTCGTGGTCTTGTGAAGATTGGGTGGGTTGATTTAGCTTTGGAAATATGTGGGGAAGTTGTGAGAGGAGGTATTGAGTTGAATGTTTA
TACATTGAACATTATGGTTAATGCTCTATGTAAAGACCACAAAATTGAGAATGTGAACTTGTTCTTATCAGATATGGAAGAGAAAGGAGTTTTTGCTGACATTGTGACAT
ATAATACTCTCATCAGTGCCTACTGTCGTGAAGGACTTATTGAAGAAGCCTTCCAATTGTTGAATTTAATCTCGGGTAAGGGTATGGAATCGGGTCTTCTAACTTACAAT
GCTATCATAAATGGCCTGTGTAAGATAGGTAAGTTTGACAGGGCAAAGGATATTTTGAATGAGATGTTGCAACTTGGGTTAAGGCCTGATGCTACTTCGTATAACATGTT
GCTAGTTGAGATCTGCCGAAGAGATAATATTCTAGAAGCTCAAGAGATATTTGATGAAATGTCACGTCATGATGTTCTTCCTGATCTGGTTAGTTTTAGTTCTCTGATCG
GTGTACTTGCAAGGAATGGGCATCTTGATCGGGCTTTTATGTATTTTAGAGATATGAAAAGTATTGGTCTAGTGCCTGATAATGTTATTTATACAATTCTTATAGATGGG
TTTTGTCGAAATGGTGCTATTTCAGATGCTTTGAAAATGCGGGATGAAATGCTTGCTCGGGGTTGCTTTATGGATGTAGTTGCATATAATACTATTTTGAATGGATTATG
CAAGAAGAAGATGTTCGTTGATGCAGATATGTTATTTAATGAAATGGTTGAGAGGGGTGTGTTTCCCGACTTTTATACTTTCACCACGCTCATTCATGGATATTGCAAGG
TTGGAAATATGGATAAAGCGCTGAATTTGTTTGGAACAATGGTTCGTACGAACCTCAAGCCAGATATAGTGACATACAACACGCTGATTGATGGCTTTTGCAAAGTAGGT
GAAATGGGAAGGGCCAAGGAGTTGTGGGATGATATGGTTAGGAAAGATATTCTCCCTAACCACATTTCCTATGGAATTGTAATAAATGGTTTTTGTAGTTCAGGCCATTT
ATCTGAGGCGTGGCATTTGTGTGACCAGATGGTCGAGCAGGGTATCAAACCCAATCTCATCACTTGCAATACTTTAATTAAGGGATACTGCCAGTCTGGTGATATGCCAA
ATGCATATGAATATTTGAGCAAAATGATATCAAATGGAATAATTCCTGATAGCATCTCATATAATACTCTTATTGATGGATATTTAAAAGAAAATAACCTAGAAAAAGCT
TTTGTATTGATTAATGAGATGGAAAAACAAGGGCTCCAACTTGATGCTATTACATATAATGTAATTTTAAATGGATTCTGTGCAAGAGGAAGAATGCAAGAGGCTGAGCA
GGTATTAAGGAAAATGATTGAGAATGGTGTAAATCCTGACAGAGCCACATACTCTTCTCTGATAAATGGTCATGTCAGCCAAGACAGTATGAAGGATGCATTTCATTTCC
ATGATGAAATGCTCCAACGAGGACTTGTGCCTGATGATACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTCATCGACTTCCTCTCCCAAAATCCACCTTCAGAAGTGGAATTCTTACATCAACATTCACATCTGCAACTCCTTTACTTCGTGCTAATTTCATTTCTTTTAG
ACTTCACACTATCTATTCGCTCCCCAAATTTCATTCTAATACTCCCCTTCGATCCTTCCAATCCCATGGTACTCCAGAACCTCCTTCACTCTCTCCCTCGGATTCGTTTC
TAGTGGAAAAGACTTTGTTCAGTTTGAAGCAGAATAATGTAAGTTATTTGTCAAATTCTCTTTTCCGCCTGAACCCTTCACTTTTACTCCAAGTTCTCTGTAGGTGTCGT
GAAAATTTACATTTGGGCTTAAAATTTATTGGTTTACTTTCTTCTAATTGCCCGAATTTCAAGCATTCGTCACTTTCCTTGAGTGCAATGGTTCATTTTTTAGTGCGCGG
CAGGAGGCTCTCAGAAGCCCAAGCTTGCATTCTTAGGATGGTGAGGAAGAGCGGCGTCTCACGAGTTGAGGTCGTTGAATCCTTAGTCTCGACGTGTGGTAATTTTGGGT
CGATTGGTTTGGTTTCTGATTTGTTAATAAGGACTTATGTGCAAGCTAAAAATTTAAGAGAAGGGTCTGAAGCATTTCGAATTTTGAGGAGTAAAGGAGTTTCTGTTTCT
ATAAATGCTTGTAACAGTCTCCTTCGTGGTCTTGTGAAGATTGGGTGGGTTGATTTAGCTTTGGAAATATGTGGGGAAGTTGTGAGAGGAGGTATTGAGTTGAATGTTTA
TACATTGAACATTATGGTTAATGCTCTATGTAAAGACCACAAAATTGAGAATGTGAACTTGTTCTTATCAGATATGGAAGAGAAAGGAGTTTTTGCTGACATTGTGACAT
ATAATACTCTCATCAGTGCCTACTGTCGTGAAGGACTTATTGAAGAAGCCTTCCAATTGTTGAATTTAATCTCGGGTAAGGGTATGGAATCGGGTCTTCTAACTTACAAT
GCTATCATAAATGGCCTGTGTAAGATAGGTAAGTTTGACAGGGCAAAGGATATTTTGAATGAGATGTTGCAACTTGGGTTAAGGCCTGATGCTACTTCGTATAACATGTT
GCTAGTTGAGATCTGCCGAAGAGATAATATTCTAGAAGCTCAAGAGATATTTGATGAAATGTCACGTCATGATGTTCTTCCTGATCTGGTTAGTTTTAGTTCTCTGATCG
GTGTACTTGCAAGGAATGGGCATCTTGATCGGGCTTTTATGTATTTTAGAGATATGAAAAGTATTGGTCTAGTGCCTGATAATGTTATTTATACAATTCTTATAGATGGG
TTTTGTCGAAATGGTGCTATTTCAGATGCTTTGAAAATGCGGGATGAAATGCTTGCTCGGGGTTGCTTTATGGATGTAGTTGCATATAATACTATTTTGAATGGATTATG
CAAGAAGAAGATGTTCGTTGATGCAGATATGTTATTTAATGAAATGGTTGAGAGGGGTGTGTTTCCCGACTTTTATACTTTCACCACGCTCATTCATGGATATTGCAAGG
TTGGAAATATGGATAAAGCGCTGAATTTGTTTGGAACAATGGTTCGTACGAACCTCAAGCCAGATATAGTGACATACAACACGCTGATTGATGGCTTTTGCAAAGTAGGT
GAAATGGGAAGGGCCAAGGAGTTGTGGGATGATATGGTTAGGAAAGATATTCTCCCTAACCACATTTCCTATGGAATTGTAATAAATGGTTTTTGTAGTTCAGGCCATTT
ATCTGAGGCGTGGCATTTGTGTGACCAGATGGTCGAGCAGGGTATCAAACCCAATCTCATCACTTGCAATACTTTAATTAAGGGATACTGCCAGTCTGGTGATATGCCAA
ATGCATATGAATATTTGAGCAAAATGATATCAAATGGAATAATTCCTGATAGCATCTCATATAATACTCTTATTGATGGATATTTAAAAGAAAATAACCTAGAAAAAGCT
TTTGTATTGATTAATGAGATGGAAAAACAAGGGCTCCAACTTGATGCTATTACATATAATGTAATTTTAAATGGATTCTGTGCAAGAGGAAGAATGCAAGAGGCTGAGCA
GGTATTAAGGAAAATGATTGAGAATGGTGTAAATCCTGACAGAGCCACATACTCTTCTCTGATAAATGGTCATGTCAGCCAAGACAGTATGAAGGATGCATTTCATTTCC
ATGATGAAATGCTCCAACGAGGACTTGTGCCTGATGATACATTTTAA
Protein sequenceShow/hide protein sequence
MAAHRLPLPKSTFRSGILTSTFTSATPLLRANFISFRLHTIYSLPKFHSNTPLRSFQSHGTPEPPSLSPSDSFLVEKTLFSLKQNNVSYLSNSLFRLNPSLLLQVLCRCR
ENLHLGLKFIGLLSSNCPNFKHSSLSLSAMVHFLVRGRRLSEAQACILRMVRKSGVSRVEVVESLVSTCGNFGSIGLVSDLLIRTYVQAKNLREGSEAFRILRSKGVSVS
INACNSLLRGLVKIGWVDLALEICGEVVRGGIELNVYTLNIMVNALCKDHKIENVNLFLSDMEEKGVFADIVTYNTLISAYCREGLIEEAFQLLNLISGKGMESGLLTYN
AIINGLCKIGKFDRAKDILNEMLQLGLRPDATSYNMLLVEICRRDNILEAQEIFDEMSRHDVLPDLVSFSSLIGVLARNGHLDRAFMYFRDMKSIGLVPDNVIYTILIDG
FCRNGAISDALKMRDEMLARGCFMDVVAYNTILNGLCKKKMFVDADMLFNEMVERGVFPDFYTFTTLIHGYCKVGNMDKALNLFGTMVRTNLKPDIVTYNTLIDGFCKVG
EMGRAKELWDDMVRKDILPNHISYGIVINGFCSSGHLSEAWHLCDQMVEQGIKPNLITCNTLIKGYCQSGDMPNAYEYLSKMISNGIIPDSISYNTLIDGYLKENNLEKA
FVLINEMEKQGLQLDAITYNVILNGFCARGRMQEAEQVLRKMIENGVNPDRATYSSLINGHVSQDSMKDAFHFHDEMLQRGLVPDDTF