; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019586 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019586
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr5:43562578..43564599
RNA-Seq ExpressionLag0019586
SyntenyLag0019586
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140278.1 pentatricopeptide repeat-containing protein At3g62890 [Cucumis sativus]0.0e+0088.25Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        MHNGIRLSISIP P+  HLLFR LHSYSGS HID  P     PFKCSIS L    +L NLLQPL AP PPP+LSYAPVFQFLTG N+L+LGHQVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGL+PTALVGSKMVAFYASSGDIDSSVSVFN I EPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV+LLSVWMGKCVHGL+L
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R+GLQFDLYVATSLI +YGKCGEINDA  VFDNM +RDVS+WNALL GY K GC+DAA+AIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQ S LERGR+IHELA +MGLNSNASVLIALTAMYAKCGSLVDARNCFD+LNR+EK L+AWNTMITAYASYGHGL+AVSTF+E
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+TTYSI PR EHYACV DLLGRAGRLAEASK+V EMPMPAGPSIWGSLLAACRK+RNLEMAETA
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPENTGNYVLLSNMYAEAGRWQEV+KLRAI+KSQGTKKSPGCSWIE+NGKAHMFLGGDTSHPQ K+IY FLEALPEKMKAAGY PDTSYVLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT AETVLRVTKNLRICGDCHTAMVFISEIYGRE++VRD+NRFHHFKGG CSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

XP_008456075.1 PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis melo]0.0e+0088.4Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        MHNGIRLSIS  IP P  LLFR LHSYSGS HI+  P P    FKCSIS L    +L NLLQPL AP PPP+LSYAPVFQFLTG N+L+LGHQVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGL+PTALVGSKMVAFYASSGDIDSSVSVFN I EPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSS DLLSVWMGKCVHGL+L
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R+GL  DLYVATSLID+YGKCGEIN+A  VFDNM +RDVS+WNALL GYMK GCVDAAVAIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQ S LERG +IHELA +MGLNSNASVLIALTAMYAKCGSLVDARNCFD+LNRSEK L+AWNTMITAYASYGHGLEAVSTF+E
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+TTYSI PR EHYACV DLLGRAGRLAEASK+VDEMPMPAG SIWGSLLAACRK+RNLEMAE A
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPEN+GNYVLLSNMYAEAGRWQEV+KLRAI+KSQGTKKSPGCSWIE+NGKAHMFLGGDTSHPQAK+IY FLEALPEKMKAAGYVPDTSYVLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT AETVLRVTKNLRICGDCHTAMVFISEIYGRE++VRD+NRFHHFKGGSCSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

XP_022149333.1 pentatricopeptide repeat-containing protein At3g62890-like [Momordica charantia]0.0e+0089.46Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP-----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMI
        M NG+RLS+S PIPN  H LFR LHSYS SP ID+AP      PFKCSIS LSL     NLL+PL AP+PPPV+SYAP+FQFLTG NLL+LG Q+HAHM+
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP-----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMI

Query:  LRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISE-PSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGL
        LRG++PTALVGSK+VAFYASSGDIDSSVSVFNR S+ PSSLLFNSMIRAF+R+GFAERTVATYF MHSWGFTGDYFTFPFVLKSSVDLL VWMG+CVHG 
Subjt:  LRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISE-PSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGL

Query:  VLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK
        ++R+GLQFDLYVATSLIDMYGKCGEINDA  VFDNM VRDVS+WNALLGGYMKGG VDAAVAIF+R+PWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK
Subjt:  VLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK

Query:  EDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTF
        EDSGVRPNWVTIMSVLPACAQSSAL+RGR+IHELA +MGLNSNASVLIALTAMYAKCGSL DA+NCF+RLNRSE+ LVAWNTMITAYASYGHGLEAVSTF
Subjt:  EDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTF

Query:  QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAE
        QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF CM+TTYSI+PRAEHYACVVDLLGRAGRLAEASK+VDEMPMPAGPSIWGSLLAACRKYRNLEMAE
Subjt:  QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAE

Query:  TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVL
        TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAK+IY FLEALPEKMKAAGY+PDTSYVL
Subjt:  TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVL

Query:  HDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        HDISEEEKE NLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
Subjt:  HDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

XP_022922754.1 pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata]0.0e+0089.57Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        M NGIRLSI   IPNP  LLFR LHSY GS HID+AP     PFKCSIS  SL     NLLQPL AP+PPP+LSYA VFQFLTGQNLL+LG QVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGLEPTALVGSKMVAFYASSGDIDSSV+VFNRISEPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R GL+FDLYVATSLIDMYGKCGEINDA  VFD M VRDVS+WNALL GYMKGG +DAAVAIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQSSALERGRRIHELA +MGLNSNASVLIALTAMYAKCGSL DARNCF+RLNRSEK+LVAWNTMITAYASYGHG EAVSTFQE
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MI+AGI+PDDITFTGLLS CSHSGLVD+GL YF  M+TTYS  PRAEHYACVVDLLGRAGRLAEASK+VDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPENTGNYVLLSNMYAEAGRWQEV+KLRAIL SQGTKKSPGCSWIEVNG AHMFLGGDTSHPQ K+IY FLEALPEKMKAAGY PDTS+VLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT +ETVLRVTKNLRICGDCHTAMVFISEIYGRE+VVRDVNRFHHFK GSCSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

XP_038880377.1 pentatricopeptide repeat-containing protein At3g62890-like [Benincasa hispida]0.0e+0089.26Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP---SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMILR
        MHNGIRLSIS  IPNP HLLFR LHSYSGSP ID+ P   SPFKCSISR SL     NLL+PLFA +PPP+ SYAPVFQFLTG NLLRLGHQVHAHM+LR
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP---SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMILR

Query:  GLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLR
        GL+PT LVGSKMVAFYASSGDIDSS+SVFNRISEPSSLLFNSMIRA++RYGFAERT ATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHG+VLR
Subjt:  GLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLR

Query:  VGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDS
        +GLQFDLYVATSLIDMYGKCGEINDA  VFDNM VRDVSAWNALL  YMK GC+DAA+AIF+RMPWRNIVSWTTMISGYSQSGLAQQAL+LFDEM+KEDS
Subjt:  VGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDS

Query:  GVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEM
        G+RPNWVTIMSVLPACAQSSALERGR+IHELA +MGLNSNASVLIALTAMYAKCGSL DARNCFDRLNRSEK LVAWNTMITAYASYGHGLEAVSTF+EM
Subjt:  GVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEM

Query:  IQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAA
        IQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+T YSI PRAEHYACVVDLLGRAGRLAEASK++DEMPM AGPSIWGSLLAACRKYRNLEMAETAA
Subjt:  IQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAA

Query:  RKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDI
        R+LFVLEPENTGNYVLLSNMYAEAGRWQEV+K+RAILK QGT KSPG SWIE++GK HMFLGGDTSHPQAK+IY FLEALPEKMKAAGYVPDTSYVLHDI
Subjt:  RKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDI

Query:  SEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        SEEEKEFNLIAHSEKLAVAFGILNTSAET+LRVTKNLRICGDCHTAMVFISEIY RE+VVRDV RFHHFKGGSCSCGD+W
Subjt:  SEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KEZ1 DYW_deaminase domain-containing protein0.0e+0088.25Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        MHNGIRLSISIP P+  HLLFR LHSYSGS HID  P     PFKCSIS L    +L NLLQPL AP PPP+LSYAPVFQFLTG N+L+LGHQVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGL+PTALVGSKMVAFYASSGDIDSSVSVFN I EPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV+LLSVWMGKCVHGL+L
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R+GLQFDLYVATSLI +YGKCGEINDA  VFDNM +RDVS+WNALL GY K GC+DAA+AIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQ S LERGR+IHELA +MGLNSNASVLIALTAMYAKCGSLVDARNCFD+LNR+EK L+AWNTMITAYASYGHGL+AVSTF+E
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+TTYSI PR EHYACV DLLGRAGRLAEASK+V EMPMPAGPSIWGSLLAACRK+RNLEMAETA
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPENTGNYVLLSNMYAEAGRWQEV+KLRAI+KSQGTKKSPGCSWIE+NGKAHMFLGGDTSHPQ K+IY FLEALPEKMKAAGY PDTSYVLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT AETVLRVTKNLRICGDCHTAMVFISEIYGRE++VRD+NRFHHFKGG CSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

A0A1S3C2H6 pentatricopeptide repeat-containing protein At3g62890-like0.0e+0088.4Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        MHNGIRLSIS  IP P  LLFR LHSYSGS HI+  P P    FKCSIS L    +L NLLQPL AP PPP+LSYAPVFQFLTG N+L+LGHQVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGL+PTALVGSKMVAFYASSGDIDSSVSVFN I EPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSS DLLSVWMGKCVHGL+L
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R+GL  DLYVATSLID+YGKCGEIN+A  VFDNM +RDVS+WNALL GYMK GCVDAAVAIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQ S LERG +IHELA +MGLNSNASVLIALTAMYAKCGSLVDARNCFD+LNRSEK L+AWNTMITAYASYGHGLEAVSTF+E
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+TTYSI PR EHYACV DLLGRAGRLAEASK+VDEMPMPAG SIWGSLLAACRK+RNLEMAE A
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPEN+GNYVLLSNMYAEAGRWQEV+KLRAI+KSQGTKKSPGCSWIE+NGKAHMFLGGDTSHPQAK+IY FLEALPEKMKAAGYVPDTSYVLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT AETVLRVTKNLRICGDCHTAMVFISEIYGRE++VRD+NRFHHFKGGSCSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

A0A5A7TCM1 Pentatricopeptide repeat-containing protein0.0e+0088.11Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        MHNGIRLSIS  IP P  LLFR LHSYSGS HI+  P P    FKCSIS L    +L NLLQPL AP PPP+LSYAPVFQFLTG N+L+LGHQVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSP----FKCSISRL----SLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGL+PTALVGSKMVAFYASSGDIDSSVSVFN I EPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSS DLLSVWMGKCVHGL+L
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R+GL  DLYVATSLID+YGKCGEIN+A  VFDNM +RDVS+WNALL GYMK GC+DAAVAIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQ S LERG +IHELA +MGLNSNASVLIALTAMYAKCGSLVDARNCFD+LNRSEK L+AWNTMITAYASYGHGLEAVSTF+E
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF  M+TTYSI PR EHYACV DLLGRAGRLAEASK+VDEMPMPAG SIWGSLLAACRK+RNLEMAE A
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPEN+GNYVLLSNMYAEAGRWQEV+KLRAI+KSQGTKKSPGCSWIE+NGKAHMFLGGDTSHPQ K+IY FLEALPEKMKAAGYVPDTSYVLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT AETVLRVTKNLRICGDCHTAMVFISEIYGRE++VRD+NRFHHFKGGSCSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

A0A6J1D6I5 pentatricopeptide repeat-containing protein At3g62890-like0.0e+0089.46Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP-----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMI
        M NG+RLS+S PIPN  H LFR LHSYS SP ID+AP      PFKCSIS LSL     NLL+PL AP+PPPV+SYAP+FQFLTG NLL+LG Q+HAHM+
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP-----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMI

Query:  LRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISE-PSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGL
        LRG++PTALVGSK+VAFYASSGDIDSSVSVFNR S+ PSSLLFNSMIRAF+R+GFAERTVATYF MHSWGFTGDYFTFPFVLKSSVDLL VWMG+CVHG 
Subjt:  LRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISE-PSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGL

Query:  VLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK
        ++R+GLQFDLYVATSLIDMYGKCGEINDA  VFDNM VRDVS+WNALLGGYMKGG VDAAVAIF+R+PWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK
Subjt:  VLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLK

Query:  EDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTF
        EDSGVRPNWVTIMSVLPACAQSSAL+RGR+IHELA +MGLNSNASVLIALTAMYAKCGSL DA+NCF+RLNRSE+ LVAWNTMITAYASYGHGLEAVSTF
Subjt:  EDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTF

Query:  QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAE
        QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYF CM+TTYSI+PRAEHYACVVDLLGRAGRLAEASK+VDEMPMPAGPSIWGSLLAACRKYRNLEMAE
Subjt:  QEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAE

Query:  TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVL
        TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAK+IY FLEALPEKMKAAGY+PDTSYVL
Subjt:  TAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVL

Query:  HDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        HDISEEEKE NLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
Subjt:  HDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

A0A6J1E4B5 pentatricopeptide repeat-containing protein At3g62890-like0.0e+0089.57Show/hide
Query:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL
        M NGIRLSI   IPNP  LLFR LHSY GS HID+AP     PFKCSIS  SL     NLLQPL AP+PPP+LSYA VFQFLTGQNLL+LG QVHAHM+L
Subjt:  MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAP----SPFKCSISRLSLC----NLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMIL

Query:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
        RGLEPTALVGSKMVAFYASSGDIDSSV+VFNRISEPSSLLFNSMIRA+ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL
Subjt:  RGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVL

Query:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
        R GL+FDLYVATSLIDMYGKCGEINDA  VFD M VRDVS+WNALL GYMKGG +DAAVAIF+RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED
Subjt:  RVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKED

Query:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE
        SGVRPNWVTIMSVLPACAQSSALERGRRIHELA +MGLNSNASVLIALTAMYAKCGSL DARNCF+RLNRSEK+LVAWNTMITAYASYGHG EAVSTFQE
Subjt:  SGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQE

Query:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
        MI+AGI+PDDITFTGLLS CSHSGLVD+GL YF  M+TTYS  PRAEHYACVVDLLGRAGRLAEASK+VDEMPMPAGPSIWGSLLAACRKYRNLEMAETA
Subjt:  MIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETA

Query:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD
        ARKLFVLEPENTGNYVLLSNMYAEAGRWQEV+KLRAIL SQGTKKSPGCSWIEVNG AHMFLGGDTSHPQ K+IY FLEALPEKMKAAGY PDTS+VLHD
Subjt:  ARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHD

Query:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        ISEEEKEFNLIAHSEKLAVAFGILNT +ETVLRVTKNLRICGDCHTAMVFISEIYGRE+VVRDVNRFHHFK GSCSCGDYW
Subjt:  ISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic5.4e-13540.67Show/hide
Query:  LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLL
        L LG  +H   +   +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  F + G  ++ +  +  M S      + T   VL +   + 
Subjt:  LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLL

Query:  SVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQ
        ++  G+ V   +    +  +L +A +++DMY KCG I DA  +FD M  +D   W  +L GY      +AA  +   MP ++IV+W  +IS Y Q+G   
Subjt:  SVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQ

Query:  QALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYAS
        +AL +F E L+    ++ N +T++S L ACAQ  ALE GR IH    + G+  N  V  AL  MY+KCG L  +R  F+ + +  + +  W+ MI   A 
Subjt:  QALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYAS

Query:  YGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAA
        +G G EAV  F +M +A ++P+ +TFT +   CSH+GLVD     F  M + Y I P  +HYAC+VD+LGR+G L +A K ++ MP+P   S+WG+LL A
Subjt:  YGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAA

Query:  CRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKA
        C+ + NL +AE A  +L  LEP N G +VLLSN+YA+ G+W+ V++LR  ++  G KK PGCS IE++G  H FL GD +HP ++ +Y  L  + EK+K+
Subjt:  CRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKA

Query:  AGYVPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
         GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T A  V+RV KNLR+CGDCH+    IS++Y REI+VRD  RFHHF+ G CSC D+W
Subjt:  AGYVPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

P0C899 Putative pentatricopeptide repeat-containing protein At3g491429.2e-14341.82Show/hide
Query:  FLTGQNL-----LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFT
        FL GQ L     +R    VH+ +IL  L   + +G K++  YAS  D+ S+  VF+ I E + ++ N MIR++   GF    V  + +M       D++T
Subjt:  FLTGQNL-----LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFT

Query:  FPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAI--------------
        FP VLK+     ++ +G+ +HG   +VGL   L+V   L+ MYGKCG +++A +V D M  RDV +WN+L+ GY +    D A+ +              
Subjt:  FPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAI--------------

Query:  -----------------------FKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL
                               F +M  +++VSW  MI  Y ++ +  +A+ L+  M  E  G  P+ V+I SVLPAC  +SAL  G++IH    +  L
Subjt:  -----------------------FKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL

Query:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT
          N  +  AL  MYAKCG L  AR+ F+  N   + +V+W  MI+AY   G G +AV+ F ++  +G+ PD I F   L+ CSH+GL++ G   FK M  
Subjt:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT

Query:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL
         Y I PR EH AC+VDLLGRAG++ EA + + +M M     +WG+LL ACR + + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R I+
Subjt:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL

Query:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAE-----TVLR
        KS+G KK+PG S +EVN   H FL GD SHPQ+ +IY  L+ L +KMK  GYVPD+   LHD+ EE+KE +L  HSEKLA+ F ++NT  E       +R
Subjt:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAE-----TVLR

Query:  VTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        +TKNLRICGDCH A   IS+I  REI++RD NRFH F+ G CSCGDYW
Subjt:  VTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.0e-13438.73Show/hide
Query:  VHAHMILRGLEPTALVGSKMVAFYASSGDIDS---SVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVW
        +HA MI  GL  T    SK++ F   S   +    ++SVF  I EP+ L++N+M R  A        +  Y  M S G   + +TFPFVLKS     +  
Subjt:  VHAHMILRGLEPTALVGSKMVAFYASSGDIDS---SVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVW

Query:  MGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQAL
         G+ +HG VL++G   DLYV TSLI MY + G + DA  VFD    RDV ++ AL+ GY   G ++ A  +F  +P +++VSW  MISGY+++G  ++AL
Subjt:  MGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQAL

Query:  SLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRL----------------------
         LF +M+K  + VRP+  T+++V+ ACAQS ++E GR++H      G  SN  ++ AL  +Y+KCG L  A   F+RL                      
Subjt:  SLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRL----------------------

Query:  ----------------------------------------------------NRS---------------------------EKTLVAWNTMITAYASYG
                                                            N S                            K+L +WN MI  +A +G
Subjt:  ----------------------------------------------------NRS---------------------------EKTLVAWNTMITAYASYG

Query:  HGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACR
            +   F  M + GIQPDDITF GLLS CSHSG++D+G   F+ M   Y + P+ EHY C++DLLG +G   EA ++++ M M     IW SLL AC+
Subjt:  HGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACR

Query:  KYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAG
         + N+E+ E+ A  L  +EPEN G+YVLLSN+YA AGRW EV K RA+L  +G KK PGCS IE++   H F+ GD  HP+ ++IY  LE +   ++ AG
Subjt:  KYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAG

Query:  YVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        +VPDTS VL ++ EE KE  L  HSEKLA+AFG+++T   T L + KNLR+C +CH A   IS+IY REI+ RD  RFHHF+ G CSC DYW
Subjt:  YVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202307.5e-13740.44Show/hide
Query:  VFQFLTGQNLLRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEP-----SSLL------------------------------FN
        +F+     +  ++G Q+H    + GL+  A V   M   Y   G +  +  VF+R+S+      S+LL                              +N
Subjt:  VFQFLTGQNLLRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEP-----SSLL------------------------------FN

Query:  SMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAW
         ++  F R G+ +  V  +  +H  GF  D  T   VL S  D   + MG+ +HG V++ GL  D  V +++IDMYGK G +     +F+   + +    
Subjt:  SMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAW

Query:  NALLGGYMKGGCVDAAVAIFKRMPWR----NIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL
        NA + G  + G VD A+ +F+    +    N+VSWT++I+G +Q+G   +AL LF EM  + +GV+PN VTI S+LPAC   +AL  GR  H  A ++ L
Subjt:  NALLGGYMKGGCVDAAVAIFKRMPWR----NIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL

Query:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT
          N  V  AL  MYAKCG +  ++  F+ +    K LV WN+++  ++ +G   E +S F+ +++  ++PD I+FT LLS C   GL D G KYFK M+ 
Subjt:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT

Query:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL
         Y I+PR EHY+C+V+LLGRAG+L EA  ++ EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+YA  G W EV+ +R  +
Subjt:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL

Query:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNL
        +S G KK+PGCSWI+V  + +  L GD SHPQ   I   ++ + ++M+ +G+ P+  + LHD+ E+E+E  L  HSEKLAV FG+LNT   T L+V KNL
Subjt:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNL

Query:  RICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        RICGDCH  + FIS   GREI +RD NRFHHFK G CSCGD+W
Subjt:  RICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic4.4e-13741.15Show/hide
Query:  QVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV----DLLS
        +VH H++  G +    + +K++  Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+ +VLK+ V     +  
Subjt:  QVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV----DLLS

Query:  VWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQ
        +  GK +H  + R G    +Y+ T+L+DMY +                                GCVD A  +F  MP RN+VSW+ MI+ Y+++G A +
Subjt:  VWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQ

Query:  ALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASY
        AL  F EM++E     PN VT++SVL ACA  +ALE+G+ IH    + GL+S   V+ AL  MY +CG L   +  FDR++  ++ +V+WN++I++Y  +
Subjt:  ALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASY

Query:  GHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAAC
        G+G +A+  F+EM+  G  P  +TF  +L  CSH GLV+ G + F+ M   + I+P+ EHYAC+VDLLGRA RL EA+K+V +M    GP +WGSLL +C
Subjt:  GHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAAC

Query:  RKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAA
        R + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV +++ +L+ +G +K PG  W+EV  K + F+  D  +P  + I+ FL  L E MK  
Subjt:  RKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAA

Query:  GYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        GY+P T  VL+++  EEKE  ++ HSEKLA+AFG++NTS    +R+TKNLR+C DCH    FIS+   +EI+VRDVNRFH FK G CSCGDYW
Subjt:  GYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-13538.73Show/hide
Query:  VHAHMILRGLEPTALVGSKMVAFYASSGDIDS---SVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVW
        +HA MI  GL  T    SK++ F   S   +    ++SVF  I EP+ L++N+M R  A        +  Y  M S G   + +TFPFVLKS     +  
Subjt:  VHAHMILRGLEPTALVGSKMVAFYASSGDIDS---SVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVW

Query:  MGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQAL
         G+ +HG VL++G   DLYV TSLI MY + G + DA  VFD    RDV ++ AL+ GY   G ++ A  +F  +P +++VSW  MISGY+++G  ++AL
Subjt:  MGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQAL

Query:  SLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRL----------------------
         LF +M+K  + VRP+  T+++V+ ACAQS ++E GR++H      G  SN  ++ AL  +Y+KCG L  A   F+RL                      
Subjt:  SLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRL----------------------

Query:  ----------------------------------------------------NRS---------------------------EKTLVAWNTMITAYASYG
                                                            N S                            K+L +WN MI  +A +G
Subjt:  ----------------------------------------------------NRS---------------------------EKTLVAWNTMITAYASYG

Query:  HGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACR
            +   F  M + GIQPDDITF GLLS CSHSG++D+G   F+ M   Y + P+ EHY C++DLLG +G   EA ++++ M M     IW SLL AC+
Subjt:  HGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACR

Query:  KYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAG
         + N+E+ E+ A  L  +EPEN G+YVLLSN+YA AGRW EV K RA+L  +G KK PGCS IE++   H F+ GD  HP+ ++IY  LE +   ++ AG
Subjt:  KYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAG

Query:  YVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        +VPDTS VL ++ EE KE  L  HSEKLA+AFG+++T   T L + KNLR+C +CH A   IS+IY REI+ RD  RFHHF+ G CSC DYW
Subjt:  YVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein5.3e-13840.44Show/hide
Query:  VFQFLTGQNLLRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEP-----SSLL------------------------------FN
        +F+     +  ++G Q+H    + GL+  A V   M   Y   G +  +  VF+R+S+      S+LL                              +N
Subjt:  VFQFLTGQNLLRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEP-----SSLL------------------------------FN

Query:  SMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAW
         ++  F R G+ +  V  +  +H  GF  D  T   VL S  D   + MG+ +HG V++ GL  D  V +++IDMYGK G +     +F+   + +    
Subjt:  SMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAW

Query:  NALLGGYMKGGCVDAAVAIFKRMPWR----NIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL
        NA + G  + G VD A+ +F+    +    N+VSWT++I+G +Q+G   +AL LF EM  + +GV+PN VTI S+LPAC   +AL  GR  H  A ++ L
Subjt:  NALLGGYMKGGCVDAAVAIFKRMPWR----NIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL

Query:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT
          N  V  AL  MYAKCG +  ++  F+ +    K LV WN+++  ++ +G   E +S F+ +++  ++PD I+FT LLS C   GL D G KYFK M+ 
Subjt:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT

Query:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL
         Y I+PR EHY+C+V+LLGRAG+L EA  ++ EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+YA  G W EV+ +R  +
Subjt:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL

Query:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNL
        +S G KK+PGCSWI+V  + +  L GD SHPQ   I   ++ + ++M+ +G+ P+  + LHD+ E+E+E  L  HSEKLAV FG+LNT   T L+V KNL
Subjt:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNL

Query:  RICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        RICGDCH  + FIS   GREI +RD NRFHHFK G CSCGD+W
Subjt:  RICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-13640.67Show/hide
Query:  LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLL
        L LG  +H   +   +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  F + G  ++ +  +  M S      + T   VL +   + 
Subjt:  LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLL

Query:  SVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQ
        ++  G+ V   +    +  +L +A +++DMY KCG I DA  +FD M  +D   W  +L GY      +AA  +   MP ++IV+W  +IS Y Q+G   
Subjt:  SVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQ

Query:  QALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYAS
        +AL +F E L+    ++ N +T++S L ACAQ  ALE GR IH    + G+  N  V  AL  MY+KCG L  +R  F+ + +  + +  W+ MI   A 
Subjt:  QALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYAS

Query:  YGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAA
        +G G EAV  F +M +A ++P+ +TFT +   CSH+GLVD     F  M + Y I P  +HYAC+VD+LGR+G L +A K ++ MP+P   S+WG+LL A
Subjt:  YGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAA

Query:  CRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKA
        C+ + NL +AE A  +L  LEP N G +VLLSN+YA+ G+W+ V++LR  ++  G KK PGCS IE++G  H FL GD +HP ++ +Y  L  + EK+K+
Subjt:  CRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKA

Query:  AGYVPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
         GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T A  V+RV KNLR+CGDCH+    IS++Y REI+VRD  RFHHF+ G CSC D+W
Subjt:  AGYVPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-13841.15Show/hide
Query:  QVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV----DLLS
        +VH H++  G +    + +K++  Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+ +VLK+ V     +  
Subjt:  QVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSV----DLLS

Query:  VWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQ
        +  GK +H  + R G    +Y+ T+L+DMY +                                GCVD A  +F  MP RN+VSW+ MI+ Y+++G A +
Subjt:  VWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQ

Query:  ALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASY
        AL  F EM++E     PN VT++SVL ACA  +ALE+G+ IH    + GL+S   V+ AL  MY +CG L   +  FDR++  ++ +V+WN++I++Y  +
Subjt:  ALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGLNSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASY

Query:  GHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAAC
        G+G +A+  F+EM+  G  P  +TF  +L  CSH GLV+ G + F+ M   + I+P+ EHYAC+VDLLGRA RL EA+K+V +M    GP +WGSLL +C
Subjt:  GHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAAC

Query:  RKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAA
        R + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV +++ +L+ +G +K PG  W+EV  K + F+  D  +P  + I+ FL  L E MK  
Subjt:  RKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAA

Query:  GYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        GY+P T  VL+++  EEKE  ++ HSEKLA+AFG++NTS    +R+TKNLR+C DCH    FIS+   +EI+VRDVNRFH FK G CSCGDYW
Subjt:  GYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-14441.82Show/hide
Query:  FLTGQNL-----LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFT
        FL GQ L     +R    VH+ +IL  L   + +G K++  YAS  D+ S+  VF+ I E + ++ N MIR++   GF    V  + +M       D++T
Subjt:  FLTGQNL-----LRLGHQVHAHMILRGLEPTALVGSKMVAFYASSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFT

Query:  FPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAI--------------
        FP VLK+     ++ +G+ +HG   +VGL   L+V   L+ MYGKCG +++A +V D M  RDV +WN+L+ GY +    D A+ +              
Subjt:  FPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDASIVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAI--------------

Query:  -----------------------FKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL
                               F +M  +++VSW  MI  Y ++ +  +A+ L+  M  E  G  P+ V+I SVLPAC  +SAL  G++IH    +  L
Subjt:  -----------------------FKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL

Query:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT
          N  +  AL  MYAKCG L  AR+ F+  N   + +V+W  MI+AY   G G +AV+ F ++  +G+ PD I F   L+ CSH+GL++ G   FK M  
Subjt:  NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNT

Query:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL
         Y I PR EH AC+VDLLGRAG++ EA + + +M M     +WG+LL ACR + + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R I+
Subjt:  TYSIEPRAEHYACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAIL

Query:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAE-----TVLR
        KS+G KK+PG S +EVN   H FL GD SHPQ+ +IY  L+ L +KMK  GYVPD+   LHD+ EE+KE +L  HSEKLA+ F ++NT  E       +R
Subjt:  KSQGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAE-----TVLR

Query:  VTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW
        +TKNLRICGDCH A   IS+I  REI++RD NRFH F+ G CSCGDYW
Subjt:  VTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFHHFKGGSCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAATGGCATCCGGTTATCCATCTCCATCCCAATCCCAAACCCCATTCATCTTCTCTTCCGAACCCTCCATTCTTACTCTGGTTCCCCTCACATCGACGTTGCCCC
TTCACCATTCAAATGCTCAATCTCGCGCCTTTCTCTCTGCAACCTCCTGCAACCGCTCTTTGCGCCAAACCCACCTCCGGTTCTTTCTTATGCGCCCGTTTTCCAGTTCC
TCACCGGTCAAAACCTGTTGAGATTGGGCCACCAGGTTCACGCCCACATGATTCTCCGTGGCCTCGAGCCCACCGCGCTTGTCGGCTCCAAGATGGTTGCGTTTTATGCG
AGTTCTGGTGATATTGATTCCTCTGTCTCGGTTTTCAATCGCATTAGTGAGCCTTCTTCTCTCTTGTTTAATTCCATGATTCGGGCCTTTGCGCGATATGGGTTTGCGGA
GAGGACTGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACGGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGTTTGGATGG
GGAAATGTGTTCATGGACTGGTTTTGAGAGTTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGGGAGATCAACGATGCGAGT
ATTGTGTTTGATAATATGATTGTTAGAGATGTTTCGGCTTGGAATGCCTTACTTGGTGGTTACATGAAGGGTGGGTGTGTTGACGCTGCTGTGGCGATTTTCAAGAGAAT
GCCGTGGAGGAATATTGTCTCTTGGACGACTATGATATCTGGATACTCACAGAGCGGCCTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGTTGAAAGAAGATTCAG
GCGTAAGACCCAACTGGGTGACTATAATGAGTGTCCTCCCAGCTTGTGCGCAGTCATCGGCACTCGAACGTGGAAGGCGGATTCATGAGTTGGCTTCTCAGATGGGTTTG
AATTCAAATGCTTCTGTGCTGATAGCCCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGTCGATGCTCGCAACTGTTTCGACAGGCTTAATAGAAGTGAAAAGACTTT
GGTTGCTTGGAATACCATGATAACTGCTTATGCTTCTTATGGGCATGGACTTGAAGCAGTGTCAACCTTTCAAGAGATGATCCAAGCAGGCATTCAGCCCGACGACATTA
CATTTACAGGATTGTTATCGGGTTGCAGTCATTCGGGTCTTGTTGATGTCGGTTTAAAGTATTTCAAATGCATGAACACCACTTATTCGATTGAACCCCGAGCTGAGCAT
TATGCTTGTGTTGTTGATCTCTTGGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAAATTGTAGACGAAATGCCAATGCCAGCAGGACCAAGCATTTGGGGTTCGCTATT
GGCCGCCTGTCGAAAATATCGCAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTATTTGTCCTAGAACCAGAAAACACAGGCAACTATGTCCTGCTCTCAAACATGT
ATGCTGAAGCTGGAAGGTGGCAGGAAGTTAACAAACTGAGAGCAATTCTGAAATCTCAGGGAACAAAGAAAAGTCCGGGTTGCAGTTGGATTGAGGTCAATGGCAAAGCA
CATATGTTTCTCGGTGGCGATACATCCCACCCTCAAGCCAAGGATATCTACACGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATGTTCCTGATACAAG
CTATGTGTTGCACGATATCAGCGAGGAAGAGAAAGAATTCAACCTCATTGCACATAGTGAGAAGCTTGCTGTTGCTTTCGGGATTCTCAACACTTCAGCTGAAACCGTTC
TCCGGGTGACGAAGAACTTGAGAATCTGTGGGGACTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGGCGGGAAATAGTTGTTAGAGATGTGAACCGGTTCCAT
CACTTTAAAGGTGGTTCTTGTTCTTGTGGAGATTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAATGGCATCCGGTTATCCATCTCCATCCCAATCCCAAACCCCATTCATCTTCTCTTCCGAACCCTCCATTCTTACTCTGGTTCCCCTCACATCGACGTTGCCCC
TTCACCATTCAAATGCTCAATCTCGCGCCTTTCTCTCTGCAACCTCCTGCAACCGCTCTTTGCGCCAAACCCACCTCCGGTTCTTTCTTATGCGCCCGTTTTCCAGTTCC
TCACCGGTCAAAACCTGTTGAGATTGGGCCACCAGGTTCACGCCCACATGATTCTCCGTGGCCTCGAGCCCACCGCGCTTGTCGGCTCCAAGATGGTTGCGTTTTATGCG
AGTTCTGGTGATATTGATTCCTCTGTCTCGGTTTTCAATCGCATTAGTGAGCCTTCTTCTCTCTTGTTTAATTCCATGATTCGGGCCTTTGCGCGATATGGGTTTGCGGA
GAGGACTGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACGGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGTTTGGATGG
GGAAATGTGTTCATGGACTGGTTTTGAGAGTTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGGGAGATCAACGATGCGAGT
ATTGTGTTTGATAATATGATTGTTAGAGATGTTTCGGCTTGGAATGCCTTACTTGGTGGTTACATGAAGGGTGGGTGTGTTGACGCTGCTGTGGCGATTTTCAAGAGAAT
GCCGTGGAGGAATATTGTCTCTTGGACGACTATGATATCTGGATACTCACAGAGCGGCCTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGTTGAAAGAAGATTCAG
GCGTAAGACCCAACTGGGTGACTATAATGAGTGTCCTCCCAGCTTGTGCGCAGTCATCGGCACTCGAACGTGGAAGGCGGATTCATGAGTTGGCTTCTCAGATGGGTTTG
AATTCAAATGCTTCTGTGCTGATAGCCCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGTCGATGCTCGCAACTGTTTCGACAGGCTTAATAGAAGTGAAAAGACTTT
GGTTGCTTGGAATACCATGATAACTGCTTATGCTTCTTATGGGCATGGACTTGAAGCAGTGTCAACCTTTCAAGAGATGATCCAAGCAGGCATTCAGCCCGACGACATTA
CATTTACAGGATTGTTATCGGGTTGCAGTCATTCGGGTCTTGTTGATGTCGGTTTAAAGTATTTCAAATGCATGAACACCACTTATTCGATTGAACCCCGAGCTGAGCAT
TATGCTTGTGTTGTTGATCTCTTGGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAAATTGTAGACGAAATGCCAATGCCAGCAGGACCAAGCATTTGGGGTTCGCTATT
GGCCGCCTGTCGAAAATATCGCAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTATTTGTCCTAGAACCAGAAAACACAGGCAACTATGTCCTGCTCTCAAACATGT
ATGCTGAAGCTGGAAGGTGGCAGGAAGTTAACAAACTGAGAGCAATTCTGAAATCTCAGGGAACAAAGAAAAGTCCGGGTTGCAGTTGGATTGAGGTCAATGGCAAAGCA
CATATGTTTCTCGGTGGCGATACATCCCACCCTCAAGCCAAGGATATCTACACGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATGTTCCTGATACAAG
CTATGTGTTGCACGATATCAGCGAGGAAGAGAAAGAATTCAACCTCATTGCACATAGTGAGAAGCTTGCTGTTGCTTTCGGGATTCTCAACACTTCAGCTGAAACCGTTC
TCCGGGTGACGAAGAACTTGAGAATCTGTGGGGACTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGGCGGGAAATAGTTGTTAGAGATGTGAACCGGTTCCAT
CACTTTAAAGGTGGTTCTTGTTCTTGTGGAGATTACTGGTGA
Protein sequenceShow/hide protein sequence
MHNGIRLSISIPIPNPIHLLFRTLHSYSGSPHIDVAPSPFKCSISRLSLCNLLQPLFAPNPPPVLSYAPVFQFLTGQNLLRLGHQVHAHMILRGLEPTALVGSKMVAFYA
SSGDIDSSVSVFNRISEPSSLLFNSMIRAFARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRVGLQFDLYVATSLIDMYGKCGEINDAS
IVFDNMIVRDVSAWNALLGGYMKGGCVDAAVAIFKRMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELASQMGL
NSNASVLIALTAMYAKCGSLVDARNCFDRLNRSEKTLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFKCMNTTYSIEPRAEH
YACVVDLLGRAGRLAEASKIVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKSQGTKKSPGCSWIEVNGKA
HMFLGGDTSHPQAKDIYTFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIVVRDVNRFH
HFKGGSCSCGDYW