; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g34150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g34150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGDSL-like Lipase/Acylhydrolase superfamily protein
Genome locationchr8:24910042..24933058
RNA-Seq ExpressionMoc08g34150
SyntenyMoc08g34150
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR001087 - GDSL lipase/esterase
IPR001202 - WW domain
IPR002885 - Pentatricopeptide repeat
IPR004332 - Transposase, MuDR, plant
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035669 - GDSL lipase/esterase-like, plant
IPR035979 - RNA-binding domain superfamily
IPR036020 - WW domain superfamily
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157018.1 CUGBP Elav-like family member 4 [Momordica charantia]1.0e-28779.05Show/hide
Query:  MRSEEADEYLVAGVKKLAEAAAAEEGETSF---------------------------TVVEAIFMDDLSSSLVYVTVDDGRVLGISPLDVDHVVVANKIK
        MRSEEADEYLVAGVKKLAEAAAAEEG  S                            ++   +     SSS+     + GRV   S + +   +  ++  
Subjt:  MRSEEADEYLVAGVKKLAEAAAAEEGETSF---------------------------TVVEAIFMDDLSSSLVYVTVDDGRVLGISPLDVDHVVVANKIK

Query:  TNIVMYGHWLSHLVKENFHGSRARKTKQQVGNDEGAAAIGANTISEYGR--------------------------------LSNNWNQPEFHNHQPEYRH
        ++      W S   + NF+  +     Q   +  G      N   +Y                                    NNWNQPEFHNHQPEYRH
Subjt:  TNIVMYGHWLSHLVKENFHGSRARKTKQQVGNDEGAAAIGANTISEYGR--------------------------------LSNNWNQPEFHNHQPEYRH

Query:  QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS
        QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS
Subjt:  QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS

Query:  IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN
        IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN
Subjt:  IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN

Query:  GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI
        GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI
Subjt:  GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI

Query:  QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG
        QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG
Subjt:  QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG

Query:  FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD
        FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD
Subjt:  FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD

XP_022157024.1 putative pentatricopeptide repeat-containing protein At1g02420 isoform X1 [Momordica charantia]7.8e-26499.56Show/hide
Query:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
        MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
Subjt:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH

Query:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
        TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
Subjt:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK

Query:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
        SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
Subjt:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM

Query:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
        KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
Subjt:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE

Query:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        LWNDMVVRGFGSYILVSEELFDLL DLGKLVEAEMCFLQMVDKGHKPSNVSFKRIK+
Subjt:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

XP_022157025.1 putative pentatricopeptide repeat-containing protein At1g02420 isoform X2 [Momordica charantia]7.8e-26499.56Show/hide
Query:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
        MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
Subjt:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH

Query:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
        TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
Subjt:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK

Query:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
        SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
Subjt:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM

Query:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
        KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
Subjt:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE

Query:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        LWNDMVVRGFGSYILVSEELFDLL DLGKLVEAEMCFLQMVDKGHKPSNVSFKRIK+
Subjt:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

XP_022948942.1 putative pentatricopeptide repeat-containing protein At1g02420 [Cucurbita moschata]1.6e-24091.11Show/hide
Query:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS
        RYF P PLSN   H FS+A+ENQSLN  V+T+FRIISSS SS NMR SLKS+RVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH+AFS D+
Subjt:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS

Query:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL
        MLYILGR+RKFEKIWDVL+D+KLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDV CFNALLRTLCQEKSMTDARNVYHG+KSKFRPNL
Subjt:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL

Query:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
        QTFNILLSGWKSSEEAEGFFDEMREMGV+PDVVSYNCLVDVYCKNREMDKA+KV+E+M+DEDI ADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
Subjt:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP

Query:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV
        DVAAYNAAIRNFCIAKRLREAFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD GCLPNTQSCLFLMR FK+HEKVEMAL+LWNDMV 
Subjt:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV

Query:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        RGFGSYILVSEELFDLLCDLGKL+EAE CFLQMVDKGHKPSNVSFKRIK+
Subjt:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

XP_038903108.1 putative pentatricopeptide repeat-containing protein At1g02420 [Benincasa hispida]9.2e-24191.78Show/hide
Query:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS
        RYF PIPLSN FSH FSSA+ENQSLN  VET+FRII+SS SS +MR SLKS+RVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVD+
Subjt:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS

Query:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL
        MLYILGR RKFEKIWDVLVD+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDV CFNALLRTLCQEKSM DARNVYH +KSKFRPNL
Subjt:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL

Query:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
        QTFNILLSGWKSSEEAEGFF+EMREMGVKPDVVSYNCLVDVYCKNREMDKA+KV+E+M+DEDI ADVITYTS+IGGLGLIGQPDKARNILKEMKEYGCYP
Subjt:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP

Query:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV
        DVAAYNAAIRNFCIAKRLR AFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDT CLPNTQSCLFLMRLFKKHE VEMALELWNDM+ 
Subjt:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV

Query:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        RGFGSYILVSEELFD LCDLGKL+EAE CFLQMVDKGHKPSNVSFKRIK+
Subjt:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

TrEMBL top hitse value%identityAlignment
A0A6J1DS00 putative pentatricopeptide repeat-containing protein At1g02420 isoform X23.8e-26499.56Show/hide
Query:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
        MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
Subjt:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH

Query:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
        TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
Subjt:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK

Query:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
        SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
Subjt:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM

Query:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
        KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
Subjt:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE

Query:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        LWNDMVVRGFGSYILVSEELFDLL DLGKLVEAEMCFLQMVDKGHKPSNVSFKRIK+
Subjt:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

A0A6J1DSB0 CUGBP Elav-like family member 44.9e-28879.05Show/hide
Query:  MRSEEADEYLVAGVKKLAEAAAAEEGETSF---------------------------TVVEAIFMDDLSSSLVYVTVDDGRVLGISPLDVDHVVVANKIK
        MRSEEADEYLVAGVKKLAEAAAAEEG  S                            ++   +     SSS+     + GRV   S + +   +  ++  
Subjt:  MRSEEADEYLVAGVKKLAEAAAAEEGETSF---------------------------TVVEAIFMDDLSSSLVYVTVDDGRVLGISPLDVDHVVVANKIK

Query:  TNIVMYGHWLSHLVKENFHGSRARKTKQQVGNDEGAAAIGANTISEYGR--------------------------------LSNNWNQPEFHNHQPEYRH
        ++      W S   + NF+  +     Q   +  G      N   +Y                                    NNWNQPEFHNHQPEYRH
Subjt:  TNIVMYGHWLSHLVKENFHGSRARKTKQQVGNDEGAAAIGANTISEYGR--------------------------------LSNNWNQPEFHNHQPEYRH

Query:  QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS
        QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS
Subjt:  QPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATS

Query:  IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN
        IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN
Subjt:  IEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALN

Query:  GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI
        GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI
Subjt:  GNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPPI

Query:  QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG
        QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG
Subjt:  QEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDG

Query:  FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD
        FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD
Subjt:  FKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELD

A0A6J1DVB2 putative pentatricopeptide repeat-containing protein At1g02420 isoform X13.8e-26499.56Show/hide
Query:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
        MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH
Subjt:  MILRPTPRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH

Query:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
        TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK
Subjt:  TAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMK

Query:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
        SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM
Subjt:  SKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEM

Query:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
        KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE
Subjt:  KEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALE

Query:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        LWNDMVVRGFGSYILVSEELFDLL DLGKLVEAEMCFLQMVDKGHKPSNVSFKRIK+
Subjt:  LWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

A0A6J1GAM6 putative pentatricopeptide repeat-containing protein At1g024207.6e-24191.11Show/hide
Query:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS
        RYF P PLSN   H FS+A+ENQSLN  V+T+FRIISSS SS NMR SLKS+RVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYH+AFS D+
Subjt:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS

Query:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL
        MLYILGR+RKFEKIWDVL+D+KLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDV CFNALLRTLCQEKSMTDARNVYHG+KSKFRPNL
Subjt:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL

Query:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
        QTFNILLSGWKSSEEAEGFFDEMREMGV+PDVVSYNCLVDVYCKNREMDKA+KV+E+M+DEDI ADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
Subjt:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP

Query:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV
        DVAAYNAAIRNFCIAKRLREAFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD GCLPNTQSCLFLMR FK+HEKVEMAL+LWNDMV 
Subjt:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV

Query:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        RGFGSYILVSEELFDLLCDLGKL+EAE CFLQMVDKGHKPSNVSFKRIK+
Subjt:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

A0A6J1KCH6 putative pentatricopeptide repeat-containing protein At1g024208.4e-24090.67Show/hide
Query:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS
        RYF P PLSN   H FS+A+ENQSLN  VET+FRIISSS+SS NMRHSLKS+RVFLSNELIDGVLKRVRFSHGNPLQALEFFNYT RRR FYH+AFS+D+
Subjt:  RYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDS

Query:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL
        MLYILGR+RKFEKIWDVL+D+KLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRK KKFVPEFDV CFNALLRTLCQEKSMTDARNVYHG+KSKFRPNL
Subjt:  MLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNL

Query:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
        QTFNILLSGWKSSEEAEGFFDEMREMGV+PDVVSYNCL+DVYCKNREMDKA+KV+E+M+DEDI ADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP
Subjt:  QTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYP

Query:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV
        DVAAYNAAIRNFCIAKRLREAFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD GCLPNTQSCLFLMR FK+HEKVEMAL+LWNDMV 
Subjt:  DVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVV

Query:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        RGFGSYILVSEELFDLLCDLGKL+EAE CFLQMVDKGHKPSNVSFKRIK+
Subjt:  RGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

SwissProt top hitse value%identityAlignment
B8BCZ8 Flowering time control protein FCA2.9e-8035.88Show/hide
Query:  GNGGLRPNCGNQNANLGR-KRPRNYSNRTVPSDHAEA---VKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATSIEADRAIG
        G GG R   G +    G   R R  S R   SDH      VKL++  VPRT TE+ +RPLFE HGD+VE+ +++D+ TG+QQG CFVKYATS EA+RAI 
Subjt:  GNGGLRPNCGNQNANLGR-KRPRNYSNRTVPSDHAEA---VKLYVAQVPRTGTEEAIRPLFEVHGDIVEIVILRDKITGQQQGSCFVKYATSIEADRAIG

Query:  ALDNQFTFPGEMAPINVKYADGERERLGVLE-KLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALNGNYTIRG
        AL NQ+T PG M PI V+YADGERER G +E KL+V SLNK  T +EIEE+F+PYG VED+YIM+D ++QSRGC FVK++ RE A+AA+ AL+GNY +RG
Subjt:  ALDNQFTFPGEMAPINVKYADGERERLGVLE-KLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARREMAMAAIKALNGNYTIRG

Query:  CDQPLIVRLADPKKSRVGEQRSNSMSGSPNF--------------------GHHPQP---------------------------------------FRPE
        C+QPLI+R ADPK+ R GE R     G P F                    G H  P                                       FRP+
Subjt:  CDQPLIVRLADPKKSRVGEQRSNSMSGSPNF--------------------GHHPQP---------------------------------------FRPE

Query:  ------------------------PPL--GAPAGG--------------CFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNT----IQKAHPPIQEP-
                                PP+  G   GG               FP  L   QQ     GPA+   Q+     + P +    +     P+ +P 
Subjt:  ------------------------PPL--GAPAGG--------------CFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNT----IQKAHPPIQEP-

Query:  ----------------SSSFAHIPSQ---PMRTTQQVCQPPT------------------QPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGL
                        S+S   IP Q   P     Q+ Q P                   Q  +   Q  +Y  QQ  +   QQQ S +N   P     +
Subjt:  ----------------SSSFAHIPSQ---PMRTTQQVCQPPT------------------QPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGL

Query:  QTFS-GVPN-------SPLVRPCSRVEVSLECDWSEHTCPDGFKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQK----QNHQLHSSLP--ISSPEVLP
        Q+ + G PN       + + +  +   V L C+W+EHT P+GFKYYYN +T ES W+KPEE+ L+EQQ +Q++ QK    Q HQ   ++    S P+   
Subjt:  QTFS-GVPN-------SPLVRPCSRVEVSLECDWSEHTCPDGFKYYYNCVTCESSWEKPEEFALFEQQLKQEKLQK----QNHQLHSSLP--ISSPEVLP

Query:  HP
        HP
Subjt:  HP

O80522 GDSL esterase/lipase At1g093904.9e-11260.6Show/hide
Query:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRS-VGSNFTDGANFAISGSSTLPRNHPFNLNVQVLQF
        PVIFNFGDSNSDTGG   GLG   G PNGR+FF + +GRL DGRL+IDFLC+S+++  L PYL S VGS F +GANFAI GSSTLPR  PF LN+Q++QF
Subjt:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRS-VGSNFTDGANFAISGSSTLPRNHPFNLNVQVLQF

Query:  LQFQSCSLEL--ISKGYKD-LVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTAA
        L F+S +LEL  IS   K+ ++   GF+NALY IDIGQND+A SF+  LSY +V++ IP+ +SEI+ AI  +Y  GGR FWVHNTGPLGC+PQKL  +  
Subjt:  LQFQSCSLEL--ISKGYKD-LVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTAA

Query:  NASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKYI
        ++   D HGCL   N AAK FN  L   C +LR+ L  A IVYVD+YAIKY+LIANS + GFE PLM CCGYGGPPYN+N +ITCG  G   CDEG ++I
Subjt:  NASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKYI

Query:  SWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFC
        SWDG+HYTE ANA+ A K+LS ++STP   F+FFC
Subjt:  SWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFC

Q6NLP7 GDSL esterase/lipase At3g622802.6e-9752.74Show/hide
Query:  SSRKPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVGSNFTDGANFAISGSSTLPRNHPFNLNVQV
        S++KP++ NFGDSNSDTGG   G+GL  G P+G TFFH+ +GRL DGRL++DF CE +   YL+PYL S+  NF  G NFA+SG++ LP    F L +Q+
Subjt:  SSRKPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVGSNFTDGANFAISGSSTLPRNHPFNLNVQV

Query:  LQFLQFQSCSLELISKGYKDLVDVEGFKNALYTIDIGQNDLAGSF--TYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA
         QF+ F++ S ELIS G +DL+D  GF+NALY IDIGQNDL  +   + L+Y  V+++IPS + EI+ AI ++Y +GGR FWVHNTGPLGC P++LA   
Subjt:  LQFLQFQSCSLELISKGYKDLVDVEGFKNALYTIDIGQNDLAGSF--TYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA

Query:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY
         N SD+D  GC +  N  AK FN  L + C ELRS   +AT+VYVD+Y+IKY L A+    GF +PLM CCGYGG P N+++  TCGQ G  +C +  K 
Subjt:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY

Query:  ISWDGVHYTEAANAVFASKILSSEYSTP
        I WDGVHYTEAAN      +L++ YS P
Subjt:  ISWDGVHYTEAANAVFASKILSSEYSTP

Q9FXB6 GDSL esterase/lipase LIP-42.7e-11057.69Show/hide
Query:  KPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRNHPFNLNVQVLQ
        +PVIFNFGDSNSDTGG   GLG   G PNGR FF + +GRL DGRL+IDFLC+S+++  L PYL S+G + F +GANFAI+GS TLP+N PF+LN+QV Q
Subjt:  KPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRNHPFNLNVQVLQ

Query:  FLQFQSCSLELISKGYK---DLVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA
        F  F+S SLEL S         +   GFKNALY IDIGQND+A SF    SY Q ++ IP  ++EI+ +I  +Y  GGR FW+HNTGPLGC+PQKL  + 
Subjt:  FLQFQSCSLELISKGYK---DLVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA

Query:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY
          + D+D HGCL + N+AA  FN  L   C ELR+ L +ATI+Y+D+YAIKY+LIANS   GF++PLM CCGYGG PYN+N  ITCG  G NVC+EG ++
Subjt:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY

Query:  ISWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFCNK
        ISWDG+HYTE ANA+ A K+LS  YS P   F+FFC +
Subjt:  ISWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFCNK

Q9FZ19 Putative pentatricopeptide repeat-containing protein At1g024201.5e-18066.23Show/hide
Query:  MILRP-TPRYFPPIPLSNSFSHPFSSANEN---QSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRR
        MIL+P +  +     LS SF H  + ++     +      ET+FR+I+ SN    ++ SL S+ + LS +LID VLKRVRFSHGNP+Q LEF+ Y    R
Subjt:  MILRP-TPRYFPPIPLSNSFSHPFSSANEN---QSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRR

Query:  GFYHTAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVACFNALLRTLCQEKSMTDARNV
        GFYH++FS+D+MLYILGR+RKF++IW++L++ K KDRSLI+PRT+ VVLGR+AK+CSVRQTVESF KFK+ VP+ FD ACFNALLRTLCQEKSMTDARNV
Subjt:  GFYHTAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVACFNALLRTLCQEKSMTDARNV

Query:  YHGMKSKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARN
        YH +K +F+P+LQTFNILLSGWKSSEEAE FF+EM+  G+KPDVV+YN L+DVYCK+RE++KAYK++++M++E+   DVITYT++IGGLGLIGQPDKAR 
Subjt:  YHGMKSKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARN

Query:  ILKEMKEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKV
        +LKEMKEYGCYPDVAAYNAAIRNFCIA+RL +A  L+DEMV KGLSPNATTYNLFFR+   +NDL  SW LY RM+   CLPNTQSC+FL+++FK+HEKV
Subjt:  ILKEMKEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKV

Query:  EMALELWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        +MA+ LW DMVV+GFGSY LVS+ L DLLCDL K+ EAE C L+MV+KGH+PSNVSFKRIKL
Subjt:  EMALELWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

Arabidopsis top hitse value%identityAlignment
AT1G02420.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-18166.23Show/hide
Query:  MILRP-TPRYFPPIPLSNSFSHPFSSANEN---QSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRR
        MIL+P +  +     LS SF H  + ++     +      ET+FR+I+ SN    ++ SL S+ + LS +LID VLKRVRFSHGNP+Q LEF+ Y    R
Subjt:  MILRP-TPRYFPPIPLSNSFSHPFSSANEN---QSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRR

Query:  GFYHTAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVACFNALLRTLCQEKSMTDARNV
        GFYH++FS+D+MLYILGR+RKF++IW++L++ K KDRSLI+PRT+ VVLGR+AK+CSVRQTVESF KFK+ VP+ FD ACFNALLRTLCQEKSMTDARNV
Subjt:  GFYHTAFSVDSMLYILGRSRKFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVACFNALLRTLCQEKSMTDARNV

Query:  YHGMKSKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARN
        YH +K +F+P+LQTFNILLSGWKSSEEAE FF+EM+  G+KPDVV+YN L+DVYCK+RE++KAYK++++M++E+   DVITYT++IGGLGLIGQPDKAR 
Subjt:  YHGMKSKFRPNLQTFNILLSGWKSSEEAEGFFDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARN

Query:  ILKEMKEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKV
        +LKEMKEYGCYPDVAAYNAAIRNFCIA+RL +A  L+DEMV KGLSPNATTYNLFFR+   +NDL  SW LY RM+   CLPNTQSC+FL+++FK+HEKV
Subjt:  ILKEMKEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKV

Query:  EMALELWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL
        +MA+ LW DMVV+GFGSY LVS+ L DLLCDL K+ EAE C L+MV+KGH+PSNVSFKRIKL
Subjt:  EMALELWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHKPSNVSFKRIKL

AT1G09390.1 GDSL-like Lipase/Acylhydrolase superfamily protein3.5e-11360.6Show/hide
Query:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRS-VGSNFTDGANFAISGSSTLPRNHPFNLNVQVLQF
        PVIFNFGDSNSDTGG   GLG   G PNGR+FF + +GRL DGRL+IDFLC+S+++  L PYL S VGS F +GANFAI GSSTLPR  PF LN+Q++QF
Subjt:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRS-VGSNFTDGANFAISGSSTLPRNHPFNLNVQVLQF

Query:  LQFQSCSLEL--ISKGYKD-LVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTAA
        L F+S +LEL  IS   K+ ++   GF+NALY IDIGQND+A SF+  LSY +V++ IP+ +SEI+ AI  +Y  GGR FWVHNTGPLGC+PQKL  +  
Subjt:  LQFQSCSLEL--ISKGYKD-LVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTAA

Query:  NASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKYI
        ++   D HGCL   N AAK FN  L   C +LR+ L  A IVYVD+YAIKY+LIANS + GFE PLM CCGYGGPPYN+N +ITCG  G   CDEG ++I
Subjt:  NASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKYI

Query:  SWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFC
        SWDG+HYTE ANA+ A K+LS ++STP   F+FFC
Subjt:  SWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFC

AT1G54790.1 GDSL-like Lipase/Acylhydrolase superfamily protein1.8e-8546.33Show/hide
Query:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRN----HPFNLNVQ
        P  FNFGDSNSDTG    GLG+R   PNG+  F   S R CDGRL+IDFL + +   +L PYL S+G  NF  G NFA +GS+ LP N     PF+ ++Q
Subjt:  PVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRN----HPFNLNVQ

Query:  VLQFLQFQSCSLELISKG----YKDLVDVEGFKNALYTIDIGQNDLAGSFTYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLA
        + QF++F+S ++EL+SK      K L  ++ +   LY IDIGQND+AG+F   +  QV+  IPS +      +  +Y+ GGRN W+HNTGPLGC+ Q +A
Subjt:  VLQFLQFQSCSLELISKG----YKDLVDVEGFKNALYTIDIGQNDLAGSFTYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLA

Query:  TTAANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQT----GFNV
            +++ +D  GC+ + N AAK FN QL A   + ++   +A + YVD+++IK NLIAN +  GFE PLM CCG GG P N++  ITCGQT    G +V
Subjt:  TTAANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQT----GFNV

Query:  ----CDEGLKYISWDGVHYTEAANAVFASKILSSEYSTPNF
            C++  +YI+WDG+HYTEAAN   +S+IL+ +YS P F
Subjt:  ----CDEGLKYISWDGVHYTEAANAVFASKILSSEYSTPNF

AT1G56670.1 GDSL-like Lipase/Acylhydrolase superfamily protein1.9e-11157.69Show/hide
Query:  KPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRNHPFNLNVQVLQ
        +PVIFNFGDSNSDTGG   GLG   G PNGR FF + +GRL DGRL+IDFLC+S+++  L PYL S+G + F +GANFAI+GS TLP+N PF+LN+QV Q
Subjt:  KPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVG-SNFTDGANFAISGSSTLPRNHPFNLNVQVLQ

Query:  FLQFQSCSLELISKGYK---DLVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA
        F  F+S SLEL S         +   GFKNALY IDIGQND+A SF    SY Q ++ IP  ++EI+ +I  +Y  GGR FW+HNTGPLGC+PQKL  + 
Subjt:  FLQFQSCSLELISKGYK---DLVDVEGFKNALYTIDIGQNDLAGSFTY-LSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA

Query:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY
          + D+D HGCL + N+AA  FN  L   C ELR+ L +ATI+Y+D+YAIKY+LIANS   GF++PLM CCGYGG PYN+N  ITCG  G NVC+EG ++
Subjt:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY

Query:  ISWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFCNK
        ISWDG+HYTE ANA+ A K+LS  YS P   F+FFC +
Subjt:  ISWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFCNK

AT3G62280.1 GDSL-like Lipase/Acylhydrolase superfamily protein1.9e-9852.74Show/hide
Query:  SSRKPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVGSNFTDGANFAISGSSTLPRNHPFNLNVQV
        S++KP++ NFGDSNSDTGG   G+GL  G P+G TFFH+ +GRL DGRL++DF CE +   YL+PYL S+  NF  G NFA+SG++ LP    F L +Q+
Subjt:  SSRKPVIFNFGDSNSDTGGYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVGSNFTDGANFAISGSSTLPRNHPFNLNVQV

Query:  LQFLQFQSCSLELISKGYKDLVDVEGFKNALYTIDIGQNDLAGSF--TYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA
         QF+ F++ S ELIS G +DL+D  GF+NALY IDIGQNDL  +   + L+Y  V+++IPS + EI+ AI ++Y +GGR FWVHNTGPLGC P++LA   
Subjt:  LQFLQFQSCSLELISKGYKDLVDVEGFKNALYTIDIGQNDLAGSF--TYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTA

Query:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY
         N SD+D  GC +  N  AK FN  L + C ELRS   +AT+VYVD+Y+IKY L A+    GF +PLM CCGYGG P N+++  TCGQ G  +C +  K 
Subjt:  ANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNATIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKY

Query:  ISWDGVHYTEAANAVFASKILSSEYSTP
        I WDGVHYTEAAN      +L++ YS P
Subjt:  ISWDGVHYTEAANAVFASKILSSEYSTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAGTGAAGAAGCTGATGAATACCTTGTGGCCGGAGTGAAGAAGCTGGCGGAGGCGGCTGCCGCCGAAGAGGGAGAGACCTCTTTTACTGTTGTGGAGGCAATTTT
CATGGATGATCTCTCTTCTTCTCTAGTTTATGTGACAGTAGACGATGGTCGGGTGTTGGGAATCTCCCCGTTGGATGTTGACCATGTTGTTGTGGCTAACAAAATCAAAA
CCAACATTGTTATGTATGGTCATTGGCTTAGTCATTTGGTTAAAGAGAACTTCCATGGTTCAAGGGCAAGGAAGACGAAGCAGCAAGTGGGCAATGACGAAGGTGCAGCG
GCAATCGGTGCAAACACCATTTCGGAATATGGGAGGCTATCGAATAACTGGAACCAGCCGGAGTTCCACAATCATCAGCCGGAGTATCGTCACCAGCCACACTTCAACGG
GGAGGCGAATGAGGGGTTCGGAAACGGCGGTTTAAGGCCGAATTGTGGCAATCAAAATGCTAATTTGGGGCGCAAAAGGCCGAGAAACTATTCCAACAGAACAGTTCCCT
CAGATCATGCCGAGGCTGTCAAATTGTATGTTGCACAAGTTCCCCGGACAGGAACCGAAGAAGCTATCCGTCCCCTGTTTGAAGTACATGGAGATATTGTCGAAATTGTA
ATATTAAGGGATAAGATAACTGGCCAACAGCAAGGTAGTTGTTTTGTGAAGTATGCAACCTCTATTGAAGCAGACAGGGCTATCGGAGCTTTGGACAATCAGTTTACTTT
TCCTGGAGAAATGGCTCCTATCAATGTGAAATATGCTGATGGTGAGAGGGAGCGCCTAGGAGTACTTGAAAAATTGTATGTTGGTAGCTTGAATAAAAATACAACAAAAA
GGGAAATTGAGGAGGTATTCTCGCCTTATGGTTTTGTTGAGGATATTTATATCATGCGTGATGAGCTGAAGCAAAGCCGTGGATGTGCCTTTGTTAAGTACGCTCGTAGA
GAGATGGCAATGGCAGCAATCAAAGCATTGAATGGAAATTATACGATTAGAGGTTGTGATCAGCCACTTATTGTTCGTCTTGCAGACCCTAAGAAATCCAGGGTTGGTGA
ACAAAGAAGCAACAGCATGTCAGGCAGCCCAAATTTTGGTCATCATCCTCAACCTTTTAGACCCGAGCCCCCTCTTGGTGCCCCTGCTGGAGGATGTTTTCCCAATAATT
TATATCCCCCACAACAAAATTCTGCAAGTTTAGGACCAGCAAAAAATGCTTCCCAAGTGGCATCCAATGCCCCTTTGGCTCCTAATACCATACAGAAAGCACACCCTCCA
ATACAAGAACCTTCATCATCTTTTGCTCATATACCATCACAACCAATGAGAACAACACAACAAGTTTGTCAGCCTCCTACACAGCCTGATTTTTCTAAGATGCAGAATCA
GGTGTATTGTCAACAACAGCCCCGGAAAGACTCGTATCAGCAACAGAATTCACAGGTTAATGAGAATACGCCTCCTACAGCACATGGTCTTCAGACCTTTAGTGGTGTTC
CTAACTCACCTTTGGTGCGTCCATGTTCTCGGGTAGAAGTTTCCTTGGAGTGCGATTGGAGTGAACACACCTGCCCTGACGGTTTCAAATACTACTATAACTGTGTCACC
TGTGAAAGTTCGTGGGAGAAGCCAGAAGAGTTTGCCTTGTTTGAGCAACAATTAAAGCAGGAAAAGCTTCAAAAACAAAACCATCAGCTTCATTCTTCATTACCAATCTC
TTCTCCAGAAGTTCTGCCTCATCCAAATGTCTTCAGTCAAAAACTTGAAGTCCAATCCTCCTCTGCTGTACGAGAATTGGATTATCCCGTTTTCAAAATCCGCTCAACTA
CGTACGTGTTTGATCTTTCTTTGTCCTCACATTATCCAAAATCATTGTCCCCACAATTTCACTTGAGGATGCCTCGTGTTTTCATAACATTCGGTGGAGAATGGAATGAT
AGTGAAAAAGATTATGTCGGCGGTCGTACGAGGGGATTGACAGTGGATAGTACAATCACGTACAGAGAATTTCTAGGCCAAAACGTGTCCTCGATTCCTATGTCCGCAGC
TTGTATTCCCCCATTCCAGAGACCCACATATCCTATACCCTCATTTCCTTCCTCATCATCGAACCCCTCTTCTTCCCGACAGCCACACCCCTCCTACGGGCATATAGGTC
ATGATGTAGAGGGTTTAACACCATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGATAGGTCTGAAGAAGGACACTCTCAAGCAGAATATGGGAACGAAGAGCAT
GACGATGCGCTTGATGATGAGCTTGAGCCTGATGTCGAACAAGTGCACACTGAGATTCGCAGGGATGAAGAAGCAGTCCGGCCACCGGGATGTAATGGTCTCACCGGAGA
CCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGGAGTTGAGTTTGAAAATGCATTTAG
TTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAGAAGTCAACGCCGAAGCTATATATACTGCGGTGCGTTCATGCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTA
AAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCCACCCATACGTGCTATGGTGGAGCTTTAAAACATGATCATAGGCAAGCCAAAAGTTGGGTGGTTGGACATCT
TGTGCAAGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTT
CTAGTGAAGAAGCACTCCGACTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTAAAGCTTTGAAAATCATGAACCCAGGACCGGGACGTGTC
CGGGATCGACCTTCATCCCGGACCGAACCGGACCGGGACAAGTCCGAGATTAAAGTAGATCCTGGAGGAGATCGAACCCTTAACATGGATAAAATAGAGAGAAAATGGCT
AAGTAATCCACCTCGAAAGCACTCATTTTCTTCCGCTCTTCGCCTTCCAAACACTTCACCACTTTCGCAATCTCCTCCTCCGTCTCAACGCCGCCTTGATCTCCTCCGTC
ATCATCACCGCGTTCATCCGGCCAACCTATCAGAGGAACCCCATTATCCAATTTCTTCTCATCGAATACTTCATATTCAGTTTCATACGGCGATGATTCTCCGGCCAACG
CCCAGGTACTTCCCACCTATTCCGTTATCCAATTCCTTCTCGCATCCATTTTCATCTGCAAATGAAAATCAATCGTTAAACGGCAAAGTAGAAACAATTTTCCGCATAAT
TAGTAGTTCAAACTCGTCAACAAATATGAGGCATTCTCTGAAATCGGCTAGGGTTTTCCTCTCAAATGAGTTGATCGATGGGGTTCTTAAGAGGGTTAGGTTTAGTCACG
GTAATCCTTTACAGGCCTTGGAGTTCTTTAATTACACTGGTAGAAGAAGGGGATTTTATCACACTGCGTTTTCTGTGGATAGCATGCTTTATATCTTAGGCAGGAGCCGG
AAGTTTGAAAAGATTTGGGACGTTTTGGTTGATGTTAAGCTTAAAGATCGGTCGTTAATCACGCCGCGGACTGTTATGGTTGTTTTGGGAAGAATTGCCAAAGTCTGCTC
TGTGAGACAGACTGTGGAGTCTTTTAGGAAGTTTAAGAAGTTTGTTCCTGAGTTTGATGTGGCCTGCTTCAATGCCTTGTTGAGAACTCTGTGCCAGGAGAAGAGTATGA
CGGATGCGAGGAATGTGTATCACGGTATGAAGAGTAAGTTTAGACCGAACTTGCAGACGTTTAACATATTATTGTCGGGTTGGAAGTCCTCGGAAGAAGCCGAGGGATTC
TTTGATGAGATGAGAGAAATGGGGGTCAAGCCTGATGTTGTTTCATATAATTGTTTGGTTGATGTTTATTGTAAAAACAGGGAAATGGACAAGGCGTACAAGGTTGTTGA
GAGAATGCAGGATGAGGATATACATGCTGATGTGATTACGTACACTAGTATCATTGGGGGACTCGGGTTGATCGGTCAACCAGACAAAGCGAGAAATATTTTGAAGGAAA
TGAAGGAGTATGGATGTTATCCTGATGTTGCAGCTTATAATGCTGCGATACGAAATTTCTGCATTGCGAAGAGGCTTCGCGAGGCTTTTGATTTGATGGATGAAATGGTG
AATAAGGGTTTGAGTCCAAATGCTACTACATACAACTTGTTCTTTAGGATCTTCTTCTGGTCAAATGACTTGCAAAGCTCGTGGAACTTGTACCGGCGAATGATGGATAC
GGGGTGCTTGCCTAATACGCAGTCCTGCTTGTTTCTAATGAGGTTATTTAAGAAGCATGAAAAGGTAGAAATGGCACTGGAGCTATGGAATGACATGGTAGTAAGGGGAT
TTGGATCTTATATATTAGTATCTGAGGAGTTATTTGATCTTCTCTGTGATTTGGGAAAGTTGGTGGAAGCAGAGATGTGTTTTCTGCAGATGGTAGATAAGGGGCATAAG
CCTAGTAACGTCTCGTTTAAAAGGATCAAACTTGGTGATGGATTGATTTATGAACTTGTGGATGGTTCCAATAGCTTCCCTCATTTCTATGGTCCTTCTCGAAGCTTCAG
CCCTATCCCAATAGATGCAGTAACCAAAGCAGAGAAACTAACTCTTTCTGGTGGTCGATATGCTTGTGCTCATGTTCTCTTAGAATATAATGATGTCATGGACGATATTC
TTATTTGTAGGAATCCCTTTCCACTTGCGTTCCTTATAGATTTTATTGAGAAAATTTCCAGCCGAAAGCCGGTTATTTTCAACTTCGGGGACTCGAATTCCGATACAGGT
GGCTACTCTGAGGGACTTGGACTCAGATTTGGACCCCCAAATGGCCGCACATTCTTCCACAAACCATCAGGAAGATTATGTGATGGACGTCTAATGATTGATTTTCTATG
TGAAAGTGTGAGTTCTGATTATCTGACCCCATATCTTCGATCTGTGGGCTCAAATTTCACTGATGGAGCTAACTTTGCAATATCTGGTTCATCCACTCTGCCAAGGAATC
ATCCATTTAATTTGAACGTTCAAGTTCTGCAATTCCTTCAATTTCAATCATGCTCCCTTGAGCTTATTTCGAAAGGCTATAAAGACTTGGTTGATGTGGAGGGATTCAAG
AATGCACTTTACACGATCGACATTGGACAAAATGATCTTGCTGGTTCATTCACTTACCTGTCTTATCCGCAAGTCATCCAGAGAATCCCGTCTTTTGTTTCTGAGATTAG
AGATGCCATATGGAGTATATATCAACATGGAGGAAGAAACTTTTGGGTACACAACACAGGGCCATTGGGATGTATGCCTCAAAAGCTTGCTACAACAGCTGCAAATGCTA
GTGACATTGACAATCATGGCTGCCTACAAGCCCTCAACAATGCTGCAAAAGAATTCAACACTCAGTTGAAAGCTGCATGTGGAGAACTAAGATCCGCTTTGACGAATGCC
ACCATCGTGTACGTCGATGTCTACGCCATCAAATACAACCTCATTGCCAATTCCGCATCGAACGGGTTCGAGAATCCATTGATGGTGTGCTGTGGATACGGCGGACCGCC
GTACAACTTCAACCAGAGCATCACGTGCGGGCAAACCGGGTTCAACGTGTGCGACGAAGGATTGAAGTACATAAGCTGGGATGGAGTTCACTACACAGAAGCTGCAAATG
CTGTGTTTGCCTCCAAGATACTTTCTTCTGAATACTCTACTCCAAATTTTCACTTCAATTTCTTCTGTAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGAGTGAAGAAGCTGATGAATACCTTGTGGCCGGAGTGAAGAAGCTGGCGGAGGCGGCTGCCGCCGAAGAGGGAGAGACCTCTTTTACTGTTGTGGAGGCAATTTT
CATGGATGATCTCTCTTCTTCTCTAGTTTATGTGACAGTAGACGATGGTCGGGTGTTGGGAATCTCCCCGTTGGATGTTGACCATGTTGTTGTGGCTAACAAAATCAAAA
CCAACATTGTTATGTATGGTCATTGGCTTAGTCATTTGGTTAAAGAGAACTTCCATGGTTCAAGGGCAAGGAAGACGAAGCAGCAAGTGGGCAATGACGAAGGTGCAGCG
GCAATCGGTGCAAACACCATTTCGGAATATGGGAGGCTATCGAATAACTGGAACCAGCCGGAGTTCCACAATCATCAGCCGGAGTATCGTCACCAGCCACACTTCAACGG
GGAGGCGAATGAGGGGTTCGGAAACGGCGGTTTAAGGCCGAATTGTGGCAATCAAAATGCTAATTTGGGGCGCAAAAGGCCGAGAAACTATTCCAACAGAACAGTTCCCT
CAGATCATGCCGAGGCTGTCAAATTGTATGTTGCACAAGTTCCCCGGACAGGAACCGAAGAAGCTATCCGTCCCCTGTTTGAAGTACATGGAGATATTGTCGAAATTGTA
ATATTAAGGGATAAGATAACTGGCCAACAGCAAGGTAGTTGTTTTGTGAAGTATGCAACCTCTATTGAAGCAGACAGGGCTATCGGAGCTTTGGACAATCAGTTTACTTT
TCCTGGAGAAATGGCTCCTATCAATGTGAAATATGCTGATGGTGAGAGGGAGCGCCTAGGAGTACTTGAAAAATTGTATGTTGGTAGCTTGAATAAAAATACAACAAAAA
GGGAAATTGAGGAGGTATTCTCGCCTTATGGTTTTGTTGAGGATATTTATATCATGCGTGATGAGCTGAAGCAAAGCCGTGGATGTGCCTTTGTTAAGTACGCTCGTAGA
GAGATGGCAATGGCAGCAATCAAAGCATTGAATGGAAATTATACGATTAGAGGTTGTGATCAGCCACTTATTGTTCGTCTTGCAGACCCTAAGAAATCCAGGGTTGGTGA
ACAAAGAAGCAACAGCATGTCAGGCAGCCCAAATTTTGGTCATCATCCTCAACCTTTTAGACCCGAGCCCCCTCTTGGTGCCCCTGCTGGAGGATGTTTTCCCAATAATT
TATATCCCCCACAACAAAATTCTGCAAGTTTAGGACCAGCAAAAAATGCTTCCCAAGTGGCATCCAATGCCCCTTTGGCTCCTAATACCATACAGAAAGCACACCCTCCA
ATACAAGAACCTTCATCATCTTTTGCTCATATACCATCACAACCAATGAGAACAACACAACAAGTTTGTCAGCCTCCTACACAGCCTGATTTTTCTAAGATGCAGAATCA
GGTGTATTGTCAACAACAGCCCCGGAAAGACTCGTATCAGCAACAGAATTCACAGGTTAATGAGAATACGCCTCCTACAGCACATGGTCTTCAGACCTTTAGTGGTGTTC
CTAACTCACCTTTGGTGCGTCCATGTTCTCGGGTAGAAGTTTCCTTGGAGTGCGATTGGAGTGAACACACCTGCCCTGACGGTTTCAAATACTACTATAACTGTGTCACC
TGTGAAAGTTCGTGGGAGAAGCCAGAAGAGTTTGCCTTGTTTGAGCAACAATTAAAGCAGGAAAAGCTTCAAAAACAAAACCATCAGCTTCATTCTTCATTACCAATCTC
TTCTCCAGAAGTTCTGCCTCATCCAAATGTCTTCAGTCAAAAACTTGAAGTCCAATCCTCCTCTGCTGTACGAGAATTGGATTATCCCGTTTTCAAAATCCGCTCAACTA
CGTACGTGTTTGATCTTTCTTTGTCCTCACATTATCCAAAATCATTGTCCCCACAATTTCACTTGAGGATGCCTCGTGTTTTCATAACATTCGGTGGAGAATGGAATGAT
AGTGAAAAAGATTATGTCGGCGGTCGTACGAGGGGATTGACAGTGGATAGTACAATCACGTACAGAGAATTTCTAGGCCAAAACGTGTCCTCGATTCCTATGTCCGCAGC
TTGTATTCCCCCATTCCAGAGACCCACATATCCTATACCCTCATTTCCTTCCTCATCATCGAACCCCTCTTCTTCCCGACAGCCACACCCCTCCTACGGGCATATAGGTC
ATGATGTAGAGGGTTTAACACCATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGATAGGTCTGAAGAAGGACACTCTCAAGCAGAATATGGGAACGAAGAGCAT
GACGATGCGCTTGATGATGAGCTTGAGCCTGATGTCGAACAAGTGCACACTGAGATTCGCAGGGATGAAGAAGCAGTCCGGCCACCGGGATGTAATGGTCTCACCGGAGA
CCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGGAGTTGAGTTTGAAAATGCATTTAG
TTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAGAAGTCAACGCCGAAGCTATATATACTGCGGTGCGTTCATGCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTA
AAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCCACCCATACGTGCTATGGTGGAGCTTTAAAACATGATCATAGGCAAGCCAAAAGTTGGGTGGTTGGACATCT
TGTGCAAGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTT
CTAGTGAAGAAGCACTCCGACTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTAAAGCTTTGAAAATCATGAACCCAGGACCGGGACGTGTC
CGGGATCGACCTTCATCCCGGACCGAACCGGACCGGGACAAGTCCGAGATTAAAGTAGATCCTGGAGGAGATCGAACCCTTAACATGGATAAAATAGAGAGAAAATGGCT
AAGTAATCCACCTCGAAAGCACTCATTTTCTTCCGCTCTTCGCCTTCCAAACACTTCACCACTTTCGCAATCTCCTCCTCCGTCTCAACGCCGCCTTGATCTCCTCCGTC
ATCATCACCGCGTTCATCCGGCCAACCTATCAGAGGAACCCCATTATCCAATTTCTTCTCATCGAATACTTCATATTCAGTTTCATACGGCGATGATTCTCCGGCCAACG
CCCAGGTACTTCCCACCTATTCCGTTATCCAATTCCTTCTCGCATCCATTTTCATCTGCAAATGAAAATCAATCGTTAAACGGCAAAGTAGAAACAATTTTCCGCATAAT
TAGTAGTTCAAACTCGTCAACAAATATGAGGCATTCTCTGAAATCGGCTAGGGTTTTCCTCTCAAATGAGTTGATCGATGGGGTTCTTAAGAGGGTTAGGTTTAGTCACG
GTAATCCTTTACAGGCCTTGGAGTTCTTTAATTACACTGGTAGAAGAAGGGGATTTTATCACACTGCGTTTTCTGTGGATAGCATGCTTTATATCTTAGGCAGGAGCCGG
AAGTTTGAAAAGATTTGGGACGTTTTGGTTGATGTTAAGCTTAAAGATCGGTCGTTAATCACGCCGCGGACTGTTATGGTTGTTTTGGGAAGAATTGCCAAAGTCTGCTC
TGTGAGACAGACTGTGGAGTCTTTTAGGAAGTTTAAGAAGTTTGTTCCTGAGTTTGATGTGGCCTGCTTCAATGCCTTGTTGAGAACTCTGTGCCAGGAGAAGAGTATGA
CGGATGCGAGGAATGTGTATCACGGTATGAAGAGTAAGTTTAGACCGAACTTGCAGACGTTTAACATATTATTGTCGGGTTGGAAGTCCTCGGAAGAAGCCGAGGGATTC
TTTGATGAGATGAGAGAAATGGGGGTCAAGCCTGATGTTGTTTCATATAATTGTTTGGTTGATGTTTATTGTAAAAACAGGGAAATGGACAAGGCGTACAAGGTTGTTGA
GAGAATGCAGGATGAGGATATACATGCTGATGTGATTACGTACACTAGTATCATTGGGGGACTCGGGTTGATCGGTCAACCAGACAAAGCGAGAAATATTTTGAAGGAAA
TGAAGGAGTATGGATGTTATCCTGATGTTGCAGCTTATAATGCTGCGATACGAAATTTCTGCATTGCGAAGAGGCTTCGCGAGGCTTTTGATTTGATGGATGAAATGGTG
AATAAGGGTTTGAGTCCAAATGCTACTACATACAACTTGTTCTTTAGGATCTTCTTCTGGTCAAATGACTTGCAAAGCTCGTGGAACTTGTACCGGCGAATGATGGATAC
GGGGTGCTTGCCTAATACGCAGTCCTGCTTGTTTCTAATGAGGTTATTTAAGAAGCATGAAAAGGTAGAAATGGCACTGGAGCTATGGAATGACATGGTAGTAAGGGGAT
TTGGATCTTATATATTAGTATCTGAGGAGTTATTTGATCTTCTCTGTGATTTGGGAAAGTTGGTGGAAGCAGAGATGTGTTTTCTGCAGATGGTAGATAAGGGGCATAAG
CCTAGTAACGTCTCGTTTAAAAGGATCAAACTTGGTGATGGATTGATTTATGAACTTGTGGATGGTTCCAATAGCTTCCCTCATTTCTATGGTCCTTCTCGAAGCTTCAG
CCCTATCCCAATAGATGCAGTAACCAAAGCAGAGAAACTAACTCTTTCTGGTGGTCGATATGCTTGTGCTCATGTTCTCTTAGAATATAATGATGTCATGGACGATATTC
TTATTTGTAGGAATCCCTTTCCACTTGCGTTCCTTATAGATTTTATTGAGAAAATTTCCAGCCGAAAGCCGGTTATTTTCAACTTCGGGGACTCGAATTCCGATACAGGT
GGCTACTCTGAGGGACTTGGACTCAGATTTGGACCCCCAAATGGCCGCACATTCTTCCACAAACCATCAGGAAGATTATGTGATGGACGTCTAATGATTGATTTTCTATG
TGAAAGTGTGAGTTCTGATTATCTGACCCCATATCTTCGATCTGTGGGCTCAAATTTCACTGATGGAGCTAACTTTGCAATATCTGGTTCATCCACTCTGCCAAGGAATC
ATCCATTTAATTTGAACGTTCAAGTTCTGCAATTCCTTCAATTTCAATCATGCTCCCTTGAGCTTATTTCGAAAGGCTATAAAGACTTGGTTGATGTGGAGGGATTCAAG
AATGCACTTTACACGATCGACATTGGACAAAATGATCTTGCTGGTTCATTCACTTACCTGTCTTATCCGCAAGTCATCCAGAGAATCCCGTCTTTTGTTTCTGAGATTAG
AGATGCCATATGGAGTATATATCAACATGGAGGAAGAAACTTTTGGGTACACAACACAGGGCCATTGGGATGTATGCCTCAAAAGCTTGCTACAACAGCTGCAAATGCTA
GTGACATTGACAATCATGGCTGCCTACAAGCCCTCAACAATGCTGCAAAAGAATTCAACACTCAGTTGAAAGCTGCATGTGGAGAACTAAGATCCGCTTTGACGAATGCC
ACCATCGTGTACGTCGATGTCTACGCCATCAAATACAACCTCATTGCCAATTCCGCATCGAACGGGTTCGAGAATCCATTGATGGTGTGCTGTGGATACGGCGGACCGCC
GTACAACTTCAACCAGAGCATCACGTGCGGGCAAACCGGGTTCAACGTGTGCGACGAAGGATTGAAGTACATAAGCTGGGATGGAGTTCACTACACAGAAGCTGCAAATG
CTGTGTTTGCCTCCAAGATACTTTCTTCTGAATACTCTACTCCAAATTTTCACTTCAATTTCTTCTGTAACAAGTGA
Protein sequenceShow/hide protein sequence
MRSEEADEYLVAGVKKLAEAAAAEEGETSFTVVEAIFMDDLSSSLVYVTVDDGRVLGISPLDVDHVVVANKIKTNIVMYGHWLSHLVKENFHGSRARKTKQQVGNDEGAA
AIGANTISEYGRLSNNWNQPEFHNHQPEYRHQPHFNGEANEGFGNGGLRPNCGNQNANLGRKRPRNYSNRTVPSDHAEAVKLYVAQVPRTGTEEAIRPLFEVHGDIVEIV
ILRDKITGQQQGSCFVKYATSIEADRAIGALDNQFTFPGEMAPINVKYADGERERLGVLEKLYVGSLNKNTTKREIEEVFSPYGFVEDIYIMRDELKQSRGCAFVKYARR
EMAMAAIKALNGNYTIRGCDQPLIVRLADPKKSRVGEQRSNSMSGSPNFGHHPQPFRPEPPLGAPAGGCFPNNLYPPQQNSASLGPAKNASQVASNAPLAPNTIQKAHPP
IQEPSSSFAHIPSQPMRTTQQVCQPPTQPDFSKMQNQVYCQQQPRKDSYQQQNSQVNENTPPTAHGLQTFSGVPNSPLVRPCSRVEVSLECDWSEHTCPDGFKYYYNCVT
CESSWEKPEEFALFEQQLKQEKLQKQNHQLHSSLPISSPEVLPHPNVFSQKLEVQSSSAVRELDYPVFKIRSTTYVFDLSLSSHYPKSLSPQFHLRMPRVFITFGGEWND
SEKDYVGGRTRGLTVDSTITYREFLGQNVSSIPMSAACIPPFQRPTYPIPSFPSSSSNPSSSRQPHPSYGHIGHDVEGLTPLGSDVVPCNLGDDRSEEGHSQAEYGNEEH
DDALDDELEPDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSGTNDVNEGDVFDNKKELSLKMHLVAMRKNFQFKVKKSTPKLYILRCVHADCTWRLRATKL
KECTLFKIKKYCATHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDPASSYGLLPAYGKALKIMNPGPGRV
RDRPSSRTEPDRDKSEIKVDPGGDRTLNMDKIERKWLSNPPRKHSFSSALRLPNTSPLSQSPPPSQRRLDLLRHHHRVHPANLSEEPHYPISSHRILHIQFHTAMILRPT
PRYFPPIPLSNSFSHPFSSANENQSLNGKVETIFRIISSSNSSTNMRHSLKSARVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDSMLYILGRSR
KFEKIWDVLVDVKLKDRSLITPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVACFNALLRTLCQEKSMTDARNVYHGMKSKFRPNLQTFNILLSGWKSSEEAEGF
FDEMREMGVKPDVVSYNCLVDVYCKNREMDKAYKVVERMQDEDIHADVITYTSIIGGLGLIGQPDKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLREAFDLMDEMV
NKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTGCLPNTQSCLFLMRLFKKHEKVEMALELWNDMVVRGFGSYILVSEELFDLLCDLGKLVEAEMCFLQMVDKGHK
PSNVSFKRIKLGDGLIYELVDGSNSFPHFYGPSRSFSPIPIDAVTKAEKLTLSGGRYACAHVLLEYNDVMDDILICRNPFPLAFLIDFIEKISSRKPVIFNFGDSNSDTG
GYSEGLGLRFGPPNGRTFFHKPSGRLCDGRLMIDFLCESVSSDYLTPYLRSVGSNFTDGANFAISGSSTLPRNHPFNLNVQVLQFLQFQSCSLELISKGYKDLVDVEGFK
NALYTIDIGQNDLAGSFTYLSYPQVIQRIPSFVSEIRDAIWSIYQHGGRNFWVHNTGPLGCMPQKLATTAANASDIDNHGCLQALNNAAKEFNTQLKAACGELRSALTNA
TIVYVDVYAIKYNLIANSASNGFENPLMVCCGYGGPPYNFNQSITCGQTGFNVCDEGLKYISWDGVHYTEAANAVFASKILSSEYSTPNFHFNFFCNK