; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010699 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010699
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold35:1460052..1462166
RNA-Seq ExpressionMS010699
SyntenyMS010699
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603840.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.95Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LS STFHSAFKSY+EGKIST PLL+FR+LLR RV PNDSTFSLLIKAFV+SSSSSSFAP S SENAKAEANQLQ HF+KWGFD+FLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSGY+HDAFELFVEMRRRGF P QRT+VSLIP+CG+Q LFVQGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLEE ++ NSVTMV+ILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYVRCG IQIAELIYMS LQK+LV+LTAIIS Y EKGD+ SVVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYS+FDDI AVFSLF EM EKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE +H Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE+VFKSMKEPCLASWNS+ISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

XP_022133023.1 pentatricopeptide repeat-containing protein At2g04860 [Momordica charantia]0.0e+0099.01Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQFAP GGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKNALASMYAKC+DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSY EKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYSKFDDIAAVFSLFHEMHEKTLSSWN+VISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCY+LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAAS RWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

XP_022950030.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita moschata]0.0e+0087.09Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LSLSTFHSAFKSY+EGKIST PLL+FR+LLR RV PNDSTFSLLIKAFV+SSSSSSFAP S SENAKAEANQLQ HF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSGY+HDAFELFVEMRRRGF P QRT+VSLIP+CG+Q LF QGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN+LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLEE ++ NSVTMV+ILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYVRCG IQIAELIYMS LQK+LV+LTAIIS Y EKGD+ SVVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYS+FDDI AVFSLF EM EKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE +H Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE+VFKSMKEPCLASWNS+ISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

XP_022977696.1 pentatricopeptide repeat-containing protein At2g04860 isoform X1 [Cucurbita maxima]0.0e+0086.95Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LSLSTFHSAFKSY+EGKIST PLL+FR+LLR RV PNDSTFSLLIKAFV+SSSSSSFAP+S SENA+AEANQLQTHF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSG++HD FELFVEMRRRGF P QRT+VSLIP+CG+Q LFVQGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN+LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLEES+N +SVTMV+ILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYV+CG I IAE IYMS LQK+LV+LTAIIS Y EKGD+ +VVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYS+FDDI AVFSLF EMHEKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE LH Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE+VFKSMKEPCLASWNSMISGYGLFGFDNH  LCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDV +VRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

XP_023543683.1 pentatricopeptide repeat-containing protein At2g04860 [Cucurbita pepo subsp. pepo]0.0e+0087.66Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LS STFHSAFKSY+EGKIST PLL+FR+LLRYRV PNDSTFSLLIKAFV+SSSSSSFAP+S SENAK EANQLQTHF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSGY+HDAFELFVEMRRRGF P QRT+VSLIP+CG+Q LFVQGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN+LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLE S+N NSVTMV+ILSANANP SIHCYATKTGL+ENVSVV SL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYV+CG IQIAELIYMS LQK+LV+LTAIIS Y EKGD+ SVVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYSKFDDI AVF+LF EMHEKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE LH Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE VFKSMKEPCLASWNS+ISGYGLFGFDNHA LCYT M+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

TrEMBL top hitse value%identityAlignment
A0A0A0KMV8 Uncharacterized protein0.0e+0084.8Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GHPA LSL+TFHSAFK Y+EGK  T PLLLFR LLR+RV PNDSTFSLLIKAFV+SSS+SSFAP+  SEN KAEANQLQTHF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAA+RLFDD PEKDVV WNALISGY+R G SHDAF+LFVEMRRR F P QRT+VSL+P+CG+QQLFVQGKSIH LG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKNAL SMY KCAD++GV+L FGE+ EKSVVSWNT IGAFGQNG F EAMLVFKQMLEESVNANSVTMV+ILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        VCSYV+CG I++AELIYMS L+K+LV+LTAIIS Y EKGD+ SVV+LYS +QHLDMKLDAVAM+GIIQG TYPDH GIGL FHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYSKFD+I AVFSLF EMH+KTLSSWNSVISSCAQAGRSIDAM LFS+M LSGYGPDSIT+ASLLSACCQNGNLHFGE LHCY LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        ALVDMYVKCGR+DFAE VFKSMKEPCLASWNS+ISGYGLFGF NHALLCYT+M+EKGIKPNKITFSGILAACTHGGLV EG+ YFKIM ++FGI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVG+LGRAGLFEEAIVFI+NME NPDSAVWGALLSACCIHQEVKLGESVAKKL FSNCRNGGFFVLMSNLYAAS RWNDV R+RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLM
        VSL+
Subjt:  VSLM

A0A5D3CM04 Pentatricopeptide repeat-containing protein0.0e+0084.4Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF P  GHPA LSL+TFHSAFK Y+EGK  T PLLLFR+LLR++V PNDSTFSLLIKAFV+SSSS        SEN KAEANQLQTHF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYS+LG VKAARRLFDD PEKDVV WNALISGY+R GYSHDAF+LFVEMRRRGF P QRT+VSL+P+CG+Q+LFVQGKSIH LG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN L SMY KCAD+EGV+L FGE+ EK+VVSWNT IGAFGQNGFF+EAMLVFKQMLEESV+ANSVTMV+ILSANAN G IHCYATK GLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        VCSYV+CG I+IAELIYMS LQK+LV+LTAIIS Y EKGD+ SVV+LYS +QHLDMKLDAVAM+GIIQG TYPDH GIGL FHGYG+KSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYSKFD+I AVFSLF EMH+KTLSSWNSVISS AQAGRSIDAM LFS+M LSGYGPDSIT+ASLLSACCQNGNLHFGE LHCY LRN++DLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        ALVDMYVKCGR+DFAE VFKSMKEPCLASWNS+ISGYGLFGF N ALLCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+ YFK M +EFGI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNME NPDSAVWGALLSACCIHQE+KLGESVAKKL FSNCRNGGFFVLMSNLYAASGRWNDV ++RKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

A0A6J1BXV1 pentatricopeptide repeat-containing protein At2g048600.0e+0099.01Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQFAP GGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKNALASMYAKC+DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSY EKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYSKFDDIAAVFSLFHEMHEKTLSSWN+VISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCY+LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAAS RWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

A0A6J1GEF9 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0087.09Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LSLSTFHSAFKSY+EGKIST PLL+FR+LLR RV PNDSTFSLLIKAFV+SSSSSSFAP S SENAKAEANQLQ HF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSGY+HDAFELFVEMRRRGF P QRT+VSLIP+CG+Q LF QGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN+LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLEE ++ NSVTMV+ILSANANP SIHCYATKTGLVENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYVRCG IQIAELIYMS LQK+LV+LTAIIS Y EKGD+ SVVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYS+FDDI AVFSLF EM EKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE +H Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE+VFKSMKEPCLASWNS+ISGYGLFGF+NHA LCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGALL+ACCIHQEVKLGESVAK+LLFSN RNGGFFVLMSNLYAASGRWNDV RVRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

A0A6J1IKP6 pentatricopeptide repeat-containing protein At2g04860 isoform X10.0e+0086.95Show/hide
Query:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV
        MQF    GH A LSLSTFHSAFKSY+EGKIST PLL+FR+LLR RV PNDSTFSLLIKAFV+SSSSSSFAP+S SENA+AEANQLQTHF+KWGFDQFLYV
Subjt:  MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYV

Query:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH
        STAFL+LYSKLG VKAARRLFDDIPEKDVV WNALISGYSRSG++HD FELFVEMRRRGF P QRT+VSLIP+CG+Q LFVQGK IHALG+KAGLDLDS 
Subjt:  STAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSH

Query:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL
        VKN+LASMY KCAD+EGVEL FGE++EK+VVSWNT IGAFGQNGFFVEAMLVFKQMLEES+N +SVTMV+ILSANANP SIHCYATKTGL+ENVSVVTSL
Subjt:  VKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSL

Query:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG
        +CSYV+CG I IAE IYMS LQK+LV+LTAIIS Y EKGD+ +VVKLYSR+QHL+MKLDAVAM+GIIQGITYPDH GIGL+FHGYGLKSGLIIDCLVANG
Subjt:  VCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANG

Query:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT
        FISMYS+FDDI AVFSLF EMHEKTLSSWNSVISSCAQAGRSIDAM LFS+MKLSGYGPDSIT+ASLLSACCQNGNLHFGE LH Y LRNNLDLEGFVGT
Subjt:  FISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGT

Query:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH
        AL+DMYVKCGR+DFAE+VFKSMKEPCLASWNSMISGYGLFGFDNH  LCYTKM+EKGIKPNKITFSGILAACTHGGLV EG+TYF+IM +E GI PE QH
Subjt:  ALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQH

Query:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG
        CASMVGLLGRAGLFEEAI+FIKNME+NPDSAVWGA LSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDV +VRKMMREMGEDGCSG
Subjt:  CASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSG

Query:  VSLME
        VSLME
Subjt:  VSLME

SwissProt top hitse value%identityAlignment
Q0WN60 Pentatricopeptide repeat-containing protein At1g184856.3e-10232.86Show/hide
Query:  VKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRR----GFKPRQRTVVSLIPACGSQQLFVQGKS
        VK G  + ++V  A ++ Y   G V  A +LFD +PE+++V WN++I  +S +G+S ++F L  EM        F P   T+V+++P C  ++    GK 
Subjt:  VKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRR----GFKPRQRTVVSLIPACGSQQLFVQGKS

Query:  IHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLE--ESVNANSVTMVTIL------SANAN
        +H   +K  LD +  + NAL  MY+KC  +   ++ F     K+VVSWNT +G F   G       V +QML   E V A+ VT++  +      S   +
Subjt:  IHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLE--ESVNANSVTMVTIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFG
           +HCY+ K   V N  V  + V SY +CG++  A+ ++  +  K + S  A+I  + +  D +  +  + +M+   +  D+  +  ++   +      
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFG

Query:  IGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNL
        +G   HG+ +++ L  D  V    +S+Y    ++  V +LF  M +K+L SWN+VI+   Q G    A+ +F +M L G     I++  +  AC    +L
Subjt:  IGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNL

Query:  HFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGL
          G   H Y+L++ L+ + F+  +L+DMY K G +  + +VF  +KE   ASWN+MI GYG+ G    A+  + +M   G  P+ +TF G+L AC H GL
Subjt:  HFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGL

Query:  VREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAA
        + EG  Y   M   FG+ P L+H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+SNLYA 
Subjt:  VREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAA

Query:  SGRWNDVVRVRKMMREMG---EDGCSGVSL
         G+W DV +VR+ M EM    + GCS + L
Subjt:  SGRWNDVVRVRKMMREMG---EDGCSGVSL

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.3e-10233.18Show/hide
Query:  SSSSFAPASSSENAKA----------EANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVE
        S  ++ PA+  E+  A          E  Q+     K G  Q  +  T  ++L+ + G V  A R+F+ I  K  VL++ ++ G+++      A + FV 
Subjt:  SSSSFAPASSSENAKA----------EANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVE

Query:  MRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFK
        MR    +P       L+  CG +     GK IH L +K+G  LD      L +MYAKC  V      F  M E+ +VSWNT +  + QNG    A+ + K
Subjt:  MRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFK

Query:  QMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLY
         M EE++  + +T+V++L A +          IH YA ++G    V++ T+LV  Y +CG+++ A  ++  +L++++VS  ++I +Y +  + K  + ++
Subjt:  QMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLY

Query:  SRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVL
         +M    +K   V+++G +           G   H   ++ GL  +  V N  ISMY K  ++    S+F ++  +TL SWN++I   AQ GR IDA+  
Subjt:  SRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVL

Query:  FSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALL
        FS+M+     PD+ T  S+++A  +    H  + +H   +R+ LD   FV TALVDMY KCG +  A  +F  M E  + +WN+MI GYG  GF   AL 
Subjt:  FSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALL

Query:  CYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKL
         + +M +  IKPN +TF  +++AC+H GLV  G   F +M E + I   + H  +MV LLGRAG   EA  FI  M + P   V+GA+L AC IH+ V  
Subjt:  CYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKL

Query:  GESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME
         E  A++L   N  +GG+ VL++N+Y A+  W  V +VR  M   G     G S++E
Subjt:  GESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic1.2e-10533.71Show/hide
Query:  VSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMR-RRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLD
        +  AFL ++ + G++  A  +F  + E+++  WN L+ GY++ GY  +A  L+  M    G KP   T   ++  CG      +GK +H   ++ G +LD
Subjt:  VSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMR-RRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLD

Query:  SHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPG------SIHCYATKTGLVE
          V NAL +MY KC DV+   L F  M  + ++SWN  I  + +NG   E + +F  M   SV+ + +T+ +++SA    G       IH Y   TG   
Subjt:  SHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLI
        ++SV  SL   Y+  G+ + AE ++  + +KD+VS T +IS Y         +  Y  M    +K D + +  ++           G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNL
           +VAN  I+MYSK   I     +FH +  K + SW S+I+      R  +A++   +MK++   P++IT+ + L+AC + G L  G+ +H + LR  +
Subjt:  IDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNL

Query:  DLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEF
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN +++GY   G  +  +  + +M++  ++P++ITF  +L  C+   +VR+G  YF   ME++
Subjt:  DLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEF

Query:  GIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMRE
        G+ P L+H A +V LLGRAG  +EA  FI+ M + PD AVWGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +V +VR+MM+E
Subjt:  GIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

Q9SJ73 Pentatricopeptide repeat-containing protein At2g048606.1e-20652.83Show/hide
Query:  LSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHV
        LS FHS  KS I G+IS+SP+ +FR LLR  +TPN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ LNLY K G V
Subjt:  LSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHV

Query:  KAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCAD
         +A+ LFD++PE+D V+WNALI GYSR+GY  DA++LF+ M ++GF P   T+V+L+P CG      QG+S+H +  K+GL+LDS VKNAL S Y+KCA+
Subjt:  KAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCAD

Query:  VEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAE
        +   E+ F EM +KS VSWNT IGA+ Q+G   EA+ VFK M E++V  + VT++ +LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCG +  AE
Subjt:  VEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAE

Query:  LIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAV
         +Y S  Q  +V LT+I+S Y EKGD+   V  +S+ + L MK+DAVA++GI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFDD+  V
Subjt:  LIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAV

Query:  FSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLS-GYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLD
          LF ++ E  L SWNSVIS C Q+GR+  A  +F +M L+ G  PD+ITIASLL+ C Q   L+ G+ LH Y+LRNN + E FV TAL+DMY KCG   
Subjt:  FSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLS-GYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLD

Query:  FAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGL
         AE VFKS+K PC A+WNSMISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V EGK  F+ M++EFGI+P LQH A MVGLLGRA L
Subjt:  FAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVS
        F EA+  I  M+I PDSAVWGALLSAC IH+E+++GE VA+K+   + +NGG +VLMSNLYA    W+DVVRVR MM++ G DG  GVS
Subjt:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVS

Q9STE1 Pentatricopeptide repeat-containing protein At4g213009.7e-10329.61Show/hide
Query:  SLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGH
        S+  ++S   S++   +    L  + ++L + V+P+ STF  L+KA V   +       S + ++              G D   +V+++ +  Y + G 
Subjt:  SLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGH

Query:  VKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCA
        +    +LFD + +KD V+WN +++GY++ G      + F  MR     P   T   ++  C S+ L   G  +H L + +G+D +  +KN+L SMY+KC 
Subjt:  VKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCA

Query:  DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +     F  M     V+WN  I  + Q+G   E++  F +M+   V  +++T  ++L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      D+V  TA+IS Y   G     ++++  +  + +  + + ++ I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYV
           +   + +F  + ++ + SWNS+I+ CAQ+     A+ +F +M +SG   D ++I++ LSAC    +  FG+ +H + ++++L  + +  + L+DMY 
Subjt:  FDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYV

Query:  KCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEK-GIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVG
        KCG L  A  VFK+MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V EG  +F+ M E++GI P+ +H A +V 
Subjt:  KCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEK-GIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL+  +  N G++VL+SN +A +  W  V +VR +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein9.0e-10433.18Show/hide
Query:  SSSSFAPASSSENAKA----------EANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVE
        S  ++ PA+  E+  A          E  Q+     K G  Q  +  T  ++L+ + G V  A R+F+ I  K  VL++ ++ G+++      A + FV 
Subjt:  SSSSFAPASSSENAKA----------EANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVE

Query:  MRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFK
        MR    +P       L+  CG +     GK IH L +K+G  LD      L +MYAKC  V      F  M E+ +VSWNT +  + QNG    A+ + K
Subjt:  MRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFK

Query:  QMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLY
         M EE++  + +T+V++L A +          IH YA ++G    V++ T+LV  Y +CG+++ A  ++  +L++++VS  ++I +Y +  + K  + ++
Subjt:  QMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLY

Query:  SRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVL
         +M    +K   V+++G +           G   H   ++ GL  +  V N  ISMY K  ++    S+F ++  +TL SWN++I   AQ GR IDA+  
Subjt:  SRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVL

Query:  FSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALL
        FS+M+     PD+ T  S+++A  +    H  + +H   +R+ LD   FV TALVDMY KCG +  A  +F  M E  + +WN+MI GYG  GF   AL 
Subjt:  FSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALL

Query:  CYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKL
         + +M +  IKPN +TF  +++AC+H GLV  G   F +M E + I   + H  +MV LLGRAG   EA  FI  M + P   V+GA+L AC IH+ V  
Subjt:  CYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKL

Query:  GESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME
         E  A++L   N  +GG+ VL++N+Y A+  W  V +VR  M   G     G S++E
Subjt:  GESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.7e-10733.71Show/hide
Query:  VSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMR-RRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLD
        +  AFL ++ + G++  A  +F  + E+++  WN L+ GY++ GY  +A  L+  M    G KP   T   ++  CG      +GK +H   ++ G +LD
Subjt:  VSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMR-RRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLD

Query:  SHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPG------SIHCYATKTGLVE
          V NAL +MY KC DV+   L F  M  + ++SWN  I  + +NG   E + +F  M   SV+ + +T+ +++SA    G       IH Y   TG   
Subjt:  SHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPG------SIHCYATKTGLVE

Query:  NVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLI
        ++SV  SL   Y+  G+ + AE ++  + +KD+VS T +IS Y         +  Y  M    +K D + +  ++           G+  H   +K+ LI
Subjt:  NVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLI

Query:  IDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNL
           +VAN  I+MYSK   I     +FH +  K + SW S+I+      R  +A++   +MK++   P++IT+ + L+AC + G L  G+ +H + LR  +
Subjt:  IDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNL

Query:  DLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEF
         L+ F+  AL+DMYV+CGR++ A   F S K+  + SWN +++GY   G  +  +  + +M++  ++P++ITF  +L  C+   +VR+G  YF   ME++
Subjt:  DLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEF

Query:  GIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMRE
        G+ P L+H A +V LLGRAG  +EA  FI+ M + PD AVWGALL+AC IH ++ LGE  A+ +   + ++ G+++L+ NLYA  G+W +V +VR+MM+E
Subjt:  GIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMRE

Query:  MG---EDGCSGVSL
         G   + GCS V +
Subjt:  MG---EDGCSGVSL

AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-10332.86Show/hide
Query:  VKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRR----GFKPRQRTVVSLIPACGSQQLFVQGKS
        VK G  + ++V  A ++ Y   G V  A +LFD +PE+++V WN++I  +S +G+S ++F L  EM        F P   T+V+++P C  ++    GK 
Subjt:  VKWGFDQFLYVSTAFLNLYSKLGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRR----GFKPRQRTVVSLIPACGSQQLFVQGKS

Query:  IHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLE--ESVNANSVTMVTIL------SANAN
        +H   +K  LD +  + NAL  MY+KC  +   ++ F     K+VVSWNT +G F   G       V +QML   E V A+ VT++  +      S   +
Subjt:  IHALGIKAGLDLDSHVKNALASMYAKCADVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLE--ESVNANSVTMVTIL------SANAN

Query:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFG
           +HCY+ K   V N  V  + V SY +CG++  A+ ++  +  K + S  A+I  + +  D +  +  + +M+   +  D+  +  ++   +      
Subjt:  PGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFG

Query:  IGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNL
        +G   HG+ +++ L  D  V    +S+Y    ++  V +LF  M +K+L SWN+VI+   Q G    A+ +F +M L G     I++  +  AC    +L
Subjt:  IGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNL

Query:  HFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGL
          G   H Y+L++ L+ + F+  +L+DMY K G +  + +VF  +KE   ASWN+MI GYG+ G    A+  + +M   G  P+ +TF G+L AC H GL
Subjt:  HFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGL

Query:  VREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAA
        + EG  Y   M   FG+ P L+H A ++ +LGRAG  ++A+ V  + M    D  +W +LLS+C IHQ +++GE VA KL          +VL+SNLYA 
Subjt:  VREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAI-VFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAA

Query:  SGRWNDVVRVRKMMREMG---EDGCSGVSL
         G+W DV +VR+ M EM    + GCS + L
Subjt:  SGRWNDVVRVRKMMREMG---EDGCSGVSL

AT2G04860.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.3e-20752.83Show/hide
Query:  LSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHV
        LS FHS  KS I G+IS+SP+ +FR LLR  +TPN  T S+ ++A   ++S +SF         K +  Q+QTH  K G D+F+YV T+ LNLY K G V
Subjt:  LSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGHV

Query:  KAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCAD
         +A+ LFD++PE+D V+WNALI GYSR+GY  DA++LF+ M ++GF P   T+V+L+P CG      QG+S+H +  K+GL+LDS VKNAL S Y+KCA+
Subjt:  KAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCAD

Query:  VEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAE
        +   E+ F EM +KS VSWNT IGA+ Q+G   EA+ VFK M E++V  + VT++ +LSA+ +   +HC   K G+V ++SVVTSLVC+Y RCG +  AE
Subjt:  VEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAE

Query:  LIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAV
         +Y S  Q  +V LT+I+S Y EKGD+   V  +S+ + L MK+DAVA++GI+ G     H  IG++ HGY +KSGL    LV NG I+MYSKFDD+  V
Subjt:  LIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAV

Query:  FSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLS-GYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLD
          LF ++ E  L SWNSVIS C Q+GR+  A  +F +M L+ G  PD+ITIASLL+ C Q   L+ G+ LH Y+LRNN + E FV TAL+DMY KCG   
Subjt:  FSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLS-GYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLD

Query:  FAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGL
         AE VFKS+K PC A+WNSMISGY L G  + AL CY +M EKG+KP++ITF G+L+AC HGG V EGK  F+ M++EFGI+P LQH A MVGLLGRA L
Subjt:  FAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGL

Query:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVS
        F EA+  I  M+I PDSAVWGALLSAC IH+E+++GE VA+K+   + +NGG +VLMSNLYA    W+DVVRVR MM++ G DG  GVS
Subjt:  FEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVS

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-10429.61Show/hide
Query:  SLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGH
        S+  ++S   S++   +    L  + ++L + V+P+ STF  L+KA V   +       S + ++              G D   +V+++ +  Y + G 
Subjt:  SLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSKLGH

Query:  VKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCA
        +    +LFD + +KD V+WN +++GY++ G      + F  MR     P   T   ++  C S+ L   G  +H L + +G+D +  +KN+L SMY+KC 
Subjt:  VKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCA

Query:  DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC
          +     F  M     V+WN  I  + Q+G   E++  F +M+   V  +++T  ++L + +          IHCY  +  +  ++ + ++L+ +Y +C
Subjt:  DVEGVELFFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANAN------PGSIHCYATKTGLVENVSVVTSLVCSYVRC

Query:  GNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSK
          + +A+ I+      D+V  TA+IS Y   G     ++++  +  + +  + + ++ I+  I       +G   HG+ +K G    C +    I MY+K
Subjt:  GNIQIAELIYMSLLQKDLVSLTAIISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSK

Query:  FDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYV
           +   + +F  + ++ + SWNS+I+ CAQ+     A+ +F +M +SG   D ++I++ LSAC    +  FG+ +H + ++++L  + +  + L+DMY 
Subjt:  FDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAGRSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYV

Query:  KCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEK-GIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVG
        KCG L  A  VFK+MKE  + SWNS+I+  G  G    +L  + +M+EK GI+P++ITF  I+++C H G V EG  +F+ M E++GI P+ +H A +V 
Subjt:  KCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCYTKMIEK-GIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVG

Query:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME
        L GRAG   EA   +K+M   PD+ VWG LL AC +H+ V+L E  + KL+  +  N G++VL+SN +A +  W  V +VR +M+E       G S +E
Subjt:  LLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSNCRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATTTGCGCCTCCGGGAGGCCATCCGGCACCTCTCTCGCTCTCTACCTTCCATTCTGCATTCAAGTCTTACATCGAAGGCAAAATTTCCACTTCACCCTTGTTGCT
TTTCCGTCGGTTGCTTAGGTATCGGGTTACACCCAATGATTCTACCTTCTCCTTACTCATCAAAGCCTTCGTTTTATCGTCTTCATCTTCTTCTTTTGCCCCAGCGTCCT
CCTCTGAGAATGCTAAAGCGGAAGCGAATCAGCTCCAAACCCACTTCGTCAAATGGGGATTTGACCAATTTTTGTATGTTAGTACCGCTTTTCTCAACTTGTATTCAAAA
TTGGGTCATGTAAAAGCTGCTCGACGTTTGTTTGATGATATTCCTGAGAAAGATGTAGTATTGTGGAATGCGTTGATTTCTGGGTACTCACGAAGTGGTTACAGCCATGA
TGCCTTCGAACTATTTGTCGAAATGCGCAGAAGGGGTTTTAAACCTCGTCAGAGGACGGTGGTAAGTTTAATTCCTGCCTGTGGTAGCCAACAATTATTCGTTCAAGGAA
AATCCATCCATGCGTTAGGTATTAAGGCTGGCCTAGATTTGGACTCCCATGTGAAAAATGCTCTGGCGTCGATGTATGCTAAATGTGCAGATGTAGAAGGGGTGGAACTC
TTCTTTGGAGAAATGATGGAAAAAAGCGTAGTTTCTTGGAATACTAGGATTGGGGCATTTGGGCAAAATGGGTTCTTCGTGGAGGCAATGCTTGTTTTCAAACAAATGCT
TGAGGAAAGTGTCAATGCTAACTCAGTGACGATGGTGACTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACTGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCGTATGTAAGATGTGGAAACATACAAATAGCAGAATTGATTTATATGTCACTACTCCAGAAAGACTTGGTTTCCTTAACTGCA
ATTATTTCAAGCTACGGTGAGAAAGGAGACATTAAATCTGTGGTGAAGTTGTATTCCCGAATGCAGCATCTAGACATGAAACTAGATGCAGTTGCAATGATTGGCATAAT
CCAAGGAATTACATATCCTGATCACTTTGGCATTGGACTTACTTTCCACGGTTATGGGCTAAAGAGTGGGTTAATAATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGACGATATTGCTGCAGTGTTTTCTTTATTTCATGAGATGCATGAAAAGACATTGAGCAGCTGGAATTCTGTGATATCTAGCTGTGCACAGGCAGGA
AGATCAATTGATGCCATGGTTTTGTTTTCCAAAATGAAGTTGTCAGGTTATGGGCCAGATTCAATTACAATTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGAGACTTCATTGCTATAGTCTAAGAAACAATCTGGATTTGGAAGGTTTTGTTGGAACTGCTCTTGTAGACATGTACGTCAAGTGTGGCAGATTGGACT
TTGCTGAAAGGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGATGATCTCAGGGTACGGTTTATTTGGGTTTGACAACCATGCCCTCCTTTGTTAC
ACTAAAATGATAGAGAAGGGGATAAAACCCAATAAAATTACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTAGAGAAGGTAAAACATACTTCAAAAT
CATGATGGAAGAATTTGGTATAGCGCCTGAATTGCAACATTGTGCATCCATGGTTGGCTTGCTTGGTCGGGCGGGCTTATTTGAGGAGGCAATTGTGTTTATCAAGAACA
TGGAAATCAATCCAGATTCTGCAGTGTGGGGAGCTTTGCTCAGTGCTTGCTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCCAAGAAGTTGCTTTTCTCAAAC
TGTAGAAATGGGGGATTCTTTGTATTGATGTCAAACCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGTAAGAGTCAGAAAGATGATGAGAGAAATGGGAGAAGATGG
ATGTTCAGGTGTTAGCCTTATGGAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATTTGCGCCTCCGGGAGGCCATCCGGCACCTCTCTCGCTCTCTACCTTCCATTCTGCATTCAAGTCTTACATCGAAGGCAAAATTTCCACTTCACCCTTGTTGCT
TTTCCGTCGGTTGCTTAGGTATCGGGTTACACCCAATGATTCTACCTTCTCCTTACTCATCAAAGCCTTCGTTTTATCGTCTTCATCTTCTTCTTTTGCCCCAGCGTCCT
CCTCTGAGAATGCTAAAGCGGAAGCGAATCAGCTCCAAACCCACTTCGTCAAATGGGGATTTGACCAATTTTTGTATGTTAGTACCGCTTTTCTCAACTTGTATTCAAAA
TTGGGTCATGTAAAAGCTGCTCGACGTTTGTTTGATGATATTCCTGAGAAAGATGTAGTATTGTGGAATGCGTTGATTTCTGGGTACTCACGAAGTGGTTACAGCCATGA
TGCCTTCGAACTATTTGTCGAAATGCGCAGAAGGGGTTTTAAACCTCGTCAGAGGACGGTGGTAAGTTTAATTCCTGCCTGTGGTAGCCAACAATTATTCGTTCAAGGAA
AATCCATCCATGCGTTAGGTATTAAGGCTGGCCTAGATTTGGACTCCCATGTGAAAAATGCTCTGGCGTCGATGTATGCTAAATGTGCAGATGTAGAAGGGGTGGAACTC
TTCTTTGGAGAAATGATGGAAAAAAGCGTAGTTTCTTGGAATACTAGGATTGGGGCATTTGGGCAAAATGGGTTCTTCGTGGAGGCAATGCTTGTTTTCAAACAAATGCT
TGAGGAAAGTGTCAATGCTAACTCAGTGACGATGGTGACTATCTTGTCTGCAAATGCAAATCCAGGATCTATCCATTGTTATGCTACCAAAACTGGTCTTGTGGAAAATG
TTTCCGTGGTTACCTCTCTAGTTTGCTCGTATGTAAGATGTGGAAACATACAAATAGCAGAATTGATTTATATGTCACTACTCCAGAAAGACTTGGTTTCCTTAACTGCA
ATTATTTCAAGCTACGGTGAGAAAGGAGACATTAAATCTGTGGTGAAGTTGTATTCCCGAATGCAGCATCTAGACATGAAACTAGATGCAGTTGCAATGATTGGCATAAT
CCAAGGAATTACATATCCTGATCACTTTGGCATTGGACTTACTTTCCACGGTTATGGGCTAAAGAGTGGGTTAATAATTGATTGTTTGGTTGCAAATGGTTTCATAAGCA
TGTACTCGAAGTTCGACGATATTGCTGCAGTGTTTTCTTTATTTCATGAGATGCATGAAAAGACATTGAGCAGCTGGAATTCTGTGATATCTAGCTGTGCACAGGCAGGA
AGATCAATTGATGCCATGGTTTTGTTTTCCAAAATGAAGTTGTCAGGTTATGGGCCAGATTCAATTACAATTGCTAGTTTACTATCTGCTTGTTGCCAAAATGGGAATTT
GCATTTTGGGGAGAGACTTCATTGCTATAGTCTAAGAAACAATCTGGATTTGGAAGGTTTTGTTGGAACTGCTCTTGTAGACATGTACGTCAAGTGTGGCAGATTGGACT
TTGCTGAAAGGGTGTTTAAGAGCATGAAAGAGCCATGTTTAGCTTCATGGAACTCGATGATCTCAGGGTACGGTTTATTTGGGTTTGACAACCATGCCCTCCTTTGTTAC
ACTAAAATGATAGAGAAGGGGATAAAACCCAATAAAATTACTTTCTCAGGAATTTTAGCTGCTTGTACTCATGGAGGACTTGTTAGAGAAGGTAAAACATACTTCAAAAT
CATGATGGAAGAATTTGGTATAGCGCCTGAATTGCAACATTGTGCATCCATGGTTGGCTTGCTTGGTCGGGCGGGCTTATTTGAGGAGGCAATTGTGTTTATCAAGAACA
TGGAAATCAATCCAGATTCTGCAGTGTGGGGAGCTTTGCTCAGTGCTTGCTGCATTCACCAGGAAGTTAAGCTTGGGGAATCTGTGGCCAAGAAGTTGCTTTTCTCAAAC
TGTAGAAATGGGGGATTCTTTGTATTGATGTCAAACCTTTATGCAGCATCAGGGAGGTGGAATGATGTAGTAAGAGTCAGAAAGATGATGAGAGAAATGGGAGAAGATGG
ATGTTCAGGTGTTAGCCTTATGGAA
Protein sequenceShow/hide protein sequence
MQFAPPGGHPAPLSLSTFHSAFKSYIEGKISTSPLLLFRRLLRYRVTPNDSTFSLLIKAFVLSSSSSSFAPASSSENAKAEANQLQTHFVKWGFDQFLYVSTAFLNLYSK
LGHVKAARRLFDDIPEKDVVLWNALISGYSRSGYSHDAFELFVEMRRRGFKPRQRTVVSLIPACGSQQLFVQGKSIHALGIKAGLDLDSHVKNALASMYAKCADVEGVEL
FFGEMMEKSVVSWNTRIGAFGQNGFFVEAMLVFKQMLEESVNANSVTMVTILSANANPGSIHCYATKTGLVENVSVVTSLVCSYVRCGNIQIAELIYMSLLQKDLVSLTA
IISSYGEKGDIKSVVKLYSRMQHLDMKLDAVAMIGIIQGITYPDHFGIGLTFHGYGLKSGLIIDCLVANGFISMYSKFDDIAAVFSLFHEMHEKTLSSWNSVISSCAQAG
RSIDAMVLFSKMKLSGYGPDSITIASLLSACCQNGNLHFGERLHCYSLRNNLDLEGFVGTALVDMYVKCGRLDFAERVFKSMKEPCLASWNSMISGYGLFGFDNHALLCY
TKMIEKGIKPNKITFSGILAACTHGGLVREGKTYFKIMMEEFGIAPELQHCASMVGLLGRAGLFEEAIVFIKNMEINPDSAVWGALLSACCIHQEVKLGESVAKKLLFSN
CRNGGFFVLMSNLYAASGRWNDVVRVRKMMREMGEDGCSGVSLME