; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023580 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023580
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00000892:4687069..4705386
RNA-Seq ExpressionSgr023580
SyntenySgr023580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR027353 - NET domain
IPR036427 - Bromodomain-like superfamily
IPR037377 - Putative transcription factor GTE, bromodomain
IPR038336 - NET domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604761.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.38Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD IK H LLLQLLQACSKAP++K+TR LHALTITMGPVPNQAIFV+NN++FQY+SLG+LL+ARNLFD+MPHRNVVSYNT+ISA S+RGFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGFVPTQFTFGGL SADLLDVWQGAQLQ LSVKNG+FDA A+VGT LLGLYGR GC EEALRVFEDM WKSLVTWNSIL+LLGR+QLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM G MELSKFSFV +LSCFSR+EDLKFGQQLHGIV+KIGFYYEVLVVNSL+NMYLQCGGF+LAEKLFEEVP+RDVVTYNSII AG  + KPE+AL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY M   GLIPTQASFVN V+SCS + SSIYGEYFHSK IR A ESDVF GTALID YAKF+K+EEAR+CFDEI EKNLVSWN LI GYSTDCY+S M
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EMLHFGYRPNEFTFS IMK L+A EL QIHCLII+MGYEEN YVSS++ASSYAKHGLISDVLAY+S    QPSVVLSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        L  LE   IISWNI++E+CAKT NYFKVLALFK ML+LQI PDNYTFISLLSVCAKLCNLALGSSVHGV+IKTGS C DTFVCNLLI MYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VKDRNLITWTVLVSVLGLHG+A+EALERFAEMELSG +PDGVALGAVL A KHGGLVKEGMELFSKMKV+YGVEPEMDHYQC+VDLLS HG+VV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

XP_022133878.1 pentatricopeptide repeat-containing protein At3g58590-like [Momordica charantia]0.0e+0083.24Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD  KHH  LLQLLQACSKAPSLKATR LHA+TITMGPVPNQAIFV+NN+IFQY+S GML VARNLFDKMPHRN VSYNT+ISA S+ GFV EAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGFV TQFTFGGL SA+LLD WQG QLQ+LSVKNGLFDA A+VGTAL+ LYGR GC +EAL VF DM WKSLVTWN IL LLGRNQLVEECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM GGM LSKFSFVG+LSCFS EEDLKFGQQLHGIVIKIGFY EVLVVNSL+NMYLQCGGFFLAEKLF EVP+RDVVTYNSIIGA   +KKPEIAL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY MS +GLIPTQASFVN V SCSCL SSIYGEYFHSKIIR+ALESDV+ GT+L+  YAKFRKMEEARYCFDEIAEKNLVSWNTLI+G+STDCYTSS+
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLLLEML FGYRPNEFTFS I++TLLA EL QIHCLII+MGYEENDYVSSS+ASSYAKHGLISD L YVSDSNKQPSVVLSNI+AGY+NRVG+Y ET+KL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        LY LEEP I+SWNI+IEACAKT NY K L LFK MLMLQI PDNYTFISLLSVCAKLCNLALGSSVHG+IIKT   C DTF+CNLLIDMYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+V+DRNLITWT+L+S+LGLHG A+EALERFAEMELSGF PD VALGAVL ACKHGGLVKEGMELFSKMKVKYG+EPEM+HYQC+VDLLS+HGH   
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
         EK+
Subjt:  AEKI

XP_022947134.1 pentatricopeptide repeat-containing protein At3g58590 [Cucurbita moschata]0.0e+0082.95Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD IK H LLLQLLQACSKAP++K+TR LHALTITMGPVPNQAIFV+NN++FQY+SLG+LL+ARNLFD+MPHRNVVSYNT+ISA S+RGFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMR+CGFVPTQFTFGGL SADLLDVWQGAQLQ LSVKNG+FDA A+VGT LLGLYGR GC EEALRVFEDM WKSLVTWNSIL+LLGR+QLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM G MELSKFSFV +LSCFSR+EDLKFGQQLHGIV+KIGFYYEVLVVNSL+NMYLQCGGF+LAEKLFEEVP+ DVVTYNSII AG  + KPE+AL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY M   GLIPTQASFVN V+SCS + SSIYGEYFHSK IR A ESDVF GTALID YAKF+K+EEAR+CFDEI EKNLVSWN LI GYSTDCY+S M
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EMLHFGYRPNEFTFS IMK L+A EL QIHCLII+MGYEEN YVSS++ASSYAKHGLISDVLAY+S    QPSV LSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        L  LE   IISWNI++E+CAKT NYFKVLALFK ML+LQI PDNYTFISLLSVCAKLCNLALGSSVHGV+IKTGS C DTFVCNLLI MYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VKDRNLITWTVL+SVLGLHG+A+EALERFAEMELSG +PDGVALGAVL ACKHGGLVKEGMELFSKMKV+YGVEPEMDHYQC+VDLLS HG+VV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

XP_023532810.1 pentatricopeptide repeat-containing protein At3g58590 [Cucurbita pepo subsp. pepo]0.0e+0083.1Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD IK H LLLQLLQACSKAP++K TR LHALTITMGPVPNQAIFV+NN+IFQY+SLG+LL+ARNLFD+MPHRNVVSYNT+ISA S+RGFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGFVPTQFTFGGL SADLLDVWQGAQLQ L+VK G+FDA A+VGT LLGLYGR GC EEALRVFEDM WKSLVTWNSIL+LLGR+QLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM G MELSKFSFV +LSCFSR+EDLKFGQQLHGIV KIGFYY+VLVVNSL+NMYLQCGGF+LAEKLFEEVP+RDVVTYNSII AG  + KPE+AL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY M   GLIPTQASFVN V+SCS + SSIYGEYFHSK IR A ESDVF GTALID YAKF+K+EEAR+CFDEI EKNLVSWN LI GYSTDCYTS M
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EMLHFGYRPNEFTFS IMK L+A EL QIHCLII+MGYEEN YVSSS+ASSYAKHGLISDVLAY+S    QPS VLSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
           LE   IISWNI++E+CAKT NYFKVLALFK ML+LQI PDNYTFISLLSVCAKLCNLALGSSVHGV+IKTGS C DTFVCNLLI MYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VKDRNLITWTVL+SVLGLHG+A+EALERFAEMELSG +PDGVALGAVL ACKHGGLVKEGMELF KMKV+YGVEPEMDHYQCVV+LLSTHG VV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

XP_038902940.1 pentatricopeptide repeat-containing protein At3g58590 [Benincasa hispida]0.0e+0084.94Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNG  IKHHHL+L LL+ACSKAPS K T+ LHALTITMGPVPNQAIFV+NN++FQY+S+GMLLVAR++FD+MP RNVVSYNTMIS  S+ GFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGF PTQFTFGGL S +LLDVWQGAQLQ LSVKNGLF +GAVVGT LLGLYGR GC +EALRVFEDM WKSLVTWNS+L+LLGRNQLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        +FCELMCGG+ELSKFSFVG+LSCFSREEDLKFGQ LHGIVIKIGFYYEV VVNSLVNMYLQCGGFFLA+KLFEEVP+RDVVTYNSIIG GA + +PEIAL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFYTM +NGLIPTQASFVNAVNSCSCL SSIYGEYFHSK I YALESDVF GTALID YA FRK+EEAR+CFDEIAEKNLVSWN LI GYS DCYTSSM
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EML FG RPNEFTFS IMKTLLA EL QIHCLII+MGYEENDYVSSS+ASSYAKHGLISDVLAY+SDSNKQPSVVLSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        L  LEEP +ISWNI+IEACAK DNYFKVL LFK ML+LQI PDNYTFISLLSVCAKL NLALGSSVHGVIIKTG GCCDTFVCNLLIDMYGKCGS+ C+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VK RNLITWTVL+SVLGLHG+ +EAL+RFAEME  G +PDGVALGAVL ACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLS+HGHVV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

TrEMBL top hitse value%identityAlignment
A0A1S3C2S6 pentatricopeptide repeat-containing protein At3g585900.0e+0082.93Show/hide
Query:  IKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRD
        IKHHHLLL LLQACSK PSLK TR LHALTITMGPVPNQAIFV+NN++ QYTS+GML +ARNLFD+MPHRNVVSYNTMIS   + GFVKEAW LFSEMR+
Subjt:  IKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRD

Query:  CGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMC
        CGF PTQFTFGGL S +LLDVWQGAQLQ LSVKNGLF +GA+VGTALLGLYGR GC EEALRV EDM WKSLVTWNSIL+LLGRNQLV+ECK +FCELMC
Subjt:  CGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMC

Query:  GGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMS
         GMELSKFSFVG+LSCFSREEDLKFGQ LHGIVIKIGFYYEVLVVNSL+NMYLQCGGFF A+KLFEEVP+RDVVTYNSII  G  + +PEIAL+LFY+M+
Subjt:  GGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMS

Query:  ANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEML
        ANGL PTQASFVNAVNSCSCLGSSIYGEYFHSK +RYALESDVF GTALID YAKF+K+EEA +CFDEIAEKN+VSWN LI+GYS +CYTSS YLL++ML
Subjt:  ANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEML

Query:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP
        HFGYRPNEFTFS IMKTLL  EL QIH LII+MGYEENDYVSSS+ASSYAKHGLISDVLAYVSDSNKQPSVV SNIVAGYYNRV  Y+ETQKLL  LE P
Subjt:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP

Query:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK
         +ISWNI+IEACAK + YFKVL LFK ML+ QI PDNYTF SLLSVCAKLCNLALGSS+HGV+IK GSG CDTFVCNLLIDMYGKCGS+ C+LKIFDEVK
Subjt:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK

Query:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI
         RNLITWTVL+SVLGLHG+A+EA++RFAEMEL G +PD VAL AVL ACKHGGLV+EGMELFSKMKVKYGVEPEM+HYQCVVDLLS+HGHVV+AEK+
Subjt:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI

A0A5A7UHJ9 Pentatricopeptide repeat-containing protein0.0e+0082.93Show/hide
Query:  IKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRD
        IKHHHLLL LLQACSK PSLK TR LHALTITMGPVPNQAIFV+NN++ QYTS+GML +ARNLFD+MPHRNVVSYNTMIS   + GFVKEAW LFSEMR+
Subjt:  IKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRD

Query:  CGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMC
        CGF PTQFTFGGL S +LLDVWQGAQLQ LSVKNGLF +GA+VGTALLGLYGR GC EEALRV EDM WKSLVTWNSIL+LLGRNQLV+ECK +FCELMC
Subjt:  CGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMC

Query:  GGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMS
         GMELSKFSFVG+LSCFSREEDLKFGQ LHGIVIKIGFYYEVLVVNSL+NMYLQCGGFF A+KLFEEVP+RDVVTYNSII  G  + +PEIAL+LFY+M+
Subjt:  GGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMS

Query:  ANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEML
        ANGL PTQASFVNAVNSCSCLGSSIYGEYFHSK +RYALESDVF GTALID YAKF+K+EEA +CFDEIAEKN+VSWN LI+GYS +CYTSS YLL++ML
Subjt:  ANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEML

Query:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP
        HFGYRPNEFTFS IMKTLL  EL QIH LII+MGYEENDYVSSS+ASSYAKHGLISDVLAYVSDSNKQPSVV SNIVAGYYNRV  Y+ETQKLL  LE P
Subjt:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP

Query:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK
         +ISWNI+IEACAK + YFKVL LFK ML+ QI PDNYTF SLLSVCAKLCNLALGSS+HGV+IK GSG CDTFVCNLLIDMYGKCGS+ C+LKIFDEVK
Subjt:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK

Query:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI
         RNLITWTVL+SVLGLHG+A+EA++RFAEMEL G +PD VAL AVL ACKHGGLV+EGMELFSKMKVKYGVEPEM+HYQCVVDLLS+HGHVV+AEK+
Subjt:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI

A0A6J1C0G3 pentatricopeptide repeat-containing protein At3g58590-like0.0e+0083.24Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD  KHH  LLQLLQACSKAPSLKATR LHA+TITMGPVPNQAIFV+NN+IFQY+S GML VARNLFDKMPHRN VSYNT+ISA S+ GFV EAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGFV TQFTFGGL SA+LLD WQG QLQ+LSVKNGLFDA A+VGTAL+ LYGR GC +EAL VF DM WKSLVTWN IL LLGRNQLVEECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM GGM LSKFSFVG+LSCFS EEDLKFGQQLHGIVIKIGFY EVLVVNSL+NMYLQCGGFFLAEKLF EVP+RDVVTYNSIIGA   +KKPEIAL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY MS +GLIPTQASFVN V SCSCL SSIYGEYFHSKIIR+ALESDV+ GT+L+  YAKFRKMEEARYCFDEIAEKNLVSWNTLI+G+STDCYTSS+
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLLLEML FGYRPNEFTFS I++TLLA EL QIHCLII+MGYEENDYVSSS+ASSYAKHGLISD L YVSDSNKQPSVVLSNI+AGY+NRVG+Y ET+KL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        LY LEEP I+SWNI+IEACAKT NY K L LFK MLMLQI PDNYTFISLLSVCAKLCNLALGSSVHG+IIKT   C DTF+CNLLIDMYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+V+DRNLITWT+L+S+LGLHG A+EALERFAEMELSGF PD VALGAVL ACKHGGLVKEGMELFSKMKVKYG+EPEM+HYQC+VDLLS+HGH   
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
         EK+
Subjt:  AEKI

A0A6J1G5W7 pentatricopeptide repeat-containing protein At3g585900.0e+0082.95Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD IK H LLLQLLQACSKAP++K+TR LHALTITMGPVPNQAIFV+NN++FQY+SLG+LL+ARNLFD+MPHRNVVSYNT+ISA S+RGFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMR+CGFVPTQFTFGGL SADLLDVWQGAQLQ LSVKNG+FDA A+VGT LLGLYGR GC EEALRVFEDM WKSLVTWNSIL+LLGR+QLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM G MELSKFSFV +LSCFSR+EDLKFGQQLHGIV+KIGFYYEVLVVNSL+NMYLQCGGF+LAEKLFEEVP+ DVVTYNSII AG  + KPE+AL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        +LFY M   GLIPTQASFVN V+SCS + SSIYGEYFHSK IR A ESDVF GTALID YAKF+K+EEAR+CFDEI EKNLVSWN LI GYSTDCY+S M
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EMLHFGYRPNEFTFS IMK L+A EL QIHCLII+MGYEEN YVSS++ASSYAKHGLISDVLAY+S    QPSV LSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
        L  LE   IISWNI++E+CAKT NYFKVLALFK ML+LQI PDNYTFISLLSVCAKLCNLALGSSVHGV+IKTGS C DTFVCNLLI MYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VKDRNLITWTVL+SVLGLHG+A+EALERFAEMELSG +PDGVALGAVL ACKHGGLVKEGMELFSKMKV+YGVEPEMDHYQC+VDLLS HG+VV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

A0A6J1I2K3 pentatricopeptide repeat-containing protein At3g585900.0e+0082.95Show/hide
Query:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY
        MSFNGD IK H LLLQLL+ACSKAP++K TR LHA TITMGPVPNQAIFV NN+IFQY+SLG+LL+ARNLFD+MPHRNVVSYNT+ISA S+RGFVKEAW 
Subjt:  MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWY

Query:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH
        LFSEMRDCGFVPTQFTFGGL SADLLDVWQGAQLQ LSVKNG+FDA A+VGT LLGLYGR GC EEALRVFEDM WKSLVTWNSIL+LLGR+QLV+ECK 
Subjt:  LFSEMRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKH

Query:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL
        LFCELM G  EL KFSFV +LSCFSR+EDLKFGQQLHGIV+KIGFYYEVLVVNSL+NMYLQCGGF+LAEKLF EVP+RDVVTYNSII AG  + KPE+AL
Subjt:  LFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIAL

Query:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM
        + FY+M   GLIPTQASFVN VNSCS + SSIYGEYFHSK IR A ESDVF GTALID YAKF+K+EEAR CFDEI EKNLVSWN LI GYSTDCYTS M
Subjt:  KLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSM

Query:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL
        YLL+EMLHF YRPNEFTFS IMK+LLA EL QIHCLII+MGYEEN YVSS++ASSYAKHGLISDVLAY+S    QPSVVLSNIVAGYYNRVG Y+ETQKL
Subjt:  YLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKL

Query:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL
           LE  GIISWNI++E+CAKT NYFKVLALFK ML+LQI PDNYTFISLLSVCAKLCNLALGSSVHGV+IKTGS C DTFVCNLLI MYGKCGS+GC+L
Subjt:  LYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSL

Query:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ
        KIFD+VKDRNLITWTVL+SVLGLHG+A+EALERFAEMELSG +PDGVALGAVL ACKHGGLVKEGMELFSKMKV+YGVEPEMDHYQC+VDLLS HG+VV+
Subjt:  KIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQ

Query:  AEKI
        AEK+
Subjt:  AEKI

SwissProt top hitse value%identityAlignment
Q0WN01 Pentatricopeptide repeat-containing protein At3g585906.4e-19048.86Show/hide
Query:  GDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSE
        GD   H+  ++ LL  C KAPS   T+ LHAL+IT+  V  Q ++V NNII  Y  LG + +A  +FD+MP RN VS+NT+I   S+ G V +AW +FSE
Subjt:  GDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSE

Query:  MRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCE
        MR  G++P Q T  GL S   LDV  G QL  LS+K GLF A A VGT LL LYGR   LE A +VFEDMP+KSL TWN +++LLG    ++EC   F E
Subjt:  MRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCE

Query:  LMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFY
        L+  G  L++ SF+G+L   S  +DL   +QLH    K G   E+ VVNSL++ Y +CG   +AE++F++    D+V++N+II A A  + P  ALKLF 
Subjt:  LMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFY

Query:  TMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLL
        +M  +G  P Q ++V+ +   S +     G   H  +I+   E+ +  G ALID YAK   +E++R CFD I +KN+V WN L+ GY+       + L L
Subjt:  TMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLL

Query:  EMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQL
        +ML  G+RP E+TFST +K+    EL Q+H +I++MGYE+NDYV SS+  SYAK+ L++D L  +  ++   SVV  NIVAG Y+R GQY+E+ KL+  L
Subjt:  EMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQL

Query:  EEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFD
        E+P  +SWNI I AC+++D + +V+ LFK ML   I PD YTF+S+LS+C+KLC+L LGSS+HG+I KT   C DTFVCN+LIDMYGKCGS+   +K+F+
Subjt:  EEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFD

Query:  EVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI
        E +++NLITWT L+S LG+HGY  EALE+F E    GF+PD V+  ++L AC+HGG+VKEGM LF KMK  YGVEPEMDHY+C VDLL+ +G++ +AE +
Subjt:  EVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI

Q9LNC4 Transcription factor GTE41.0e-13454.84Show/hide
Query:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV
        Q   G T+ +A   A+G+  ++   D + R++ AS  KQ+ +E+R+K E +L +VR +VK+IE  +G++ + ++ S V      +NG G   + S   S 
Subjt:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV

Query:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM
        G LP +     RP++QLSISVLEN +GVN+ +E+EKRTPKANQFYRNSEFLL  DK+PPAESNKKSK + KK G     HGFG GTK+F  C +LLE+LM
Subjt:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM

Query:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-
        KHKHGWVFN PVDV+GLGL DY++II HPMDLGT+K+ L KN YKSP+EFAEDVRLTFHNAMTYNP+GQDVH+MA  LL+IFE+RWA+IE+ Y +EMR  
Subjt:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-

Query:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL
         G E     PT  S   P +P PP+++R  + R++       S  QP +    TP S TPS RTP+LKKPKA +P KRDMTY EKQKLS +LQNLP +KL
Subjt:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL

Query:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI
        DAI+QI+ KRN  +  +++EIEVDIDSVD ETLWELDR V NY+K LSK KRKAELAI +A+AEA+ N Q++ APAP   +F RE     +  + +  P 
Subjt:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI

Query:  RGGQQQGHLSKTSSSSSSSSDSGSSSS
        +  +Q    S++SSSSSSSS S SS S
Subjt:  RGGQQQGHLSKTSSSSSSSSDSGSSSS

Q9SS60 Pentatricopeptide repeat-containing protein At3g035801.3e-7829.02Show/hide
Query:  QACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKM-PHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTF
        +A S + +L   R +HAL I++G   + + F +  +I +Y+       + ++F ++ P +NV  +N++I A S+ G   EA   + ++R+    P ++TF
Subjt:  QACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKM-PHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTF

Query:  -------GGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGM
                GLF A++ D+    Q+  +  ++ LF     VG AL+ +Y R G L  A +VF++MP + LV+WNS+++    +   EE   ++ EL    +
Subjt:  -------GGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGM

Query:  ELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANG
            F+   +L  F     +K GQ LHG  +K G    V+V N LV MYL+      A ++F+E+ +RD V+YN++I     ++  E ++++F   + + 
Subjt:  ELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANG

Query:  LIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIR--YALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGY-STDCYTSSMYLLLEML
          P   +  + + +C  L      +Y ++ +++  + LES V     LID YAK   M  AR  F+ +  K+ VSWN++I GY  +     +M L   M+
Subjt:  LIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIR--YALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGY-STDCYTSSMYLLLEML

Query:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP
            + +  T+  ++     L         +K G          + S+  K G+  D             + +SN +   Y + G+  ++ K+   +   
Subjt:  HFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEP

Query:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK
          ++WN +I AC +  ++   L +   M   ++ PD  TF+  L +CA L    LG  +H  +++ G    +  + N LI+MY KCG +  S ++F+ + 
Subjt:  GIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVK

Query:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEK
         R+++TWT ++   G++G   +ALE FA+ME SG  PD V   A++ AC H GLV EG+  F KMK  Y ++P ++HY CVVDLLS    + +AE+
Subjt:  DRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEK

Q9SS83 Pentatricopeptide repeat-containing protein At3g09040, mitochondrial4.0e-8330.75Show/hide
Query:  GPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLFSA--DLLDVWQGAQLQSLS
        G  P+   FV   +I  Y  LG L  AR LF +M   +VV++N MIS   +RG    A   F  MR      T+ T G + SA   + ++  G  + + +
Subjt:  GPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLFSA--DLLDVWQGAQLQSLS

Query:  VKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHG
        +K GL  +   VG++L+ +Y +   +E A +VFE +  K+ V WN+++     N    +   LF ++   G  +  F+F  +LS  +   DL+ G Q H 
Subjt:  VKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHG

Query:  IVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFH
        I+IK      + V N+LV+MY +CG    A ++FE +  RD VT+N+IIG+    +    A  LF  M+  G++   A   + + +C+ +     G+  H
Subjt:  IVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFH

Query:  SKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELF----QIH
           ++  L+ D+  G++LID Y+K   +++AR  F  + E ++VS N LI GYS +    ++ L  EML  G  P+E TF+TI++     E      Q H
Subjt:  SKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELF----QIH

Query:  CLIIKMGY-EENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFK
          I K G+  E +Y+  S+   Y     +++  A  S+ +   S+VL                               W  M+   ++   Y + L  +K
Subjt:  CLIIKMGY-EENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFK

Query:  SMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDR-NLITWTVLVSVLGLHGYAHEALE
         M    + PD  TF+++L VC+ L +L  G ++H +I        D    N LIDMY KCG +  S ++FDE++ R N+++W  L++    +GYA +AL+
Subjt:  SMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDR-NLITWTVLVSVLGLHGYAHEALE

Query:  RFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAE
         F  M  S   PD +    VL AC H G V +G ++F  M  +YG+E  +DH  C+VDLL   G++ +A+
Subjt:  RFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAE

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276102.3e-7829.88Show/hide
Query:  FQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLF--SADLLDVWQGAQLQSLSVKNGLFDAGAVVGTA
        F   S   L  A NLFDK P R+  SY +++   S+ G  +EA  LF  +   G       F  +   SA L D   G QL    +K G  D    VGT+
Subjt:  FQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLF--SADLLDVWQGAQLQSLSVKNGLFDAGAVVGTA

Query:  LLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVN
        L+  Y +    ++  +VF++M  +++VTW ++++   RN + +E   LF  +   G + + F+F   L   + E     G Q+H +V+K G    + V N
Subjt:  LLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVN

Query:  SLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAG
        SL+N+YL+CG    A  LF++  ++ VVT+NS+I   A       AL +FY+M  N +  +++SF + +  C+ L    + E  H  +++Y    D    
Subjt:  SLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAG

Query:  TALIDCYAKFRKMEEARYCFDEI-AEKNLVSWNTLIMGY-STDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSS
        TAL+  Y+K   M +A   F EI    N+VSW  +I G+   D    ++ L  EM   G RPNEFT+S I+  L  +   ++H  ++K  YE +  V ++
Subjt:  TALIDCYAKFRKMEEARYCFDEI-AEKNLVSWNTLIMGY-STDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSS

Query:  VASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLL
        +  +Y K                                +G+  E  K+   +++  I++W+ M+   A+T      + +F  +    I P+ +TF S+L
Subjt:  VASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLL

Query:  SVCAKL-CNLALGSSVHGVIIKT--GSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVA
        +VCA    ++  G   HG  IK+   S  C   V + L+ MY K G++  + ++F   ++++L++W  ++S    HG A +AL+ F EM+    + DGV 
Subjt:  SVCAKL-CNLALGSSVHGVIIKT--GSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVA

Query:  LGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI
           V AAC H GLV+EG + F  M     + P  +H  C+VDL S  G + +A K+
Subjt:  LGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI

Arabidopsis top hitse value%identityAlignment
AT1G06230.1 global transcription factor group E47.2e-13654.84Show/hide
Query:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV
        Q   G T+ +A   A+G+  ++   D + R++ AS  KQ+ +E+R+K E +L +VR +VK+IE  +G++ + ++ S V      +NG G   + S   S 
Subjt:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV

Query:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM
        G LP +     RP++QLSISVLEN +GVN+ +E+EKRTPKANQFYRNSEFLL  DK+PPAESNKKSK + KK G     HGFG GTK+F  C +LLE+LM
Subjt:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM

Query:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-
        KHKHGWVFN PVDV+GLGL DY++II HPMDLGT+K+ L KN YKSP+EFAEDVRLTFHNAMTYNP+GQDVH+MA  LL+IFE+RWA+IE+ Y +EMR  
Subjt:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-

Query:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL
         G E     PT  S   P +P PP+++R  + R++       S  QP +    TP S TPS RTP+LKKPKA +P KRDMTY EKQKLS +LQNLP +KL
Subjt:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL

Query:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI
        DAI+QI+ KRN  +  +++EIEVDIDSVD ETLWELDR V NY+K LSK KRKAELAI +A+AEA+ N Q++ APAP   +F RE     +  + +  P 
Subjt:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI

Query:  RGGQQQGHLSKTSSSSSSSSDSGSSSS
        +  +Q    S++SSSSSSSS S SS S
Subjt:  RGGQQQGHLSKTSSSSSSSSDSGSSSS

AT1G06230.2 global transcription factor group E47.2e-13654.84Show/hide
Query:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV
        Q   G T+ +A   A+G+  ++   D + R++ AS  KQ+ +E+R+K E +L +VR +VK+IE  +G++ + ++ S V      +NG G   + S   S 
Subjt:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV

Query:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM
        G LP +     RP++QLSISVLEN +GVN+ +E+EKRTPKANQFYRNSEFLL  DK+PPAESNKKSK + KK G     HGFG GTK+F  C +LLE+LM
Subjt:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM

Query:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-
        KHKHGWVFN PVDV+GLGL DY++II HPMDLGT+K+ L KN YKSP+EFAEDVRLTFHNAMTYNP+GQDVH+MA  LL+IFE+RWA+IE+ Y +EMR  
Subjt:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-

Query:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL
         G E     PT  S   P +P PP+++R  + R++       S  QP +    TP S TPS RTP+LKKPKA +P KRDMTY EKQKLS +LQNLP +KL
Subjt:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL

Query:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI
        DAI+QI+ KRN  +  +++EIEVDIDSVD ETLWELDR V NY+K LSK KRKAELAI +A+AEA+ N Q++ APAP   +F RE     +  + +  P 
Subjt:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI

Query:  RGGQQQGHLSKTSSSSSSSSDSGSSSS
        +  +Q    S++SSSSSSSS S SS S
Subjt:  RGGQQQGHLSKTSSSSSSSSDSGSSSS

AT1G06230.3 global transcription factor group E47.2e-13654.84Show/hide
Query:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV
        Q   G T+ +A   A+G+  ++   D + R++ AS  KQ+ +E+R+K E +L +VR +VK+IE  +G++ + ++ S V      +NG G   + S   S 
Subjt:  QASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALADNGRGAYHVYSEVGSV

Query:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM
        G LP +     RP++QLSISVLEN +GVN+ +E+EKRTPKANQFYRNSEFLL  DK+PPAESNKKSK + KK G     HGFG GTK+F  C +LLE+LM
Subjt:  GVLPDDT----RPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEKLM

Query:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-
        KHKHGWVFN PVDV+GLGL DY++II HPMDLGT+K+ L KN YKSP+EFAEDVRLTFHNAMTYNP+GQDVH+MA  LL+IFE+RWA+IE+ Y +EMR  
Subjt:  KHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRL-

Query:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL
         G E     PT  S   P +P PP+++R  + R++       S  QP +    TP S TPS RTP+LKKPKA +P KRDMTY EKQKLS +LQNLP +KL
Subjt:  -GMEYGATLPTSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSA---TPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKL

Query:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI
        DAI+QI+ KRN  +  +++EIEVDIDSVD ETLWELDR V NY+K LSK KRKAELAI +A+AEA+ N Q++ APAP   +F RE     +  + +  P 
Subjt:  DAILQIIKKRNFELLQQEDEIEVDIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEK-APAPDDSKFLRE-TRADENIISSSSPI

Query:  RGGQQQGHLSKTSSSSSSSSDSGSSSS
        +  +Q    S++SSSSSSSS S SS S
Subjt:  RGGQQQGHLSKTSSSSSSSSDSGSSSS

AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein2.9e-8430.75Show/hide
Query:  GPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLFSA--DLLDVWQGAQLQSLS
        G  P+   FV   +I  Y  LG L  AR LF +M   +VV++N MIS   +RG    A   F  MR      T+ T G + SA   + ++  G  + + +
Subjt:  GPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGFVPTQFTFGGLFSA--DLLDVWQGAQLQSLS

Query:  VKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHG
        +K GL  +   VG++L+ +Y +   +E A +VFE +  K+ V WN+++     N    +   LF ++   G  +  F+F  +LS  +   DL+ G Q H 
Subjt:  VKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGILSCFSREEDLKFGQQLHG

Query:  IVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFH
        I+IK      + V N+LV+MY +CG    A ++FE +  RD VT+N+IIG+    +    A  LF  M+  G++   A   + + +C+ +     G+  H
Subjt:  IVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFH

Query:  SKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELF----QIH
           ++  L+ D+  G++LID Y+K   +++AR  F  + E ++VS N LI GYS +    ++ L  EML  G  P+E TF+TI++     E      Q H
Subjt:  SKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELF----QIH

Query:  CLIIKMGY-EENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFK
          I K G+  E +Y+  S+   Y     +++  A  S+ +   S+VL                               W  M+   ++   Y + L  +K
Subjt:  CLIIKMGY-EENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFK

Query:  SMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDR-NLITWTVLVSVLGLHGYAHEALE
         M    + PD  TF+++L VC+ L +L  G ++H +I        D    N LIDMY KCG +  S ++FDE++ R N+++W  L++    +GYA +AL+
Subjt:  SMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDR-NLITWTVLVSVLGLHGYAHEALE

Query:  RFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAE
         F  M  S   PD +    VL AC H G V +G ++F  M  +YG+E  +DH  C+VDLL   G++ +A+
Subjt:  RFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAE

AT3G58590.1 Pentatricopeptide repeat (PPR) superfamily protein4.6e-19148.86Show/hide
Query:  GDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSE
        GD   H+  ++ LL  C KAPS   T+ LHAL+IT+  V  Q ++V NNII  Y  LG + +A  +FD+MP RN VS+NT+I   S+ G V +AW +FSE
Subjt:  GDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSE

Query:  MRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCE
        MR  G++P Q T  GL S   LDV  G QL  LS+K GLF A A VGT LL LYGR   LE A +VFEDMP+KSL TWN +++LLG    ++EC   F E
Subjt:  MRDCGFVPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCE

Query:  LMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFY
        L+  G  L++ SF+G+L   S  +DL   +QLH    K G   E+ VVNSL++ Y +CG   +AE++F++    D+V++N+II A A  + P  ALKLF 
Subjt:  LMCGGMELSKFSFVGILSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFY

Query:  TMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLL
        +M  +G  P Q ++V+ +   S +     G   H  +I+   E+ +  G ALID YAK   +E++R CFD I +KN+V WN L+ GY+       + L L
Subjt:  TMSANGLIPTQASFVNAVNSCSCLGSSIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLL

Query:  EMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQL
        +ML  G+RP E+TFST +K+    EL Q+H +I++MGYE+NDYV SS+  SYAK+ L++D L  +  ++   SVV  NIVAG Y+R GQY+E+ KL+  L
Subjt:  EMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKMGYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQL

Query:  EEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFD
        E+P  +SWNI I AC+++D + +V+ LFK ML   I PD YTF+S+LS+C+KLC+L LGSS+HG+I KT   C DTFVCN+LIDMYGKCGS+   +K+F+
Subjt:  EEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISLLSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFD

Query:  EVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI
        E +++NLITWT L+S LG+HGY  EALE+F E    GF+PD V+  ++L AC+HGG+VKEGM LF KMK  YGVEPEMDHY+C VDLL+ +G++ +AE +
Subjt:  EVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGGLVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTTCAATGGCGATTTCATCAAGCATCACCACCTTCTCCTCCAGCTACTTCAGGCATGCTCCAAGGCTCCTTCCCTCAAAGCAACGAGACTCCTTCATGCTCTCAC
AATTACGATGGGTCCTGTTCCGAACCAGGCCATTTTTGTTAATAATAATATCATATTCCAATATACTTCTCTTGGGATGTTGTTGGTGGCACGTAATCTGTTTGACAAAA
TGCCCCACCGAAATGTTGTGTCTTATAACACGATGATCAGTGCTTGTAGCCAACGTGGGTTTGTAAAGGAAGCATGGTATTTGTTTTCGGAGATGAGAGATTGTGGTTTT
GTACCAACTCAATTCACATTTGGTGGGTTATTTTCGGCCGACTTATTGGATGTTTGGCAGGGTGCTCAATTGCAGTCGTTATCAGTTAAAAATGGTCTGTTTGATGCTGG
TGCTGTTGTGGGAACGGCCTTGTTGGGGCTGTACGGCAGGCGTGGATGCTTGGAAGAAGCTCTGCGGGTTTTTGAAGATATGCCTTGGAAAAGTTTGGTGACATGGAATT
CGATATTGACATTACTCGGTCGTAACCAACTTGTGGAAGAATGTAAGCATCTGTTTTGTGAGCTTATGTGCGGAGGGATGGAACTGTCCAAGTTCTCTTTTGTGGGTATT
TTGTCTTGTTTTTCACGCGAAGAAGACTTGAAATTTGGGCAACAGTTACATGGTATTGTGATTAAAATTGGGTTTTATTATGAAGTTCTGGTTGTAAATTCTCTAGTGAA
CATGTATTTACAATGTGGAGGCTTTTTCTTAGCTGAGAAACTGTTTGAAGAGGTGCCCATGCGGGATGTTGTGACATATAATTCAATCATTGGCGCCGGGGCAATAATCA
AGAAACCTGAAATAGCATTGAAACTCTTTTACACTATGTCAGCGAATGGACTAATTCCTACCCAGGCATCATTTGTAAACGCTGTCAACTCTTGTAGTTGTTTGGGAAGT
TCCATTTATGGAGAATATTTTCATTCAAAAATAATTCGTTATGCTTTGGAGTCTGATGTATTTGCGGGCACTGCTTTGATTGACTGCTACGCTAAGTTCAGAAAAATGGA
GGAAGCCCGTTATTGCTTTGATGAGATAGCTGAGAAGAATTTGGTTTCTTGGAACACTTTGATTATGGGCTATTCAACTGATTGCTACACTTCTTCTATGTATTTACTGC
TAGAAATGCTGCATTTTGGTTATCGACCTAACGAATTTACATTTTCAACCATTATGAAGACACTATTGGCTTTGGAATTATTTCAGATTCATTGCTTGATTATAAAAATG
GGCTATGAGGAGAATGATTATGTATCAAGCTCTGTTGCTTCTTCCTATGCCAAACATGGTCTCATATCTGATGTCCTGGCTTATGTCTCTGATTCTAACAAACAACCTTC
TGTTGTGCTTTCTAACATAGTTGCTGGATATTATAATAGAGTTGGCCAATACAATGAGACACAGAAATTGCTTTACCAACTTGAGGAACCTGGCATTATATCTTGGAATA
TTATGATTGAAGCTTGCGCTAAAACAGACAATTACTTCAAAGTTCTAGCACTTTTCAAAAGCATGCTTATGCTCCAAATCTGCCCAGACAATTATACATTTATCTCCCTT
CTGAGTGTTTGTGCTAAACTGTGCAACCTTGCTCTGGGCAGTTCGGTTCATGGCGTTATCATAAAGACTGGTTCAGGTTGTTGTGATACATTTGTGTGCAATCTGCTAAT
TGACATGTATGGAAAATGTGGAAGCGTTGGATGCTCTTTGAAAATATTTGATGAAGTGAAAGATAGAAACTTAATCACATGGACAGTTTTAGTCTCCGTCCTTGGATTGC
ATGGCTATGCTCATGAAGCGCTAGAAAGGTTTGCAGAAATGGAGTTGTCAGGGTTTGAACCTGACGGGGTAGCTCTCGGTGCTGTGCTTGCAGCTTGCAAGCATGGTGGG
CTTGTTAAAGAAGGAATGGAGTTGTTTAGCAAGATGAAAGTGAAATATGGGGTCGAACCAGAAATGGATCATTATCAATGTGTTGTTGACTTGCTTTCTACGCATGGACA
TGTTGTGCAAGCGGAGAAGATTGGAGACTGCAGACTTATTGTGGATGACATGGCAGCATTAAGTATGCCTTACTCAACCTACAGCGCCGTGCCATTCCCTGCCCTTCTTG
AGGGGTCCCTAAGCTTTGGAGTCTGCGTTATATACTTTGATCAGTTTGCAGTTCTGGTTTCAGTGGCTTCTTCCACTCTCAACTGGTTCAGAAAATGTAGTGAATTTGAT
CAAGCGGTCCTCTACGATATTGGAGGGAAAGCTGCAAAGCCTGCAACTTTCTTCTTCTTCTTCTTCCTTTTTTCTACTTCGATCGCCTGCATTTACAACTCCACGCTTCG
AAACCCTGCAGCTGCATGCATGGCCAGAGCTCGCATTGTACCCATGTCGCTGCTTCTCTTCTGCGCTTTCTCCTTGCTCTTCGTCACCTCTCAGGCTCGACTTCTTCTTC
TCAGAACGCCCACCCACAGAGATGATGTCGCAGAAACGCCAGAATCCCACTTTCTTCTGCATAAAGTGTTGGGCTTTCGTCTCCCCGAGCTCAGACACCGCCGGCGACGG
TCGCTGGTCGAATCGGAGAGGATCGTGCCGGGAGGACCCGACCCGGAGCACCATAGTCCCCGCCATTCACCCACCTCGGTGAAATCTCCACGACGTCCACGTTACGCGCC
GGAATATTCTTCGCCCTTCTCTTTCAATTGGTTCGTGCGGTACACGCTCCACAGAAACACGTACAGATTAGGGCGCATCGCTGGTCCTTCGCACGGGTCTTCCGATGCCG
TCCCAGCCGCGTCATATAACACGCTGCGGGCCACGATCATCTCTGCAGCCGTACACCTAGGAGGAGCCAGATCTGGCCGGAAGTTCGGGGGCTCCGACCTCCAAACCCTA
ACTTTTTTGTGGCTTCAACGTCTCTCTGCGTTTATATTACTGCGCGGTCAGGGTCCTTTGCGAACCCTGGAAATTGCGCCTATAGCTGGAGGAGGAGATGAAGATAGAGT
GAAGCAGAGGTATTCGGAGAGTAAGGTTTACAGAAGGAAAACGTTCAAAGCTCTGAAGAACAAGAATACGGCCAGTGTCACCGCCAGCACCACCGTGTCCACCACCACCA
CCGACAAGGACGGGAGCAACAAGAATGGAAATAACATCACCGCTACCTTCAGCAACGTTAAAAGCTTCAATAACAATTCAGACCAGGCGGTGCCGCATTCAGAAGCGTCG
GAAGACGCGAATCTGTCTCGGCAGCAGCCTTTATCGCGGTTGGGCGCTGCCTCGGACGATTTGACGAGGCTTAATCGACAGGCATCCGTGGGGCGGACTGCTGTGGCTGC
CCAGGACCCGGCTTCTGGAAATGGCGTCATTAAGACGGGTTTCGACAATCAGAGTCGGGTCAACTGGGCATCAAAGCCGAAGCAGGAAATGCAAGAGCTTCGGCGGAAAT
TCGAGAGTGAGCTTGAAATGGTGAGAAATTTGGTTAAGAGGATTGAAGCCATACAAGGGCAGTTAAATAGCGGGCATAGTCATTCTCACGTGTCAACTATGGCATTGGCT
GATAACGGTCGTGGAGCATATCATGTCTATTCGGAAGTTGGTTCCGTTGGTGTTCTTCCGGATGATACCAGACCACTGCACCAGTTAAGCATATCGGTTCTGGAGAATGG
TAAAGGGGTGAATGATTTCATGGAAAGAGAGAAGAGGACCCCGAAAGCGAACCAGTTTTATCGTAATTCCGAATTCTTGCTTGCAAAAGATAAGATTCCTCCAGCTGAGA
GCAATAAAAAGTCTAAATTAAATGGGAAAAAGCATGGTAGACGAAAATTTAAACATGGTTTTGGCATGGGTACTAAGATCTTCAATGCTTGTGTTTCTTTACTTGAGAAA
TTGATGAAGCATAAGCATGGTTGGGTGTTCAATACTCCTGTTGACGTAGAGGGTCTTGGTTTGCATGATTATTTTAGCATTATCAGACATCCAATGGACTTGGGTACTGT
GAAAACGATGCTGAACAAGAATTGGTACAAGTCTCCAAAAGAATTTGCGGAGGACGTGAGACTTACTTTCCATAATGCTATGACTTATAATCCGAAAGGGCAAGATGTTC
ACATAATGGCGGAACAGTTACTGAAAATCTTTGAGGACAGGTGGGCTATAATAGAATCAAGTTATTATCAGGAGATGAGGCTGGGAATGGAATATGGGGCTACTCTTCCA
ACTTCTAATTCAATAGGAGCCCCTCCCGTGCCACTTCCTCCCCTTGACATGAGAAAAATTCTACGGAGGTCAGAGTCAATGATAAATCCAGCTGATTCCAAGACTCAACC
GATGAGTGCGACTCCGATGAGTGTGACTCCGTCAGCAAGGACTCCTTCACTGAAGAAACCAAAAGCAAAAGATCCCTTCAAAAGGGATATGACTTACAATGAAAAGCAAA
AGCTCAGTACGAACCTTCAGAATTTACCCTCTGAGAAGCTGGATGCCATTCTACAGATTATTAAAAAGAGAAATTTTGAGCTTCTCCAACAAGAGGATGAAATTGAAGTA
GACATTGATAGTGTTGACACTGAGACCCTGTGGGAGCTTGATAGACTAGTTATGAATTACAGGAAGAGTTTGAGCAAGAATAAGAGGAAGGCTGAACTTGCCATTTTGAA
GGCTCAAGCAGAAGCACAGCATAACGACCAGGAGAAGGCCCCAGCCCCAGATGATTCAAAGTTTCTTAGAGAAACTAGAGCAGATGAAAATATTATTTCCTCTTCATCAC
CCATCCGAGGGGGACAACAGCAGGGCCATTTGAGTAAGACCAGCAGCTCAAGTAGCTCTAGCAGTGATTCTGGATCTTCTTCTAGTG
mRNA sequenceShow/hide mRNA sequence
ATGAGCTTCAATGGCGATTTCATCAAGCATCACCACCTTCTCCTCCAGCTACTTCAGGCATGCTCCAAGGCTCCTTCCCTCAAAGCAACGAGACTCCTTCATGCTCTCAC
AATTACGATGGGTCCTGTTCCGAACCAGGCCATTTTTGTTAATAATAATATCATATTCCAATATACTTCTCTTGGGATGTTGTTGGTGGCACGTAATCTGTTTGACAAAA
TGCCCCACCGAAATGTTGTGTCTTATAACACGATGATCAGTGCTTGTAGCCAACGTGGGTTTGTAAAGGAAGCATGGTATTTGTTTTCGGAGATGAGAGATTGTGGTTTT
GTACCAACTCAATTCACATTTGGTGGGTTATTTTCGGCCGACTTATTGGATGTTTGGCAGGGTGCTCAATTGCAGTCGTTATCAGTTAAAAATGGTCTGTTTGATGCTGG
TGCTGTTGTGGGAACGGCCTTGTTGGGGCTGTACGGCAGGCGTGGATGCTTGGAAGAAGCTCTGCGGGTTTTTGAAGATATGCCTTGGAAAAGTTTGGTGACATGGAATT
CGATATTGACATTACTCGGTCGTAACCAACTTGTGGAAGAATGTAAGCATCTGTTTTGTGAGCTTATGTGCGGAGGGATGGAACTGTCCAAGTTCTCTTTTGTGGGTATT
TTGTCTTGTTTTTCACGCGAAGAAGACTTGAAATTTGGGCAACAGTTACATGGTATTGTGATTAAAATTGGGTTTTATTATGAAGTTCTGGTTGTAAATTCTCTAGTGAA
CATGTATTTACAATGTGGAGGCTTTTTCTTAGCTGAGAAACTGTTTGAAGAGGTGCCCATGCGGGATGTTGTGACATATAATTCAATCATTGGCGCCGGGGCAATAATCA
AGAAACCTGAAATAGCATTGAAACTCTTTTACACTATGTCAGCGAATGGACTAATTCCTACCCAGGCATCATTTGTAAACGCTGTCAACTCTTGTAGTTGTTTGGGAAGT
TCCATTTATGGAGAATATTTTCATTCAAAAATAATTCGTTATGCTTTGGAGTCTGATGTATTTGCGGGCACTGCTTTGATTGACTGCTACGCTAAGTTCAGAAAAATGGA
GGAAGCCCGTTATTGCTTTGATGAGATAGCTGAGAAGAATTTGGTTTCTTGGAACACTTTGATTATGGGCTATTCAACTGATTGCTACACTTCTTCTATGTATTTACTGC
TAGAAATGCTGCATTTTGGTTATCGACCTAACGAATTTACATTTTCAACCATTATGAAGACACTATTGGCTTTGGAATTATTTCAGATTCATTGCTTGATTATAAAAATG
GGCTATGAGGAGAATGATTATGTATCAAGCTCTGTTGCTTCTTCCTATGCCAAACATGGTCTCATATCTGATGTCCTGGCTTATGTCTCTGATTCTAACAAACAACCTTC
TGTTGTGCTTTCTAACATAGTTGCTGGATATTATAATAGAGTTGGCCAATACAATGAGACACAGAAATTGCTTTACCAACTTGAGGAACCTGGCATTATATCTTGGAATA
TTATGATTGAAGCTTGCGCTAAAACAGACAATTACTTCAAAGTTCTAGCACTTTTCAAAAGCATGCTTATGCTCCAAATCTGCCCAGACAATTATACATTTATCTCCCTT
CTGAGTGTTTGTGCTAAACTGTGCAACCTTGCTCTGGGCAGTTCGGTTCATGGCGTTATCATAAAGACTGGTTCAGGTTGTTGTGATACATTTGTGTGCAATCTGCTAAT
TGACATGTATGGAAAATGTGGAAGCGTTGGATGCTCTTTGAAAATATTTGATGAAGTGAAAGATAGAAACTTAATCACATGGACAGTTTTAGTCTCCGTCCTTGGATTGC
ATGGCTATGCTCATGAAGCGCTAGAAAGGTTTGCAGAAATGGAGTTGTCAGGGTTTGAACCTGACGGGGTAGCTCTCGGTGCTGTGCTTGCAGCTTGCAAGCATGGTGGG
CTTGTTAAAGAAGGAATGGAGTTGTTTAGCAAGATGAAAGTGAAATATGGGGTCGAACCAGAAATGGATCATTATCAATGTGTTGTTGACTTGCTTTCTACGCATGGACA
TGTTGTGCAAGCGGAGAAGATTGGAGACTGCAGACTTATTGTGGATGACATGGCAGCATTAAGTATGCCTTACTCAACCTACAGCGCCGTGCCATTCCCTGCCCTTCTTG
AGGGGTCCCTAAGCTTTGGAGTCTGCGTTATATACTTTGATCAGTTTGCAGTTCTGGTTTCAGTGGCTTCTTCCACTCTCAACTGGTTCAGAAAATGTAGTGAATTTGAT
CAAGCGGTCCTCTACGATATTGGAGGGAAAGCTGCAAAGCCTGCAACTTTCTTCTTCTTCTTCTTCCTTTTTTCTACTTCGATCGCCTGCATTTACAACTCCACGCTTCG
AAACCCTGCAGCTGCATGCATGGCCAGAGCTCGCATTGTACCCATGTCGCTGCTTCTCTTCTGCGCTTTCTCCTTGCTCTTCGTCACCTCTCAGGCTCGACTTCTTCTTC
TCAGAACGCCCACCCACAGAGATGATGTCGCAGAAACGCCAGAATCCCACTTTCTTCTGCATAAAGTGTTGGGCTTTCGTCTCCCCGAGCTCAGACACCGCCGGCGACGG
TCGCTGGTCGAATCGGAGAGGATCGTGCCGGGAGGACCCGACCCGGAGCACCATAGTCCCCGCCATTCACCCACCTCGGTGAAATCTCCACGACGTCCACGTTACGCGCC
GGAATATTCTTCGCCCTTCTCTTTCAATTGGTTCGTGCGGTACACGCTCCACAGAAACACGTACAGATTAGGGCGCATCGCTGGTCCTTCGCACGGGTCTTCCGATGCCG
TCCCAGCCGCGTCATATAACACGCTGCGGGCCACGATCATCTCTGCAGCCGTACACCTAGGAGGAGCCAGATCTGGCCGGAAGTTCGGGGGCTCCGACCTCCAAACCCTA
ACTTTTTTGTGGCTTCAACGTCTCTCTGCGTTTATATTACTGCGCGGTCAGGGTCCTTTGCGAACCCTGGAAATTGCGCCTATAGCTGGAGGAGGAGATGAAGATAGAGT
GAAGCAGAGGTATTCGGAGAGTAAGGTTTACAGAAGGAAAACGTTCAAAGCTCTGAAGAACAAGAATACGGCCAGTGTCACCGCCAGCACCACCGTGTCCACCACCACCA
CCGACAAGGACGGGAGCAACAAGAATGGAAATAACATCACCGCTACCTTCAGCAACGTTAAAAGCTTCAATAACAATTCAGACCAGGCGGTGCCGCATTCAGAAGCGTCG
GAAGACGCGAATCTGTCTCGGCAGCAGCCTTTATCGCGGTTGGGCGCTGCCTCGGACGATTTGACGAGGCTTAATCGACAGGCATCCGTGGGGCGGACTGCTGTGGCTGC
CCAGGACCCGGCTTCTGGAAATGGCGTCATTAAGACGGGTTTCGACAATCAGAGTCGGGTCAACTGGGCATCAAAGCCGAAGCAGGAAATGCAAGAGCTTCGGCGGAAAT
TCGAGAGTGAGCTTGAAATGGTGAGAAATTTGGTTAAGAGGATTGAAGCCATACAAGGGCAGTTAAATAGCGGGCATAGTCATTCTCACGTGTCAACTATGGCATTGGCT
GATAACGGTCGTGGAGCATATCATGTCTATTCGGAAGTTGGTTCCGTTGGTGTTCTTCCGGATGATACCAGACCACTGCACCAGTTAAGCATATCGGTTCTGGAGAATGG
TAAAGGGGTGAATGATTTCATGGAAAGAGAGAAGAGGACCCCGAAAGCGAACCAGTTTTATCGTAATTCCGAATTCTTGCTTGCAAAAGATAAGATTCCTCCAGCTGAGA
GCAATAAAAAGTCTAAATTAAATGGGAAAAAGCATGGTAGACGAAAATTTAAACATGGTTTTGGCATGGGTACTAAGATCTTCAATGCTTGTGTTTCTTTACTTGAGAAA
TTGATGAAGCATAAGCATGGTTGGGTGTTCAATACTCCTGTTGACGTAGAGGGTCTTGGTTTGCATGATTATTTTAGCATTATCAGACATCCAATGGACTTGGGTACTGT
GAAAACGATGCTGAACAAGAATTGGTACAAGTCTCCAAAAGAATTTGCGGAGGACGTGAGACTTACTTTCCATAATGCTATGACTTATAATCCGAAAGGGCAAGATGTTC
ACATAATGGCGGAACAGTTACTGAAAATCTTTGAGGACAGGTGGGCTATAATAGAATCAAGTTATTATCAGGAGATGAGGCTGGGAATGGAATATGGGGCTACTCTTCCA
ACTTCTAATTCAATAGGAGCCCCTCCCGTGCCACTTCCTCCCCTTGACATGAGAAAAATTCTACGGAGGTCAGAGTCAATGATAAATCCAGCTGATTCCAAGACTCAACC
GATGAGTGCGACTCCGATGAGTGTGACTCCGTCAGCAAGGACTCCTTCACTGAAGAAACCAAAAGCAAAAGATCCCTTCAAAAGGGATATGACTTACAATGAAAAGCAAA
AGCTCAGTACGAACCTTCAGAATTTACCCTCTGAGAAGCTGGATGCCATTCTACAGATTATTAAAAAGAGAAATTTTGAGCTTCTCCAACAAGAGGATGAAATTGAAGTA
GACATTGATAGTGTTGACACTGAGACCCTGTGGGAGCTTGATAGACTAGTTATGAATTACAGGAAGAGTTTGAGCAAGAATAAGAGGAAGGCTGAACTTGCCATTTTGAA
GGCTCAAGCAGAAGCACAGCATAACGACCAGGAGAAGGCCCCAGCCCCAGATGATTCAAAGTTTCTTAGAGAAACTAGAGCAGATGAAAATATTATTTCCTCTTCATCAC
CCATCCGAGGGGGACAACAGCAGGGCCATTTGAGTAAGACCAGCAGCTCAAGTAGCTCTAGCAGTGATTCTGGATCTTCTTCTAGTG
Protein sequenceShow/hide protein sequence
MSFNGDFIKHHHLLLQLLQACSKAPSLKATRLLHALTITMGPVPNQAIFVNNNIIFQYTSLGMLLVARNLFDKMPHRNVVSYNTMISACSQRGFVKEAWYLFSEMRDCGF
VPTQFTFGGLFSADLLDVWQGAQLQSLSVKNGLFDAGAVVGTALLGLYGRRGCLEEALRVFEDMPWKSLVTWNSILTLLGRNQLVEECKHLFCELMCGGMELSKFSFVGI
LSCFSREEDLKFGQQLHGIVIKIGFYYEVLVVNSLVNMYLQCGGFFLAEKLFEEVPMRDVVTYNSIIGAGAIIKKPEIALKLFYTMSANGLIPTQASFVNAVNSCSCLGS
SIYGEYFHSKIIRYALESDVFAGTALIDCYAKFRKMEEARYCFDEIAEKNLVSWNTLIMGYSTDCYTSSMYLLLEMLHFGYRPNEFTFSTIMKTLLALELFQIHCLIIKM
GYEENDYVSSSVASSYAKHGLISDVLAYVSDSNKQPSVVLSNIVAGYYNRVGQYNETQKLLYQLEEPGIISWNIMIEACAKTDNYFKVLALFKSMLMLQICPDNYTFISL
LSVCAKLCNLALGSSVHGVIIKTGSGCCDTFVCNLLIDMYGKCGSVGCSLKIFDEVKDRNLITWTVLVSVLGLHGYAHEALERFAEMELSGFEPDGVALGAVLAACKHGG
LVKEGMELFSKMKVKYGVEPEMDHYQCVVDLLSTHGHVVQAEKIGDCRLIVDDMAALSMPYSTYSAVPFPALLEGSLSFGVCVIYFDQFAVLVSVASSTLNWFRKCSEFD
QAVLYDIGGKAAKPATFFFFFFLFSTSIACIYNSTLRNPAAACMARARIVPMSLLLFCAFSLLFVTSQARLLLLRTPTHRDDVAETPESHFLLHKVLGFRLPELRHRRRR
SLVESERIVPGGPDPEHHSPRHSPTSVKSPRRPRYAPEYSSPFSFNWFVRYTLHRNTYRLGRIAGPSHGSSDAVPAASYNTLRATIISAAVHLGGARSGRKFGGSDLQTL
TFLWLQRLSAFILLRGQGPLRTLEIAPIAGGGDEDRVKQRYSESKVYRRKTFKALKNKNTASVTASTTVSTTTTDKDGSNKNGNNITATFSNVKSFNNNSDQAVPHSEAS
EDANLSRQQPLSRLGAASDDLTRLNRQASVGRTAVAAQDPASGNGVIKTGFDNQSRVNWASKPKQEMQELRRKFESELEMVRNLVKRIEAIQGQLNSGHSHSHVSTMALA
DNGRGAYHVYSEVGSVGVLPDDTRPLHQLSISVLENGKGVNDFMEREKRTPKANQFYRNSEFLLAKDKIPPAESNKKSKLNGKKHGRRKFKHGFGMGTKIFNACVSLLEK
LMKHKHGWVFNTPVDVEGLGLHDYFSIIRHPMDLGTVKTMLNKNWYKSPKEFAEDVRLTFHNAMTYNPKGQDVHIMAEQLLKIFEDRWAIIESSYYQEMRLGMEYGATLP
TSNSIGAPPVPLPPLDMRKILRRSESMINPADSKTQPMSATPMSVTPSARTPSLKKPKAKDPFKRDMTYNEKQKLSTNLQNLPSEKLDAILQIIKKRNFELLQQEDEIEV
DIDSVDTETLWELDRLVMNYRKSLSKNKRKAELAILKAQAEAQHNDQEKAPAPDDSKFLRETRADENIISSSSPIRGGQQQGHLSKTSSSSSSSSDSGSSSSX