; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g32890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g32890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr9:24881466..24889151
RNA-Seq ExpressionMoc09g32890
SyntenyMoc09g32890
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR002885 - Pentatricopeptide repeat
IPR009057 - Homeobox-like domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578928.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.1e-8153.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAM+TAYTQNG+AL +LNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QW+EALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

KAG7016451.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8153.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAM+TAYTQNG+AL +LNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QW+EALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

XP_022141462.1 pentatricopeptide repeat-containing protein At3g26540-like [Momordica charantia]9.6e-8958.62Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAMITAYTQNGYAL AL LFSDMN+SGVRATEI LASILGSCG ALALHFSRQIHGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR-----------------------------------------------------
        KS FLG VILE+SLVDVYGK  +        +E Q R DV  NV  R                                                     
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR-----------------------------------------------------

Query:  ------------SDSIEFYLFL-----------------LVQVWQHLY-----KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
                    S  I+ Y+                   L+     LY      E PKARELFNEMPERNVISWNAMLAGYV S+QWEEALDFVHLM SS
Subjt:  ------------SDSIEFYLFL-----------------LVQVWQHLY-----KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IKEIDHVTLRLILNVCT  LDVA GKQVHGFVY+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

XP_022938461.1 pentatricopeptide repeat-containing protein At3g26540 [Cucurbita moschata]1.1e-8153.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAM+TAYTQNG+AL +LNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QW+EALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

XP_022993859.1 pentatricopeptide repeat-containing protein At3g26540 [Cucurbita maxima]1.9e-8153.74Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCL DA ELFEEMP RDGGS NAMITAYT+NG+AL ALNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR----------SDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV  R          + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR----------SDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QWEEALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

TrEMBL top hitse value%identityAlignment
A0A0A0KFE0 Uncharacterized protein5.7e-7953.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCL+DA ELF+EMP RDGGS NAMITAYTQNGYAL ALNL+ D+N+SGV ATE+ LASIL SCG+ LALHFSRQIHGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        K  F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQ---HLYKESP---KARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W    + Y  S    KARELFNEMPERNVISWNAMLAGY+ SSQWEEAL+FVHLM SS
Subjt:  VQV---------------------------------------WQ---HLYKESP---KARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK+ID  TL LILNVCT   DV  GKQVHGFVY+ GFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

A0A5D3DJG9 Pentatricopeptide repeat-containing protein2.8e-7853.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCL+DA ELF+EMP RDGGS NAMITAYTQNGYAL ALNL+ D+ +SGV ATEI LASIL SCG+ LALHFSRQIHGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        K  F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQ---HLYKESP---KARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W    + Y  S    KARELFNEMPERNVISWNAMLAGY+ SSQWEEAL+FVHLM SS
Subjt:  VQV---------------------------------------WQ---HLYKESP---KARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK+ID  TL LILNVCT   DV  GKQVHGFVY+ GFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

A0A6J1CJX1 pentatricopeptide repeat-containing protein At3g26540-like4.6e-8958.62Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAMITAYTQNGYAL AL LFSDMN+SGVRATEI LASILGSCG ALALHFSRQIHGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR-----------------------------------------------------
        KS FLG VILE+SLVDVYGK  +        +E Q R DV  NV  R                                                     
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR-----------------------------------------------------

Query:  ------------SDSIEFYLFL-----------------LVQVWQHLY-----KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
                    S  I+ Y+                   L+     LY      E PKARELFNEMPERNVISWNAMLAGYV S+QWEEALDFVHLM SS
Subjt:  ------------SDSIEFYLFL-----------------LVQVWQHLY-----KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IKEIDHVTLRLILNVCT  LDVA GKQVHGFVY+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

A0A6J1FD79 pentatricopeptide repeat-containing protein At3g265405.5e-8253.16Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCLEDA ELFEEMP RDGGS NAM+TAYTQNG+AL +LNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV           + + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNV----------PRRSDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QW+EALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

A0A6J1K1B7 pentatricopeptide repeat-containing protein At3g265409.4e-8253.74Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F PT PIFLLN+AIEAYGKCGCL DA ELFEEMP RDGGS NAMITAYT+NG+AL ALNLF +MN+SGV ATEI LAS+LGSCG+ LALH SRQ+HGHIV
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR----------SDSIEFYLF--------------------------------LL
        KS F+G VILE+SLVDVYGK  +        +E Q R DV  NV  R          + S+ F +F                                ++
Subjt:  KSCFLGIVILENSLVDVYGKWAM--------EERQCRVDVLPNVPRR----------SDSIEFYLF--------------------------------LL

Query:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
        V+V                                       W  +        E  KARELFNEMPERNVISWNAMLAGY+ S QWEEALDFVHLM +S
Subjt:  VQV---------------------------------------WQHLY------KESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        IK++DHVTLRLILNVCT  LDV  GKQVHGF+Y+IGFYA+LYIG+ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

SwissProt top hitse value%identityAlignment
O04192 Transcription factor MYB252.3e-3761.21Show/hide
Query:  IALPDRNTSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGR
        +A  D N     ++VKG W P++D  L +LV+  GPRNW+LIS GIPGRSGKSCRLRWCNQL P ++ +PF+  E+ +I+ A  V GNKWS IA+LLPGR
Subjt:  IALPDRNTSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGR

Query:  TDNAIKNHWNSTLRRR
        TDNAIKNHWNS LRR+
Subjt:  TDNAIKNHWNSTLRRR

O23160 Transcription factor MYB733.5e-4153.18Show/hide
Query:  TSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKN
        T  N  R+KG WSP+ED  L +LV++HGPRNWSLIS  IPGRSGKSCRLRWCNQLSP V+HR F+  ED+ I++AH   GNKW+TI+RLL GRTDNAIKN
Subjt:  TSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKN

Query:  HWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGVVAGGGGPETS
        HWNSTL+R+   +     S  F     YD      +   E  LKR         GGGG + G+    G P  S
Subjt:  HWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGVVAGGGGPETS

Q9FDW1 Transcription factor MYB442.2e-4061.48Show/hide
Query:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST
        +R+KG WSP+ED  L +LV ++GPRNW++IS  IPGRSGKSCRLRWCNQLSP V+HRPF+  ED+ I +AH   GNKW+TIARLL GRTDNA+KNHWNST
Subjt:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST

Query:  LRRRRDADLSSDYSNAFLKRPI
        L+R+        Y  +   RP+
Subjt:  LRRRRDADLSSDYSNAFLKRPI

Q9LRV2 Pentatricopeptide repeat-containing protein At3g265403.3e-5238.79Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F P  PIFLLN+AIEAYGKCGC++DA ELFEEMP RDGGS NA+ITA  QNG +     +F  MN  GVRATE   A +L SCG  L L   RQ+H  +V
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAMEERQCRV--------DVLPNVPRR-------SDSIEFYLF------------------------LLVQVWQHLYKES
        K  + G V LE S+VDVYGK  +     RV        DV  NV  R       +D      F                        L ++V + ++  +
Subjt:  KSCFLGIVILENSLVDVYGKWAMEERQCRV--------DVLPNVPRR-------SDSIEFYLF------------------------LLVQVWQHLYKES

Query:  PK--------------------------------------------------------ARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
         K                                                        ARELF+ MPERN++SWNAML GYV + +W+EALDF+ LM   
Subjt:  PK--------------------------------------------------------ARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        I+ ID+VTL  ILNVC+   DV +GKQ HGF+Y+ G+  ++ + +ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

Q9SN12 Transcription factor MYB771.3e-4071.15Show/hide
Query:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST
        +RVKG WS +ED  L ++VE++GPRNWS IS  IPGRSGKSCRLRWCNQLSP V+HRPF+P ED+ IV A    GNKW+TIARLL GRTDNA+KNHWNST
Subjt:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST

Query:  LRRR
        L+R+
Subjt:  LRRR

Arabidopsis top hitse value%identityAlignment
AT2G23290.1 myb domain protein 705.5e-4246.92Show/hide
Query:  NTSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIK
        +T    +R+KG WSP+ED  L  LV++HGPRNWSLIS  IPGRSGKSCRLRWCNQLSP V+HR FT  EDD I+ AH   GNKW+TIARLL GRTDNAIK
Subjt:  NTSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIK

Query:  NHWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGV---VAGGGGPETSLTLLKMESERINDGRCQTSVAKP
        NHWNST                                     LKRKC G     GGGG   G      G GG + +LT  K    R + G     V   
Subjt:  NHWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGV---VAGGGGPETSLTLLKMESERINDGRCQTSVAKP

Query:  DHTAASVAEEA
          T + V+E++
Subjt:  DHTAASVAEEA

AT3G26540.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-5338.79Show/hide
Query:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV
        F P  PIFLLN+AIEAYGKCGC++DA ELFEEMP RDGGS NA+ITA  QNG +     +F  MN  GVRATE   A +L SCG  L L   RQ+H  +V
Subjt:  FRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIV

Query:  KSCFLGIVILENSLVDVYGKWAMEERQCRV--------DVLPNVPRR-------SDSIEFYLF------------------------LLVQVWQHLYKES
        K  + G V LE S+VDVYGK  +     RV        DV  NV  R       +D      F                        L ++V + ++  +
Subjt:  KSCFLGIVILENSLVDVYGKWAMEERQCRV--------DVLPNVPRR-------SDSIEFYLF------------------------LLVQVWQHLYKES

Query:  PK--------------------------------------------------------ARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS
         K                                                        ARELF+ MPERN++SWNAML GYV + +W+EALDF+ LM   
Subjt:  PK--------------------------------------------------------ARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSS

Query:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL
        I+ ID+VTL  ILNVC+   DV +GKQ HGF+Y+ G+  ++ + +ALL
Subjt:  IKEIDHVTLRLILNVCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALL

AT3G50060.1 myb domain protein 779.4e-4271.15Show/hide
Query:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST
        +RVKG WS +ED  L ++VE++GPRNWS IS  IPGRSGKSCRLRWCNQLSP V+HRPF+P ED+ IV A    GNKW+TIARLL GRTDNA+KNHWNST
Subjt:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST

Query:  LRRR
        L+R+
Subjt:  LRRR

AT3G55730.1 myb domain protein 1099.4e-4270.19Show/hide
Query:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST
        ++VKG WS +EDA L KLV + GPRNWSLI+ GIPGRSGKSCRLRWCNQL P ++ +PF+  ED +I+ AH VHGNKW+ IA+LL GRTDNAIKNHWNST
Subjt:  NRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNST

Query:  LRRR
        LRR+
Subjt:  LRRR

AT4G37260.1 myb domain protein 732.5e-4253.18Show/hide
Query:  TSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKN
        T  N  R+KG WSP+ED  L +LV++HGPRNWSLIS  IPGRSGKSCRLRWCNQLSP V+HR F+  ED+ I++AH   GNKW+TI+RLL GRTDNAIKN
Subjt:  TSNNHNRVKGSWSPQEDATLVKLVEEHGPRNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKN

Query:  HWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGVVAGGGGPETS
        HWNSTL+R+   +     S  F     YD      +   E  LKR         GGGG + G+    G P  S
Subjt:  HWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDSEASLKRKCFGSAAEKGGGGATAGVVAGGGGPETS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCGCCCCACGTCGCCTATTTTTCTGTTGAATCAAGCCATTGAAGCCTATGGTAAATGTGGGTGTTTGGAGGATGCAGGGGAGTTGTTCGAGGAAATGCCTCATAG
AGATGGGGGATCGTCGAATGCGATGATAACAGCATATACCCAGAATGGGTATGCTTTGGGAGCATTGAATTTGTTTTCGGATATGAATGAATCTGGTGTTCGTGCTACTG
AGATATTTTTAGCCAGTATTCTTGGGTCTTGCGGAGCTGCATTGGCTCTTCACTTTTCGAGGCAAATTCACGGGCATATTGTGAAATCTTGCTTCCTTGGCATTGTAATT
CTTGAGAATTCTCTTGTTGATGTCTATGGAAAGTGGGCAATGGAAGAGAGGCAGTGTCGTGTCGATGTTTTACCAAATGTTCCGAGAAGAAGTGATTCCATTGAGTTTTA
CCTTTTCTTGCTTGTTCAAGTATGGCAGCACTTGTACAAGGAAAGTCCAAAAGCTAGGGAGCTTTTCAATGAAATGCCTGAACGCAATGTGATTTCATGGAATGCTATGT
TGGCAGGATATGTTGATTCCTCTCAATGGGAAGAGGCGTTAGACTTTGTCCATTTGATGTGCAGTTCGATTAAGGAAATTGATCACGTAACTCTTCGCCTGATACTGAAT
GTGTGTACCGACCGTCTAGATGTTGCAATAGGGAAGCAGGTTCATGGTTTTGTTTATAAAATTGGTTTCTATGCTGATCTCTATATTGGTAGTGCTCTTCTTGCATGTAT
GGATGTTGCCACAACAAAAGGGACGATAGTAATCGAGTTATTTCGATTGATGACAATGGAGGAAGGCGTGAAACCAGGCCATGTGACCTTTCAAGCCATGGTGATGATCG
GAATCGCCCTCCCCGATCGGAACACGAGCAATAATCACAATCGAGTTAAAGGATCGTGGAGCCCTCAGGAGGACGCTACGTTGGTCAAATTGGTGGAGGAGCACGGACCT
AGAAATTGGTCTTTGATTAGCACTGGAATTCCCGGCCGATCTGGAAAATCCTGCCGTCTGCGGTGGTGCAATCAGCTCAGTCCGGCGGTTCAGCACCGGCCGTTCACGCC
GGCGGAGGACGATGTCATAGTTCAGGCGCATGGCGTCCACGGGAACAAATGGTCCACCATTGCTCGTCTCTTGCCTGGCCGGACCGACAATGCGATTAAGAATCACTGGA
ACTCCACGCTGAGGCGCCGCCGCGACGCCGACTTGTCGTCCGATTATTCCAACGCGTTCTTGAAGCGGCCGATTTACGACGTCTCCAGATCCTCGTCGTCGGATGATTCG
GAAGCCTCGCTGAAACGCAAGTGCTTTGGATCGGCGGCGGAGAAAGGCGGCGGCGGTGCGACGGCGGGAGTTGTGGCTGGAGGTGGAGGACCTGAGACGTCGCTGACGCT
TTTGAAAATGGAATCAGAACGAATTAATGATGGAAGATGTCAAACCTCAGTTGCCAAACCAGACCATACAGCAGCTTCAGTTGCAGAAGAAGCATTTGAAAAAACTGTCA
ATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCGCCCCACGTCGCCTATTTTTCTGTTGAATCAAGCCATTGAAGCCTATGGTAAATGTGGGTGTTTGGAGGATGCAGGGGAGTTGTTCGAGGAAATGCCTCATAG
AGATGGGGGATCGTCGAATGCGATGATAACAGCATATACCCAGAATGGGTATGCTTTGGGAGCATTGAATTTGTTTTCGGATATGAATGAATCTGGTGTTCGTGCTACTG
AGATATTTTTAGCCAGTATTCTTGGGTCTTGCGGAGCTGCATTGGCTCTTCACTTTTCGAGGCAAATTCACGGGCATATTGTGAAATCTTGCTTCCTTGGCATTGTAATT
CTTGAGAATTCTCTTGTTGATGTCTATGGAAAGTGGGCAATGGAAGAGAGGCAGTGTCGTGTCGATGTTTTACCAAATGTTCCGAGAAGAAGTGATTCCATTGAGTTTTA
CCTTTTCTTGCTTGTTCAAGTATGGCAGCACTTGTACAAGGAAAGTCCAAAAGCTAGGGAGCTTTTCAATGAAATGCCTGAACGCAATGTGATTTCATGGAATGCTATGT
TGGCAGGATATGTTGATTCCTCTCAATGGGAAGAGGCGTTAGACTTTGTCCATTTGATGTGCAGTTCGATTAAGGAAATTGATCACGTAACTCTTCGCCTGATACTGAAT
GTGTGTACCGACCGTCTAGATGTTGCAATAGGGAAGCAGGTTCATGGTTTTGTTTATAAAATTGGTTTCTATGCTGATCTCTATATTGGTAGTGCTCTTCTTGCATGTAT
GGATGTTGCCACAACAAAAGGGACGATAGTAATCGAGTTATTTCGATTGATGACAATGGAGGAAGGCGTGAAACCAGGCCATGTGACCTTTCAAGCCATGGTGATGATCG
GAATCGCCCTCCCCGATCGGAACACGAGCAATAATCACAATCGAGTTAAAGGATCGTGGAGCCCTCAGGAGGACGCTACGTTGGTCAAATTGGTGGAGGAGCACGGACCT
AGAAATTGGTCTTTGATTAGCACTGGAATTCCCGGCCGATCTGGAAAATCCTGCCGTCTGCGGTGGTGCAATCAGCTCAGTCCGGCGGTTCAGCACCGGCCGTTCACGCC
GGCGGAGGACGATGTCATAGTTCAGGCGCATGGCGTCCACGGGAACAAATGGTCCACCATTGCTCGTCTCTTGCCTGGCCGGACCGACAATGCGATTAAGAATCACTGGA
ACTCCACGCTGAGGCGCCGCCGCGACGCCGACTTGTCGTCCGATTATTCCAACGCGTTCTTGAAGCGGCCGATTTACGACGTCTCCAGATCCTCGTCGTCGGATGATTCG
GAAGCCTCGCTGAAACGCAAGTGCTTTGGATCGGCGGCGGAGAAAGGCGGCGGCGGTGCGACGGCGGGAGTTGTGGCTGGAGGTGGAGGACCTGAGACGTCGCTGACGCT
TTTGAAAATGGAATCAGAACGAATTAATGATGGAAGATGTCAAACCTCAGTTGCCAAACCAGACCATACAGCAGCTTCAGTTGCAGAAGAAGCATTTGAAAAAACTGTCA
ATTAA
Protein sequenceShow/hide protein sequence
MFRPTSPIFLLNQAIEAYGKCGCLEDAGELFEEMPHRDGGSSNAMITAYTQNGYALGALNLFSDMNESGVRATEIFLASILGSCGAALALHFSRQIHGHIVKSCFLGIVI
LENSLVDVYGKWAMEERQCRVDVLPNVPRRSDSIEFYLFLLVQVWQHLYKESPKARELFNEMPERNVISWNAMLAGYVDSSQWEEALDFVHLMCSSIKEIDHVTLRLILN
VCTDRLDVAIGKQVHGFVYKIGFYADLYIGSALLACMDVATTKGTIVIELFRLMTMEEGVKPGHVTFQAMVMIGIALPDRNTSNNHNRVKGSWSPQEDATLVKLVEEHGP
RNWSLISTGIPGRSGKSCRLRWCNQLSPAVQHRPFTPAEDDVIVQAHGVHGNKWSTIARLLPGRTDNAIKNHWNSTLRRRRDADLSSDYSNAFLKRPIYDVSRSSSSDDS
EASLKRKCFGSAAEKGGGGATAGVVAGGGGPETSLTLLKMESERINDGRCQTSVAKPDHTAASVAEEAFEKTVN