; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g38450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g38450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptiontranscription factor WER-like
Genome locationchr8:28794565..28807422
RNA-Seq ExpressionMoc08g38450
SyntenyMoc08g38450
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PQQ14887.1 uncharacterized protein Pyn_39585 [Prunus yedoensis var. nudiflora]4.5e-16847.19Show/hide
Query:  MGGVAWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCH
        MGG+ WT+EED LL+ CIEQYGEGKWHRVP LAGLNRCRKSCRLRWLNYLRPNIKRGSF  EEVDLI  LH LLGNRWS++A RLPGRT ND+KNYWNCH
Subjt:  MGGVAWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCH

Query:  LSKKLNGQLGGVEDQQANPTKEEAFFGKPQNWEHPQEESSKSRAKGKTCAADHDQNIN------GENSQEMILV-------------QNQ---------N
        LSKKLN Q    ED+  +    +    +  N+     +   S    ++ + +  Q +N      G NS   I +             QNQ         N
Subjt:  LSKKLNGQLGGVEDQQANPTKEEAFFGKPQNWEHPQEESSKSRAKGKTCAADHDQNIN------GENSQEMILV-------------QNQ---------N

Query:  QNLPIIVEQQSNSMGSLGNLQME---FKAL-----EDDHHQQGCNKREVWDDWISEMDLWIDSLYFDCALRS---------------------------C
        +N+P + E+  N   ++G+L ME   FK +     +DD    G +    W +W    D  ++ L   C  +S                            
Subjt:  QNLPIIVEQQSNSMGSLGNLQME---FKAL-----EDDHHQQGCNKREVWDDWISEMDLWIDSLYFDCALRS---------------------------C

Query:  CMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGE
         MAYIPPHKRHS++ E+  PT ELL PQF + LNLR           SKS  D  GK++YA+ A  +W  IG   + NQFPS V+LE  S  + E + GE
Subjt:  CMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGE

Query:  KPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMN--QDEDVKPRLVARVGKVLFHGVSEID----RNEPPTETTLRQLRRSFYTN
        KPLAL+NT++ +   E +  +   PWVS+ EN+L DLLSS E+VKN+M   + E+VKP LVARVGKVLF     ++    RN   TE  L+Q +R FYTN
Subjt:  KPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMN--QDEDVKPRLVARVGKVLFHGVSEID----RNEPPTETTLRQLRRSFYTN

Query:  ISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEME
        +  +Y ENI ++V+P  G+ F EEKD+YH+KLSD+ RP   +SCKC+V+ E   L+LYK+ELNQ+R MV DISC  +N+D+R+MLC+K+    LT DEM+
Subjt:  ISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEME

Query:  GIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKD
         I  LI+SAVLD DVKGGLRWPLGK SSGDR++VVG+WH +A  Y N  +RLK+R+ADR+DF+TS GEAT E++L LK+  S+L  E+ E   IS++L++
Subjt:  GIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKD

Query:  HLKLFWNHFVCN
        ++++ W++F+C+
Subjt:  HLKLFWNHFVCN

XP_022144530.1 uncharacterized protein LOC111014191 [Momordica charantia]4.6e-237100Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
        MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK

Query:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME
        PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME
Subjt:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME

Query:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN
        NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN
Subjt:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN

Query:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN
        SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN
Subjt:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN

Query:  HFVCNASCSSPSDTT
        HFVCNASCSSPSDTT
Subjt:  HFVCNASCSSPSDTT

XP_023538039.1 uncharacterized protein LOC111798921 isoform X1 [Cucurbita pepo subsp. pepo]5.3e-16969.88Show/hide
Query:  IDSLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLE
        + S Y  CALRSC MAY+PPHKRHS DMEK SPTAELL P FNRKL          DL+ S  +VDRSG+I+YA QA  KWF IG S + N FPSCV LE
Subjt:  IDSLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLE

Query:  PFSATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQL
        P +  + EH+ GEK LALV +N+SQGN+EEE E+V EPW SIV  LLPDLLSS+EH+KNEMNQDE+VKP LVAR+GKV+FHG SE D N+ PT   + QL
Subjt:  PFSATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQL

Query:  RRSFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEK
        +RSF TN+S+TY+ENI  KVIPL G+ F EEKD YH+KLSDAE P VT++CKCT +PELN LELYKVELN VRHMVADISCLKQN+DMR+M  SKKT  K
Subjt:  RRSFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEK

Query:  LTVDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDV
        LT DEMEGI  LINSA+LD+DVKGGLRWPLGKASSGDRF+VVG WHTV++SYVN F+R+KLRNADRYDFKT VGE + E+T K+K+ TSELLRE+AE DV
Subjt:  LTVDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDV

Query:  ISDMLKDHLKLFWNHFVCNASCSSP
        ISDML ++LKLFWNHFVC+A+CSSP
Subjt:  ISDMLKDHLKLFWNHFVCNASCSSP

XP_023538040.1 uncharacterized protein LOC111798921 isoform X2 [Cucurbita pepo subsp. pepo]9.1e-16970.21Show/hide
Query:  SLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPF
        S Y  CALRSC MAY+PPHKRHS DMEK SPTAELL P FNRKL          DL+ S  +VDRSG+I+YA QA  KWF IG S + N FPSCV LEP 
Subjt:  SLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPF

Query:  SATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRR
        +  + EH+ GEK LALV +N+SQGN+EEE E+V EPW SIV  LLPDLLSS+EH+KNEMNQDE+VKP LVAR+GKV+FHG SE D N+ PT   + QL+R
Subjt:  SATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRR

Query:  SFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLT
        SF TN+S+TY+ENI  KVIPL G+ F EEKD YH+KLSDAE P VT++CKCT +PELN LELYKVELN VRHMVADISCLKQN+DMR+M  SKKT  KLT
Subjt:  SFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLT

Query:  VDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVIS
         DEMEGI  LINSA+LD+DVKGGLRWPLGKASSGDRF+VVG WHTV++SYVN F+R+KLRNADRYDFKT VGE + E+T K+K+ TSELLRE+AE DVIS
Subjt:  VDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVIS

Query:  DMLKDHLKLFWNHFVCNASCSSP
        DML ++LKLFWNHFVC+A+CSSP
Subjt:  DMLKDHLKLFWNHFVCNASCSSP

XP_038890293.1 uncharacterized protein LOC120079912 [Benincasa hispida]7.4e-17976.21Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQ--KSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRG
        MAYIPPHKRHSRDMEK SPTAELL PQFNRKLNLRP      + DL+PS  +VDRSGKIVYA+QA+ KWFSIGSS +GN FPSCVHLEPFS  +IE KRG
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQ--KSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRG

Query:  EKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTY
        EKPLALVN+NISQGNREEE E++TEPW SIV NLLPDLLSSV H KNEM+QD DVK  LVA++GKVLFHG+SEID+N+ P + TLRQL+RSFYTN+SD  
Subjt:  EKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTY

Query:  MENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYL
        +++IT KVIP  G+ F EEKDIYH+KLSDA RP VT+SCKCT +PELN L+LYKV+LN VRHMV DISCLKQ+ DMRLML SKKT EKLT DEMEGI  L
Subjt:  MENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYL

Query:  INSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLF
        INSAVLDQD  GGLRWPLGKA+SGDRFRV+GVWHTVAKSYVN F+RL LRNADRYDF+TS+GE TKE+T+KLKQ TSELLRE+AE+DVISDML D LK+F
Subjt:  INSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLF

Query:  WNHFVCNASCSS
        WNHFV + SC S
Subjt:  WNHFVCNASCSS

TrEMBL top hitse value%identityAlignment
A0A314ZA25 Uncharacterized protein2.2e-16847.19Show/hide
Query:  MGGVAWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCH
        MGG+ WT+EED LL+ CIEQYGEGKWHRVP LAGLNRCRKSCRLRWLNYLRPNIKRGSF  EEVDLI  LH LLGNRWS++A RLPGRT ND+KNYWNCH
Subjt:  MGGVAWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCH

Query:  LSKKLNGQLGGVEDQQANPTKEEAFFGKPQNWEHPQEESSKSRAKGKTCAADHDQNIN------GENSQEMILV-------------QNQ---------N
        LSKKLN Q    ED+  +    +    +  N+     +   S    ++ + +  Q +N      G NS   I +             QNQ         N
Subjt:  LSKKLNGQLGGVEDQQANPTKEEAFFGKPQNWEHPQEESSKSRAKGKTCAADHDQNIN------GENSQEMILV-------------QNQ---------N

Query:  QNLPIIVEQQSNSMGSLGNLQME---FKAL-----EDDHHQQGCNKREVWDDWISEMDLWIDSLYFDCALRS---------------------------C
        +N+P + E+  N   ++G+L ME   FK +     +DD    G +    W +W    D  ++ L   C  +S                            
Subjt:  QNLPIIVEQQSNSMGSLGNLQME---FKAL-----EDDHHQQGCNKREVWDDWISEMDLWIDSLYFDCALRS---------------------------C

Query:  CMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGE
         MAYIPPHKRHS++ E+  PT ELL PQF + LNLR           SKS  D  GK++YA+ A  +W  IG   + NQFPS V+LE  S  + E + GE
Subjt:  CMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGE

Query:  KPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMN--QDEDVKPRLVARVGKVLFHGVSEID----RNEPPTETTLRQLRRSFYTN
        KPLAL+NT++ +   E +  +   PWVS+ EN+L DLLSS E+VKN+M   + E+VKP LVARVGKVLF     ++    RN   TE  L+Q +R FYTN
Subjt:  KPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMN--QDEDVKPRLVARVGKVLFHGVSEID----RNEPPTETTLRQLRRSFYTN

Query:  ISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEME
        +  +Y ENI ++V+P  G+ F EEKD+YH+KLSD+ RP   +SCKC+V+ E   L+LYK+ELNQ+R MV DISC  +N+D+R+MLC+K+    LT DEM+
Subjt:  ISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEME

Query:  GIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKD
         I  LI+SAVLD DVKGGLRWPLGK SSGDR++VVG+WH +A  Y N  +RLK+R+ADR+DF+TS GEAT E++L LK+  S+L  E+ E   IS++L++
Subjt:  GIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKD

Query:  HLKLFWNHFVCN
        ++++ W++F+C+
Subjt:  HLKLFWNHFVCN

A0A6J1CTY4 uncharacterized protein LOC1110141912.2e-237100Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
        MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK

Query:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME
        PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME
Subjt:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME

Query:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN
        NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN
Subjt:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN

Query:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN
        SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN
Subjt:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN

Query:  HFVCNASCSSPSDTT
        HFVCNASCSSPSDTT
Subjt:  HFVCNASCSSPSDTT

A0A6J1HL67 uncharacterized protein LOC111465552 isoform X37.0e-15968.61Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
        MAY+PPHKRHS DMEK SPTAELL P FNRKL          DL+ S  +VDRSG+I+YA QA  KWF IG S + N FPSCV LEP +  + EH+ GEK
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK

Query:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME
         LALV +N+SQGN+EEE E+V EPW SIV  LLPD+LSS+EH+KNE+NQDE+VK  LVAR+GKV+FHG SE D N+ PT   + QL+RSF TN+S+TY+E
Subjt:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYME

Query:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN
        NI  KVIPL G+ F EEKD YH+KLSDAE P VT++CKCT +PE N L+LYKVELN VRHMVADISCLKQN+DMR+M  SKKT  KLT DEME I  LIN
Subjt:  NITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLIN

Query:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN
        SA+LD+DVKGGLRWPLGKASSGDRF+VVG WHTV++SYVN F+R+KLRNADRYDFKT VGE +KE+T K+K+ TSELLRE+ E DVI DML D+LKLFWN
Subjt:  SAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWN

Query:  HFVCNASCSSP
        +FVC+A+CSSP
Subjt:  HFVCNASCSSP

A0A6J1HPW6 uncharacterized protein LOC111465552 isoform X11.9e-16468.47Show/hide
Query:  IDSLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLE
        + S Y DCALRSC MAY+PPHKRHS DMEK SPTAELL P FNRKL          DL+ S  +VDRSG+I+YA QA  KWF IG S + N FPSCV LE
Subjt:  IDSLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLE

Query:  PFSATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQL
        P +  + EH+ GEK LALV +N+SQGN+EEE E+V EPW SIV  LLPD+LSS+EH+KNE+NQDE+VK  LVAR+GKV+FHG SE D N+ PT   + QL
Subjt:  PFSATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQL

Query:  RRSFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEK
        +RSF TN+S+TY+ENI  KVIPL G+ F EEKD YH+KLSDAE P VT++CKCT +PE N L+LYKVELN VRHMVADISCLKQN+DMR+M  SKKT  K
Subjt:  RRSFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEK

Query:  LTVDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDV
        LT DEME I  LINSA+LD+DVKGGLRWPLGKASSGDRF+VVG WHTV++SYVN F+R+KLRNADRYDFKT VGE +KE+T K+K+ TSELLRE+ E DV
Subjt:  LTVDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDV

Query:  ISDMLKDHLKLFWNHFVCNASCSSP
        I DML D+LKLFWN+FVC+A+CSSP
Subjt:  ISDMLKDHLKLFWNHFVCNASCSSP

A0A6J1HRV2 uncharacterized protein LOC111465552 isoform X22.5e-16468.79Show/hide
Query:  SLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPF
        S Y DCALRSC MAY+PPHKRHS DMEK SPTAELL P FNRKL          DL+ S  +VDRSG+I+YA QA  KWF IG S + N FPSCV LEP 
Subjt:  SLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPF

Query:  SATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRR
        +  + EH+ GEK LALV +N+SQGN+EEE E+V EPW SIV  LLPD+LSS+EH+KNE+NQDE+VK  LVAR+GKV+FHG SE D N+ PT   + QL+R
Subjt:  SATSIEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRR

Query:  SFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLT
        SF TN+S+TY+ENI  KVIPL G+ F EEKD YH+KLSDAE P VT++CKCT +PE N L+LYKVELN VRHMVADISCLKQN+DMR+M  SKKT  KLT
Subjt:  SFYTNISDTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLT

Query:  VDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVIS
         DEME I  LINSA+LD+DVKGGLRWPLGKASSGDRF+VVG WHTV++SYVN F+R+KLRNADRYDFKT VGE +KE+T K+K+ TSELLRE+ E DVI 
Subjt:  VDEMEGIAYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVIS

Query:  DMLKDHLKLFWNHFVCNASCSSP
        DML D+LKLFWN+FVC+A+CSSP
Subjt:  DMLKDHLKLFWNHFVCNASCSSP

SwissProt top hitse value%identityAlignment
A0A2R6S148 Transcription factor MYB14.1e-4781.19Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWTEEED LLKKCIE+YGEGKWH+VP  +GLNRCRKSCR+RWLNYLRPNI RG+FT +EVDLI  LH LLGNRWS+IAGRLPGRT+ND+KNYWN HL KK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

Query:  L
        L
Subjt:  L

Q9FE25 Transcription factor MYB757.3e-4477Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWT EED LL++CI +YGEGKWH+VP  AGLNRCRKSCRLRWLNYL+P+IKRG  + +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

Q9FNV8 Transcription factor MYB1141.9e-4478Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWT EED LL++CI +YGEGKWH+VP  AGLNRCRKSCRLRWLNYL+P+IKRG F+ +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

Q9FNV9 Transcription factor MYB1139.5e-4476.77Show/hide
Query:  WTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        WT EED LL++CI++YGEGKWHRVP   GLNRCRKSCRLRWLNYL+P+IKRG    +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  WTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

Q9ZTC3 Transcription factor MYB901.2e-4377Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWT EED LL+ CI++YGEGKWH+VP  AGLNRCRKSCRLRWLNYL+P+IKRG  + +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

Arabidopsis top hitse value%identityAlignment
AT1G56650.1 production of anthocyanin pigment 15.2e-4577Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWT EED LL++CI +YGEGKWH+VP  AGLNRCRKSCRLRWLNYL+P+IKRG  + +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

AT1G66370.1 myb domain protein 1136.7e-4576.77Show/hide
Query:  WTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        WT EED LL++CI++YGEGKWHRVP   GLNRCRKSCRLRWLNYL+P+IKRG    +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  WTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

AT1G66380.1 myb domain protein 1141.4e-4578Show/hide
Query:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK
        AWT EED LL++CI +YGEGKWH+VP  AGLNRCRKSCRLRWLNYL+P+IKRG F+ +EVDL+  LH LLGNRWS+IAGRLPGRTAND+KNYWN HLSKK
Subjt:  AWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKK

AT2G38430.1 unknown protein3.0e-7738.69Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
        M+YIPPHKRHS+D  K +P  + LV +F + L          D K S S V R+ KI+Y+  ++ KWF +GS+G  +  PS V   P S+ S+E ++GEK
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK

Query:  PLALVNTNISQ-----GNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNIS
        P  L+N N+       G R++E       W+ + E ++ DL+ + E VK EM +   V  RLVAR GK+ F+G     +  P  E +L+ L + F T++ 
Subjt:  PLALVNTNISQ-----GNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNIS

Query:  DTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGI
         +Y+++I   V+P  G    +EK+ Y VK+S   RP+ TI+CKCT   E   L +YKV+LN VRH+V D+SC+ +N+DMRL L +K+    L+  E+  I
Subjt:  DTYMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGI

Query:  AYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHL
          L+ SA +D +VKGGLRWP GKA S D ++   V H  A  Y N  +RL++R  DR++  T  GE  +E++L LK   +++  +  E   + +ML+D L
Subjt:  AYLINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHL

Query:  KLFWNHFVCNA
           W+   C+A
Subjt:  KLFWNHFVCNA

AT3G54310.1 unknown protein1.3e-6736.61Show/hide
Query:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK
        M+Y+PPHKR    +E  +  A  + P   +K N              K+    + KI+YA+  + +WF +GS    + F     L P S+   +    + 
Subjt:  MAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATSIEHKRGEK

Query:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEM---NQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDT
         + L  +++ +         +  PW+ + E +  D++   +  K  +      EDVK RL+AR G +         R+ P T++TL +L++ F TN+  +
Subjt:  PLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEM---NQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDT

Query:  YMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAY
        Y+EN+  + +   G    E K++YHVK+SD  RP VTISCKC              ELN +RH+  D+SCL Q++DMRLM+ SK+T   L+ +E+  +  
Subjt:  YMENITQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAY

Query:  LINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREE-AEFDVISDMLKDHLK
        L +SAV+D  VKGGL+WPLGK+S  DR+ V GVWHTV  +Y+N  MRL++R ADRYDF+T VG  + E+  KLK  ++ LL+EE  E   +SDMLKD LK
Subjt:  LINSAVLDQDVKGGLRWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREE-AEFDVISDMLKDHLK

Query:  LFWNHFV
          W +F+
Subjt:  LFWNHFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGTTGCTTGGACTGAAGAAGAAGATTACTTACTCAAGAAATGCATTGAGCAATATGGAGAAGGGAAATGGCATCGAGTTCCTCAGTTGGCTGGTTTAAACAG
ATGCAGAAAGAGTTGCAGACTGAGATGGTTGAATTATCTTCGTCCAAATATCAAGAGAGGAAGCTTCACACCAGAGGAAGTAGACCTCATTTTCAGCCTCCACAGTCTCT
TGGGCAACAGGTGGTCAGTAATTGCTGGAAGATTACCAGGAAGAACAGCAAATGATATAAAGAACTATTGGAATTGCCATTTGAGCAAGAAGCTGAATGGGCAATTAGGA
GGAGTTGAGGACCAGCAGGCAAACCCCACAAAAGAAGAAGCCTTTTTTGGGAAACCACAAAATTGGGAGCATCCTCAAGAAGAGAGCAGCAAATCAAGGGCAAAAGGAAA
GACGTGTGCTGCAGACCATGACCAAAATATTAATGGAGAAAACAGCCAAGAGATGATTTTGGTTCAAAATCAGAATCAGAATTTGCCCATTATTGTAGAACAACAAAGTA
ATAGCATGGGCAGTCTGGGGAATTTGCAAATGGAGTTCAAAGCATTGGAGGATGATCATCATCAACAGGGCTGCAACAAGAGGGAGGTGTGGGATGATTGGATTTCTGAG
ATGGATCTTTGGATTGATTCTTTATACTTTGATTGTGCATTGAGATCTTGCTGTATGGCCTATATACCTCCACACAAACGTCATTCAAGGGATATGGAGAAGGCTTCTCC
TACTGCAGAGTTACTTGTTCCTCAATTCAACAGAAAATTGAATCTGAGGCCAAAATTCACCCAAAAATCGGATTTGAAGCCATCTAAATCTGAAGTGGACAGGAGTGGAA
AGATAGTCTATGCTCATCAGGCTGTACTTAAATGGTTTAGCATTGGTAGTTCTGGTAATGGTAACCAGTTTCCCTCTTGTGTTCATCTTGAGCCGTTTTCTGCGACGTCC
ATTGAGCATAAAAGGGGTGAAAAACCCCTAGCTTTGGTGAACACTAATATATCTCAAGGAAATAGAGAAGAGGAAGTAGAAATCGTAACAGAACCATGGGTATCTATTGT
AGAAAATCTACTGCCTGACTTACTATCATCAGTCGAACATGTGAAGAATGAAATGAATCAAGATGAAGATGTGAAGCCAAGATTGGTCGCTAGGGTTGGAAAAGTTCTTT
TTCATGGGGTCTCTGAAATTGATAGAAATGAGCCGCCTACTGAAACAACACTGAGACAGTTAAGGAGATCTTTTTATACAAATATCTCTGACACTTACATGGAAAACATT
ACACAAAAAGTTATTCCATTAACTGGACTAGTGTTTGTAGAGGAGAAAGATATATATCATGTGAAGTTGTCGGATGCAGAGCGACCATCCGTGACCATATCATGTAAATG
CACTGTTGTGCCAGAGTTGAATAATCTTGAACTTTATAAGGTTGAACTCAACCAAGTGCGCCATATGGTGGCGGATATCTCGTGCCTGAAACAGAATATGGATATGAGGC
TGATGCTATGCTCCAAGAAGACTTCAGAAAAGCTGACAGTTGATGAGATGGAAGGCATAGCATATCTTATAAACTCTGCTGTTTTAGACCAAGATGTGAAGGGGGGCTTA
AGATGGCCTCTGGGAAAAGCATCCTCTGGGGACAGATTCAGAGTTGTTGGAGTTTGGCATACTGTTGCCAAGTCTTATGTAAATGCATTTATGAGGCTCAAACTAAGAAA
TGCCGACCGATACGATTTCAAGACTTCGGTTGGAGAAGCGACAAAAGAGATTACTTTGAAGCTGAAACAATTTACTTCCGAACTACTTAGAGAGGAAGCTGAGTTTGATG
TGATCTCTGATATGCTTAAGGACCACCTGAAGCTGTTTTGGAACCACTTTGTATGCAATGCATCCTGTTCATCCCCTTCTGATACAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGTTGCTTGGACTGAAGAAGAAGATTACTTACTCAAGAAATGCATTGAGCAATATGGAGAAGGGAAATGGCATCGAGTTCCTCAGTTGGCTGGTTTAAACAG
ATGCAGAAAGAGTTGCAGACTGAGATGGTTGAATTATCTTCGTCCAAATATCAAGAGAGGAAGCTTCACACCAGAGGAAGTAGACCTCATTTTCAGCCTCCACAGTCTCT
TGGGCAACAGGTGGTCAGTAATTGCTGGAAGATTACCAGGAAGAACAGCAAATGATATAAAGAACTATTGGAATTGCCATTTGAGCAAGAAGCTGAATGGGCAATTAGGA
GGAGTTGAGGACCAGCAGGCAAACCCCACAAAAGAAGAAGCCTTTTTTGGGAAACCACAAAATTGGGAGCATCCTCAAGAAGAGAGCAGCAAATCAAGGGCAAAAGGAAA
GACGTGTGCTGCAGACCATGACCAAAATATTAATGGAGAAAACAGCCAAGAGATGATTTTGGTTCAAAATCAGAATCAGAATTTGCCCATTATTGTAGAACAACAAAGTA
ATAGCATGGGCAGTCTGGGGAATTTGCAAATGGAGTTCAAAGCATTGGAGGATGATCATCATCAACAGGGCTGCAACAAGAGGGAGGTGTGGGATGATTGGATTTCTGAG
ATGGATCTTTGGATTGATTCTTTATACTTTGATTGTGCATTGAGATCTTGCTGTATGGCCTATATACCTCCACACAAACGTCATTCAAGGGATATGGAGAAGGCTTCTCC
TACTGCAGAGTTACTTGTTCCTCAATTCAACAGAAAATTGAATCTGAGGCCAAAATTCACCCAAAAATCGGATTTGAAGCCATCTAAATCTGAAGTGGACAGGAGTGGAA
AGATAGTCTATGCTCATCAGGCTGTACTTAAATGGTTTAGCATTGGTAGTTCTGGTAATGGTAACCAGTTTCCCTCTTGTGTTCATCTTGAGCCGTTTTCTGCGACGTCC
ATTGAGCATAAAAGGGGTGAAAAACCCCTAGCTTTGGTGAACACTAATATATCTCAAGGAAATAGAGAAGAGGAAGTAGAAATCGTAACAGAACCATGGGTATCTATTGT
AGAAAATCTACTGCCTGACTTACTATCATCAGTCGAACATGTGAAGAATGAAATGAATCAAGATGAAGATGTGAAGCCAAGATTGGTCGCTAGGGTTGGAAAAGTTCTTT
TTCATGGGGTCTCTGAAATTGATAGAAATGAGCCGCCTACTGAAACAACACTGAGACAGTTAAGGAGATCTTTTTATACAAATATCTCTGACACTTACATGGAAAACATT
ACACAAAAAGTTATTCCATTAACTGGACTAGTGTTTGTAGAGGAGAAAGATATATATCATGTGAAGTTGTCGGATGCAGAGCGACCATCCGTGACCATATCATGTAAATG
CACTGTTGTGCCAGAGTTGAATAATCTTGAACTTTATAAGGTTGAACTCAACCAAGTGCGCCATATGGTGGCGGATATCTCGTGCCTGAAACAGAATATGGATATGAGGC
TGATGCTATGCTCCAAGAAGACTTCAGAAAAGCTGACAGTTGATGAGATGGAAGGCATAGCATATCTTATAAACTCTGCTGTTTTAGACCAAGATGTGAAGGGGGGCTTA
AGATGGCCTCTGGGAAAAGCATCCTCTGGGGACAGATTCAGAGTTGTTGGAGTTTGGCATACTGTTGCCAAGTCTTATGTAAATGCATTTATGAGGCTCAAACTAAGAAA
TGCCGACCGATACGATTTCAAGACTTCGGTTGGAGAAGCGACAAAAGAGATTACTTTGAAGCTGAAACAATTTACTTCCGAACTACTTAGAGAGGAAGCTGAGTTTGATG
TGATCTCTGATATGCTTAAGGACCACCTGAAGCTGTTTTGGAACCACTTTGTATGCAATGCATCCTGTTCATCCCCTTCTGATACAACATGA
Protein sequenceShow/hide protein sequence
MGGVAWTEEEDYLLKKCIEQYGEGKWHRVPQLAGLNRCRKSCRLRWLNYLRPNIKRGSFTPEEVDLIFSLHSLLGNRWSVIAGRLPGRTANDIKNYWNCHLSKKLNGQLG
GVEDQQANPTKEEAFFGKPQNWEHPQEESSKSRAKGKTCAADHDQNINGENSQEMILVQNQNQNLPIIVEQQSNSMGSLGNLQMEFKALEDDHHQQGCNKREVWDDWISE
MDLWIDSLYFDCALRSCCMAYIPPHKRHSRDMEKASPTAELLVPQFNRKLNLRPKFTQKSDLKPSKSEVDRSGKIVYAHQAVLKWFSIGSSGNGNQFPSCVHLEPFSATS
IEHKRGEKPLALVNTNISQGNREEEVEIVTEPWVSIVENLLPDLLSSVEHVKNEMNQDEDVKPRLVARVGKVLFHGVSEIDRNEPPTETTLRQLRRSFYTNISDTYMENI
TQKVIPLTGLVFVEEKDIYHVKLSDAERPSVTISCKCTVVPELNNLELYKVELNQVRHMVADISCLKQNMDMRLMLCSKKTSEKLTVDEMEGIAYLINSAVLDQDVKGGL
RWPLGKASSGDRFRVVGVWHTVAKSYVNAFMRLKLRNADRYDFKTSVGEATKEITLKLKQFTSELLREEAEFDVISDMLKDHLKLFWNHFVCNASCSSPSDTT