; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G006170 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G006170
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionnuclear transcription factor Y subunit B-1
Genome locationCG_Chr09:5299531..5300785
RNA-Seq ExpressionClCG09G006170
SyntenyClCG09G006170
Gene Ontology termsGO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0016602 - CCAAT-binding factor complex (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0001228 - DNA-binding transcription activator activity, RNA polymerase II-specific (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR003956 - Transcription factor, NFYB/HAP3, conserved site
IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR027113 - Transcription factor NFYB/HAP3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451614.1 PREDICTED: nuclear transcription factor Y subunit B-1 [Cucumis melo]8.2e-7196.6Show/hide
Query:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MDENTGM ER VEFKYDF+GGG  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

XP_011659350.1 nuclear transcription factor Y subunit B-5 [Cucumis sativus]1.1e-7095.92Show/hide
Query:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MDENTGM ER +EFKYDF+GGG  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

XP_023549145.1 nuclear transcription factor Y subunit B-1-like [Cucurbita pepo subsp. pepo]1.3e-6892.52Show/hide
Query:  MDENTGMSERLVEFKYDF-SGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MD+NT M ERLVEFKYDF +GGGGGVGGSTGGSSEEAG+G+GG VKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDF-SGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYL+RYR++EGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

XP_038876756.1 nuclear transcription factor Y subunit B-1-like [Benincasa hispida]9.7e-7297.28Show/hide
Query:  MDENTGMSERLVEFKYDFSGGGG-GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MDENTGM ERLVEFKYDF+GGGG GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDFSGGGG-GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYR+MEGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

XP_039850662.1 nuclear transcription factor Y subunit B-2-like [Panicum virgatum]2.2e-7162.92Show/hide
Query:  GNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMDENTG
        G G    +KEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDD+C A   LGFDDY +P+RR+L      
Subjt:  GNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMDENTG

Query:  MSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVN
          + L  F       G G    +    + AG+         D LLPIANVGRIMK +LPP AKISK AKET+QEC +EFI FVTGEAS++C +E+RKT+N
Subjt:  MSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVN

Query:  GDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        GDDIC A+ +LG D YA  +RRYL RYR+ E   A  N G
Subjt:  GDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

TrEMBL top hitse value%identityAlignment
A0A0A0K954 CBFD_NFYB_HMF domain-containing protein5.2e-7195.92Show/hide
Query:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MDENTGM ER +EFKYDF+GGG  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

A0A0D9V9Y7 Uncharacterized protein4.4e-7050.78Show/hide
Query:  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYA
        G G S  G + E       ++KEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDD+C A + LGFDDY 
Subjt:  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYA

Query:  EPLRRYL-----MDENTGMSERLVEFKYDFSGGGGGVGGSTGGS--------------------------------------------------------
        +P+RRYL     ++ +   +          +GGGGG GG  G                                                          
Subjt:  EPLRRYL-----MDENTGMSERLVEFKYDFSGGGGGVGGSTGGS--------------------------------------------------------

Query:  -------SEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEP
               S  A     G+   QD LLPIANVGRIMK  LPP AKISK AKET+QEC +EFISFVTGEAS++C +E+RKTVNGDDIC A+ TLG D YA+ 
Subjt:  -------SEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEP

Query:  LRRYLVRYRDMEGERAQQN
        + RYL RYR+ E   A  N
Subjt:  LRRYLVRYRDMEGERAQQN

A0A0E0CCP2 Uncharacterized protein4.7e-7262.61Show/hide
Query:  EEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMD-
        E A      ++KEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDD+C A   LGFDDY +P+R   ++ 
Subjt:  EEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMD-

Query:  ---ENTGMSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
              G  + +       S          GG  +  G GV  +   QD LLPIANVGRIMK  LPP AKISK AKET+QEC +EFISFVTGEAS++C +
Subjt:  ---ENTGMSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDME
        E+RKTVNGDD+C A+ TLG D YA+ + RYL RYR+ E
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDME

A0A0E0FYG4 Uncharacterized protein9.8e-7052.88Show/hide
Query:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMDENTGMSERLV
        ++KEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDD+C A   LGFDDY +P+RRYL        +R  
Subjt:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLMDENTGMSERLV

Query:  EFKYDFSGGGGGVGGSTGGSSEEA------------------------------------------------------------GNGVGG---VVKEQDR
              SG G   G     SS  A                                                            G+G GG   +   QD 
Subjt:  EFKYDFSGGGGGVGGSTGGSSEEA------------------------------------------------------------GNGVGG---VVKEQDR

Query:  LLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQN
        LLPIANVGRIMK  LPP AKISK AKET+QEC +EFISFVTGEAS++C +E+RKTVNGDD+C A+ +LG D YA+ + RYL RYR+ E   A  N
Subjt:  LLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQN

A0A1S3BT19 nuclear transcription factor Y subunit B-14.0e-7196.6Show/hide
Query:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
        MDENTGM ER VEFKYDF+GGG  GVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK
Subjt:  MDENTGMSERLVEFKYDFSGGG-GGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHK

Query:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
        EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG
Subjt:  EKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG

SwissProt top hitse value%identityAlignment
O82248 Nuclear transcription factor Y subunit B-58.4e-4284.69Show/hide
Query:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGER
        +VKEQDRLLPIANVGRIMK ILP NAK+SKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDIC A+A LGFDDYA  L++YL RYR +EGE+
Subjt:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGER

P25209 Nuclear transcription factor Y subunit B1.4e-3665.25Show/hide
Query:  GGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDD
        GGG G    GS    G G GG V+EQDR LPIAN+ RIMK+ +P N KI+K+AKET+QECVSEFISF+T EASDKC +EKRKT+NGDD+  A+ATLGF+D
Subjt:  GGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDD

Query:  YAEPLRRYLVRYRDMEGE
        Y EPL+ YL +YR+MEG+
Subjt:  YAEPLRRYLVRYRDMEGE

Q0J7P4 Nuclear transcription factor Y subunit B-111.4e-3665.85Show/hide
Query:  SGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLG
        + GGGG G S G      G G     KEQDR LPIANV RIMK+ LP NAKISKE+KET+QECVSEFISFVTGEASDKC +EKRKT+NGDD+  A+ TLG
Subjt:  SGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLG

Query:  FDDYAEPLRRYLVRYRDMEGERA
        F+ Y  PL+ YL RYR+ EGE+A
Subjt:  FDDYAEPLRRYLVRYRDMEGERA

Q60EQ4 Nuclear transcription factor Y subunit B-32.8e-3764.23Show/hide
Query:  GGGGG---VGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALAT
        GGGGG    G   GG     G G GG V+EQDR LPIAN+ RIMK+ +P N KI+K+AKET+QECVSEFISF+T EASDKC +EKRKT+NGDD+  A+AT
Subjt:  GGGGG---VGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALAT

Query:  LGFDDYAEPLRRYLVRYRDMEGE
        LGF+DY EPL+ YL +YR+MEG+
Subjt:  LGFDDYAEPLRRYLVRYRDMEGE

Q9SLG0 Nuclear transcription factor Y subunit B-11.6e-3768.1Show/hide
Query:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL
        S  AG+G   GG V+EQDR LPIAN+ RIMK+ LPPN KI K+AK+T+QECVSEFISF+T EASDKC KEKRKTVNGDD+  A+ATLGF+DY EPL+ YL
Subjt:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL

Query:  VRYRDMEGERAQQNKG
         RYR++EG+    NKG
Subjt:  VRYRDMEGERAQQNKG

Arabidopsis top hitse value%identityAlignment
AT2G38880.1 nuclear factor Y, subunit B11.2e-3868.1Show/hide
Query:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL
        S  AG+G   GG V+EQDR LPIAN+ RIMK+ LPPN KI K+AK+T+QECVSEFISF+T EASDKC KEKRKTVNGDD+  A+ATLGF+DY EPL+ YL
Subjt:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL

Query:  VRYRDMEGERAQQNKG
         RYR++EG+    NKG
Subjt:  VRYRDMEGERAQQNKG

AT2G38880.2 nuclear factor Y, subunit B11.2e-3868.1Show/hide
Query:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL
        S  AG+G   GG V+EQDR LPIAN+ RIMK+ LPPN KI K+AK+T+QECVSEFISF+T EASDKC KEKRKTVNGDD+  A+ATLGF+DY EPL+ YL
Subjt:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL

Query:  VRYRDMEGERAQQNKG
         RYR++EG+    NKG
Subjt:  VRYRDMEGERAQQNKG

AT2G38880.3 nuclear factor Y, subunit B11.2e-3868.1Show/hide
Query:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL
        S  AG+G   GG V+EQDR LPIAN+ RIMK+ LPPN KI K+AK+T+QECVSEFISF+T EASDKC KEKRKTVNGDD+  A+ATLGF+DY EPL+ YL
Subjt:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL

Query:  VRYRDMEGERAQQNKG
         RYR++EG+    NKG
Subjt:  VRYRDMEGERAQQNKG

AT2G38880.5 nuclear factor Y, subunit B11.2e-3868.1Show/hide
Query:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL
        S  AG+G   GG V+EQDR LPIAN+ RIMK+ LPPN KI K+AK+T+QECVSEFISF+T EASDKC KEKRKTVNGDD+  A+ATLGF+DY EPL+ YL
Subjt:  SEEAGNG--VGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYL

Query:  VRYRDMEGERAQQNKG
         RYR++EG+    NKG
Subjt:  VRYRDMEGERAQQNKG

AT2G47810.1 nuclear factor Y, subunit B55.9e-4384.69Show/hide
Query:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGER
        +VKEQDRLLPIANVGRIMK ILP NAK+SKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDIC A+A LGFDDYA  L++YL RYR +EGE+
Subjt:  VVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAAAACACAGGCATGTCAGAGAGATTAGTAGAATTTAAGTACGATTTCAGCGGCGGCGGTGGCGGTGTCGGTGGTAGCACCGGTGGTTCGAGTGAGGAAGCTGG
TAACGGCGTTGGCGGGGTCGTAAAAGAGCAAGATCGGTTACTTCCAATAGCAAATGTGGGGAGGATCATGAAGCAAATTCTACCTCCAAATGCCAAAATCTCCAAAGAAG
CCAAAGAAACTATGCAAGAGTGCGTGTCGGAGTTCATCAGTTTCGTGACAGGCGAGGCGTCGGATAAGTGCCATAAAGAGAAGAGGAAGACTGTTAATGGTGATGATATT
TGCTGTGCTTTGGCCACACTTGGATTTGATGATTATGCTGAGCCTTTGAGAAGGTATTTGATGGATGAAAACACAGGCATGTCAGAGAGATTAGTAGAATTTAAGTACGA
TTTCAGCGGCGGCGGTGGCGGTGTCGGTGGTAGCACCGGTGGTTCGAGTGAGGAAGCTGGTAACGGCGTTGGCGGGGTCGTAAAAGAGCAAGATCGGTTACTTCCAATAG
CAAATGTGGGGAGGATCATGAAGCAAATTCTACCTCCAAATGCCAAAATCTCCAAAGAAGCCAAAGAAACTATGCAAGAGTGCGTGTCGGAGTTCATCAGTTTCGTGACA
GGCGAGGCGTCGGATAAGTGCCATAAAGAGAAGAGGAAGACTGTTAATGGTGATGATATTTGCTGTGCTTTGGCCACACTTGGATTTGATGATTATGCTGAGCCTTTGAG
AAGGTATTTGGTTAGGTATAGAGATATGGAGGGGGAGAGAGCTCAACAAAATAAGGGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAAAACACAGGCATGTCAGAGAGATTAGTAGAATTTAAGTACGATTTCAGCGGCGGCGGTGGCGGTGTCGGTGGTAGCACCGGTGGTTCGAGTGAGGAAGCTGG
TAACGGCGTTGGCGGGGTCGTAAAAGAGCAAGATCGGTTACTTCCAATAGCAAATGTGGGGAGGATCATGAAGCAAATTCTACCTCCAAATGCCAAAATCTCCAAAGAAG
CCAAAGAAACTATGCAAGAGTGCGTGTCGGAGTTCATCAGTTTCGTGACAGGCGAGGCGTCGGATAAGTGCCATAAAGAGAAGAGGAAGACTGTTAATGGTGATGATATT
TGCTGTGCTTTGGCCACACTTGGATTTGATGATTATGCTGAGCCTTTGAGAAGGTATTTGATGGATGAAAACACAGGCATGTCAGAGAGATTAGTAGAATTTAAGTACGA
TTTCAGCGGCGGCGGTGGCGGTGTCGGTGGTAGCACCGGTGGTTCGAGTGAGGAAGCTGGTAACGGCGTTGGCGGGGTCGTAAAAGAGCAAGATCGGTTACTTCCAATAG
CAAATGTGGGGAGGATCATGAAGCAAATTCTACCTCCAAATGCCAAAATCTCCAAAGAAGCCAAAGAAACTATGCAAGAGTGCGTGTCGGAGTTCATCAGTTTCGTGACA
GGCGAGGCGTCGGATAAGTGCCATAAAGAGAAGAGGAAGACTGTTAATGGTGATGATATTTGCTGTGCTTTGGCCACACTTGGATTTGATGATTATGCTGAGCCTTTGAG
AAGGTATTTGGTTAGGTATAGAGATATGGAGGGGGAGAGAGCTCAACAAAATAAGGGA
Protein sequenceShow/hide protein sequence
MDENTGMSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVTGEASDKCHKEKRKTVNGDDI
CCALATLGFDDYAEPLRRYLMDENTGMSERLVEFKYDFSGGGGGVGGSTGGSSEEAGNGVGGVVKEQDRLLPIANVGRIMKQILPPNAKISKEAKETMQECVSEFISFVT
GEASDKCHKEKRKTVNGDDICCALATLGFDDYAEPLRRYLVRYRDMEGERAQQNKG