; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024287 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024287
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHomeobox-leucine zipper protein family
Genome locationscaffold4:18392494..18399420
RNA-Seq ExpressionSpg024287
SyntenySpg024287
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594841.1 Serine/threonine/tyrosine-protein kinase HT1, partial [Cucurbita argyrosperma subsp. sororia]1.6e-3876Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K TKRRRRRSKAKEA  A GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSILLQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLESE   +++ +Q R  ++ ++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

KAG7026804.1 Homeobox-leucine zipper protein ATHB-21 [Cucurbita argyrosperma subsp. argyrosperma]7.3e-3976Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K TKRRRRRSKAKEA  A GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSILLQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLESE   +++ +Q R  +++++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

XP_004143500.1 homeobox-leucine zipper protein ATHB-40 [Cucumis sativus]1.2e-3864.63Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G+K TKRRRRRSKAKE   A GLKKRKLS EQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDS++LQK
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE
         HLESE    ++++Q +  ++++++    + +R    +S    +++E
Subjt:  PHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE

XP_022963111.1 homeobox-leucine zipper protein ATHB-40-like [Cucurbita moschata]7.3e-3976Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K TKRRRRRSKAKEA  A GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSILLQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLESE   +++ +Q R  +++++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

XP_023518445.1 homeobox-leucine zipper protein ATHB-40-like [Cucurbita pepo subsp. pepo]1.6e-3875.2Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K TKRRRRRSKAKEA  A GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSI+LQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLE E T  ++ +Q R  +++++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

TrEMBL top hitse value%identityAlignment
A0A0A0KGE8 Homeobox domain-containing protein6.0e-3964.63Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G+K TKRRRRRSKAKE   A GLKKRKLS EQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDS++LQK
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE
         HLESE    ++++Q +  ++++++    + +R    +S    +++E
Subjt:  PHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE

A0A1S3B2Q4 homeobox-leucine zipper protein ATHB-408.7e-3858.24Show/hide
Query:  SHMYVIMLPKISEHSVGSRLEVKGAKLTKRRRRRSKAKEAFRATG-LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLW
        SH+Y    P++    +  +   +G K TKRRRRRSKAKE     G LKKRKLS EQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRR  W
Subjt:  SHMYVIMLPKISEHSVGSRLEVKGAKLTKRRRRRSKAKEAFRATG-LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLW

Query:  KNKKLEEEYSTLRKVHDSILLQKPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE
        KNKKLEEEYSTL+K HDS+LLQK HLESE    ++++Q +  ++++++    + +R    +S    +++E
Subjt:  KNKKLEEEYSTLRKVHDSILLQKPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVE

A0A6J1HF68 homeobox-leucine zipper protein ATHB-40-like3.5e-3976Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K TKRRRRRSKAKEA  A GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSILLQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLESE   +++ +Q R  +++++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

A0A6J1IQG4 homeobox-leucine zipper protein ATHB-40-like1.3e-3361.94Show/hide
Query:  LGKFWDSHMYVIMLPKISEHSVGSRLEVKGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQN
        +  F++  +Y  MLP+        +   +G K TKRRRRRS+ KE     GLKKRKLS EQVKLLEMNFGNEHKLE+ERK+RLASELGLDP QVA+WFQN
Subjt:  LGKFWDSHMYVIMLPKISEHSVGSRLEVKGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQN

Query:  RRLLWKNKKLEEEYSTLRKVHDSILLQKPHLESE
        RR  WKN+KLE +YS L+K HDS +LQK HL+S+
Subjt:  RRLLWKNKKLEEEYSTLRKVHDSILLQKPHLESE

A0A6J1KN43 homeobox-leucine zipper protein ATHB-40-like5.1e-3874.4Show/hide
Query:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        +G K  KRRRRRSKAKEA    GLKKRKLS EQVKLLEMNFG+EHKLESERKDRLASELGLDPRQVAVWFQNRR  WKNKKLEEEYSTL+K HDSILLQ 
Subjt:  KGAKLTKRRRRRSKAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKR
         HLESE    ++ +Q R  +++++R
Subjt:  PHLESESTHSQIEDQSRPKRSDLKR

SwissProt top hitse value%identityAlignment
A2YN17 Homeobox-leucine zipper protein HOX141.0e-1953.45Show/hide
Query:  EVKGAKLTKRRRRRSKAKEAFRATG--------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLR
        EV+G +   RRRRR  A+      G         KKR+LS EQV++LE++F  E KLE+ RK  LASELGLDP+QVAVWFQNRR   K+K LEEE+S L+
Subjt:  EVKGAKLTKRRRRRSKAKEAFRATG--------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLR

Query:  KVHDSILLQKPHLESE
          HD+ +L K HLE+E
Subjt:  KVHDSILLQKPHLESE

O23208 Homeobox-leucine zipper protein ATHB-408.7e-2755.12Show/hide
Query:  KLTKRRRRRSKAKEAFRATG---LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        K  KRRR+++K   A    G    +KRKL+ EQV +LEM+FG+EHKLESERKDRLA+ELGLDPRQVAVWFQNRR  WKNK+LEEEY+ L+  HD++++ K
Subjt:  KLTKRRRRRSKAKEAFRATG---LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKRFS
          LESE    Q+++Q      +++R +
Subjt:  PHLESESTHSQIEDQSRPKRSDLKRFS

Q7XI85 Homeobox-leucine zipper protein HOX141.0e-1953.45Show/hide
Query:  EVKGAKLTKRRRRRSKAKEAFRATG--------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLR
        EV+G +   RRRRR  A+      G         KKR+LS EQV++LE++F  E KLE+ RK  LASELGLDP+QVAVWFQNRR   K+K LEEE+S L+
Subjt:  EVKGAKLTKRRRRRSKAKEAFRATG--------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLR

Query:  KVHDSILLQKPHLESE
          HD+ +L K HLE+E
Subjt:  KVHDSILLQKPHLESE

Q9LVR0 Homeobox-leucine zipper protein ATHB-532.4e-2446.34Show/hide
Query:  KRRRRRSKAKEAFRATG-------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQ
        +RR+RRSK   A            L+KRKL+ EQV +LE +FGNEHKLES RK+++A ELGLDPRQVAVWFQNRR  WKNKKLEEEY+ L+  HD+++L 
Subjt:  KRRRRRSKAKEAFRATG-------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQ

Query:  KPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVEVCLREWARQSSSSFEL
        +  LES+    ++ +Q    +S++++  +S  L     +S+   LSVE      A  + + FEL
Subjt:  KPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVEVCLREWARQSSSSFEL

Q9ZU70 Homeobox-leucine zipper protein ATHB-219.0e-2450.38Show/hide
Query:  AKLTKRRRRRSK----AKEAFRATG--LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI
        AK T+RR+R+SK    A+E         +KRKLS EQV++LE++F ++HKLESERKDRLASELGLDPRQVAVWFQNRR  WKNK++E+EY+ L+  +++ 
Subjt:  AKLTKRRRRRSK----AKEAFRATG--LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI

Query:  LLQKPHLESESTHSQIEDQSRPKRSDLKRFS
        +++K  L+SE  H  +++Q      +++R +
Subjt:  LLQKPHLESESTHSQIEDQSRPKRSDLKRFS

Arabidopsis top hitse value%identityAlignment
AT2G18550.1 homeobox protein 216.4e-2550.38Show/hide
Query:  AKLTKRRRRRSK----AKEAFRATG--LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI
        AK T+RR+R+SK    A+E         +KRKLS EQV++LE++F ++HKLESERKDRLASELGLDPRQVAVWFQNRR  WKNK++E+EY+ L+  +++ 
Subjt:  AKLTKRRRRRSK----AKEAFRATG--LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI

Query:  LLQKPHLESESTHSQIEDQSRPKRSDLKRFS
        +++K  L+SE  H  +++Q      +++R +
Subjt:  LLQKPHLESESTHSQIEDQSRPKRSDLKRFS

AT2G22430.1 homeobox protein 61.5e-1863.89Show/hide
Query:  KKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI
        KKR+LS+ QVK LE NF  E+KLE ERK +LA ELGL PRQVAVWFQNRR  WK K+LE++Y  L+  +DS+
Subjt:  KKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI

AT4G36740.1 homeobox protein 406.2e-2855.12Show/hide
Query:  KLTKRRRRRSKAKEAFRATG---LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK
        K  KRRR+++K   A    G    +KRKL+ EQV +LEM+FG+EHKLESERKDRLA+ELGLDPRQVAVWFQNRR  WKNK+LEEEY+ L+  HD++++ K
Subjt:  KLTKRRRRRSKAKEAFRATG---LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQK

Query:  PHLESESTHSQIEDQSRPKRSDLKRFS
          LESE    Q+++Q      +++R +
Subjt:  PHLESESTHSQIEDQSRPKRSDLKRFS

AT4G40060.1 homeobox protein 162.6e-1862.5Show/hide
Query:  KKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI
        KKR+L ++QVK LE NF  E+KLE ERK +LA ELGL PRQVAVWFQNRR  WK K+LE++Y  L+  +DS+
Subjt:  KKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSI

AT5G66700.1 homeobox 531.7e-2546.34Show/hide
Query:  KRRRRRSKAKEAFRATG-------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQ
        +RR+RRSK   A            L+KRKL+ EQV +LE +FGNEHKLES RK+++A ELGLDPRQVAVWFQNRR  WKNKKLEEEY+ L+  HD+++L 
Subjt:  KRRRRRSKAKEAFRATG-------LKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQ

Query:  KPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVEVCLREWARQSSSSFEL
        +  LES+    ++ +Q    +S++++  +S  L     +S+   LSVE      A  + + FEL
Subjt:  KPHLESESTHSQIEDQSRPKRSDLKRFSVSTLLRLRRPSSACIKLSVEVCLREWARQSSSSFEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATGCGCATACTTGTTCTACTTAGAGAGATCCATATGCGTTACCTGTCAAGGATTAAGCATACTGCTACCGCATCCAGCTGGCCAAGCGGAGATTAGTAATGAAAA
GATTATCGCAACTAAGCAGAAAGCAATCAATGTTTCATTTAAATCAAAATATGCGCTTACAAGATGTACTTATCACCGCATGAGGTGTGAACCACTTACTCAGTACCGTG
GTTTTGTACTGACCCACCACCAGGTTTTGCAGGTTGATTTTTGGGGCGTTACAGTCAGGGATCGGTGTGATGAGTCTAGAAAGCATCGGTGCACCTTGGGTACAAATGGC
CAAGGGGCGATGCATAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGAACGTCGAGCTCCGTAGAGAGCATTGTGGCCCTGGGTACAAATGGTCAAGGGACAGTGCG
ACTCGAAGGGATAGTTGTTGCTTATTATTGTCGGATAGTTGTTGTCTTGTTAGGTTTAGTTTCTGAAATTATTGTTGCTGAAACTCCAGGCTTTGGTCTTGGAAAATTTT
GGGATAGTCATATGTATGTCATTATGCTGCCGAAAATTTCGGAGCACTCGGTTGGTTCTCGGTTAGAAGTTAAGGGAGCTAAGCTGACGAAGAGACGGCGGAGGAGGAGC
AAAGCGAAGGAGGCGTTCAGGGCGACGGGGTTGAAGAAGAGGAAGCTGAGTTTAGAGCAGGTGAAGCTTTTGGAGATGAATTTTGGGAACGAGCACAAATTGGAATCGGA
GAGGAAAGATCGGTTGGCTTCTGAACTGGGACTTGATCCCCGACAGGTCGCCGTCTGGTTTCAGAACCGTCGGCTGCTGTGGAAGAACAAGAAGCTCGAAGAGGAATATT
CCACTCTCAGAAAAGTTCATGATTCTATTCTTCTCCAAAAACCCCATCTTGAATCTGAGTCCACGCACTCCCAGATCGAGGACCAGTCACGTCCGAAAAGATCAGATCTA
AAACGATTTTCGGTCAGCACGCTTCTTCGTCTCCGGCGACCATCTTCGGCGTGTATCAAGCTTTCAGTGGAGGTGTGTTTGCGCGAATGGGCGCGGCAGTCTTCTTCTTC
CTTCGAACTCCGGCGACGGCCTAAGGCAATTGCAAGCTTTTTCCACCGTGGGTCTTCTCCTTCGAATTCTGACAACATGGACGGTAGCGATCCCACAATCGCGATCTCAA
GGTGGTCCAAGAGAATAGCGGGTGTTCTTGATTGGTGGTGTCCATTGGTAGAAGATTTGGAGCATAGTGCTATTTTCGGCCTTCAAATCATGTTTACGGCTGCTGGAAGA
AGAAAATCTAGGCAACCAGGGGTGTGGCGTCTCGACGCTGAAGAACAGGCGTTGGAAATTTCGTGCTGGCGTCGAGACGCCAACTTCAGCGTCTCGACGCCGGTGGCGGT
TTCGCTGAAACGCGAGTCGTGTGCGTTTTTCATCCGAAGCATGTGTGGTTGCAACGGTTTCGTCAATTTGGAGTTGGTTTTGGTCTGGTTCGTTGGGGTTTGGTCCGGTT
TAAGGCTGGTTGGTCTGGTTCTTGTAGCAGTTGGTCCGGTTCGAGCTGTGTGGGTCCGATTCGGGCGATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATGCGCATACTTGTTCTACTTAGAGAGATCCATATGCGTTACCTGTCAAGGATTAAGCATACTGCTACCGCATCCAGCTGGCCAAGCGGAGATTAGTAATGAAAA
GATTATCGCAACTAAGCAGAAAGCAATCAATGTTTCATTTAAATCAAAATATGCGCTTACAAGATGTACTTATCACCGCATGAGGTGTGAACCACTTACTCAGTACCGTG
GTTTTGTACTGACCCACCACCAGGTTTTGCAGGTTGATTTTTGGGGCGTTACAGTCAGGGATCGGTGTGATGAGTCTAGAAAGCATCGGTGCACCTTGGGTACAAATGGC
CAAGGGGCGATGCATAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGAACGTCGAGCTCCGTAGAGAGCATTGTGGCCCTGGGTACAAATGGTCAAGGGACAGTGCG
ACTCGAAGGGATAGTTGTTGCTTATTATTGTCGGATAGTTGTTGTCTTGTTAGGTTTAGTTTCTGAAATTATTGTTGCTGAAACTCCAGGCTTTGGTCTTGGAAAATTTT
GGGATAGTCATATGTATGTCATTATGCTGCCGAAAATTTCGGAGCACTCGGTTGGTTCTCGGTTAGAAGTTAAGGGAGCTAAGCTGACGAAGAGACGGCGGAGGAGGAGC
AAAGCGAAGGAGGCGTTCAGGGCGACGGGGTTGAAGAAGAGGAAGCTGAGTTTAGAGCAGGTGAAGCTTTTGGAGATGAATTTTGGGAACGAGCACAAATTGGAATCGGA
GAGGAAAGATCGGTTGGCTTCTGAACTGGGACTTGATCCCCGACAGGTCGCCGTCTGGTTTCAGAACCGTCGGCTGCTGTGGAAGAACAAGAAGCTCGAAGAGGAATATT
CCACTCTCAGAAAAGTTCATGATTCTATTCTTCTCCAAAAACCCCATCTTGAATCTGAGTCCACGCACTCCCAGATCGAGGACCAGTCACGTCCGAAAAGATCAGATCTA
AAACGATTTTCGGTCAGCACGCTTCTTCGTCTCCGGCGACCATCTTCGGCGTGTATCAAGCTTTCAGTGGAGGTGTGTTTGCGCGAATGGGCGCGGCAGTCTTCTTCTTC
CTTCGAACTCCGGCGACGGCCTAAGGCAATTGCAAGCTTTTTCCACCGTGGGTCTTCTCCTTCGAATTCTGACAACATGGACGGTAGCGATCCCACAATCGCGATCTCAA
GGTGGTCCAAGAGAATAGCGGGTGTTCTTGATTGGTGGTGTCCATTGGTAGAAGATTTGGAGCATAGTGCTATTTTCGGCCTTCAAATCATGTTTACGGCTGCTGGAAGA
AGAAAATCTAGGCAACCAGGGGTGTGGCGTCTCGACGCTGAAGAACAGGCGTTGGAAATTTCGTGCTGGCGTCGAGACGCCAACTTCAGCGTCTCGACGCCGGTGGCGGT
TTCGCTGAAACGCGAGTCGTGTGCGTTTTTCATCCGAAGCATGTGTGGTTGCAACGGTTTCGTCAATTTGGAGTTGGTTTTGGTCTGGTTCGTTGGGGTTTGGTCCGGTT
TAAGGCTGGTTGGTCTGGTTCTTGTAGCAGTTGGTCCGGTTCGAGCTGTGTGGGTCCGATTCGGGCGATTTTGA
Protein sequenceShow/hide protein sequence
MSCAYLFYLERSICVTCQGLSILLPHPAGQAEISNEKIIATKQKAINVSFKSKYALTRCTYHRMRCEPLTQYRGFVLTHHQVLQVDFWGVTVRDRCDESRKHRCTLGTNG
QGAMHSSRPWVQMVKGRTSSSVESIVALGTNGQGTVRLEGIVVAYYCRIVVVLLGLVSEIIVAETPGFGLGKFWDSHMYVIMLPKISEHSVGSRLEVKGAKLTKRRRRRS
KAKEAFRATGLKKRKLSLEQVKLLEMNFGNEHKLESERKDRLASELGLDPRQVAVWFQNRRLLWKNKKLEEEYSTLRKVHDSILLQKPHLESESTHSQIEDQSRPKRSDL
KRFSVSTLLRLRRPSSACIKLSVEVCLREWARQSSSSFELRRRPKAIASFFHRGSSPSNSDNMDGSDPTIAISRWSKRIAGVLDWWCPLVEDLEHSAIFGLQIMFTAAGR
RKSRQPGVWRLDAEEQALEISCWRRDANFSVSTPVAVSLKRESCAFFIRSMCGCNGFVNLELVLVWFVGVWSGLRLVGLVLVAVGPVRAVWVRFGRF