; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G008190 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G008190
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionhomeobox-leucine zipper protein HAT4-like
Genome locationCmo_Chr06:4449890..4451610
RNA-Seq ExpressionCmoCh06G008190
SyntenyCmoCh06G008190
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR006712 - HD-ZIP protein, N-terminal
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596870.1 Homeobox-leucine zipper protein HAT2, partial [Cucurbita argyrosperma subsp. sororia]1.1e-5897.64Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
        + LNLRPSWNDHASSDRTSENRT PSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP

Query:  KQKLALAKQLGLRPRQVEVWFQNRRAR
        KQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  KQKLALAKQLGLRPRQVEVWFQNRRAR

KAG7030145.1 Homeobox-leucine zipper protein HAT2, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-5998.43Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
        + LNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP

Query:  KQKLALAKQLGLRPRQVEVWFQNRRAR
        KQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  KQKLALAKQLGLRPRQVEVWFQNRRAR

XP_022951800.1 homeobox-leucine zipper protein HAT4-like [Cucurbita moschata]1.2e-5998.43Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
        + LNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP

Query:  KQKLALAKQLGLRPRQVEVWFQNRRAR
        KQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  KQKLALAKQLGLRPRQVEVWFQNRRAR

XP_023005508.1 homeobox-leucine zipper protein HAT4-like [Cucurbita maxima]2.0e-5796.12Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
        + LNLRPSWNDHASSDRTSENR PPSPVDC  EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL

Query:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR
        NPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR

XP_023539940.1 homeobox-leucine zipper protein HAT4-like [Cucurbita pepo subsp. pepo]2.0e-5796.12Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
        + LNLRPSWNDHASSDRTSENR PPSPVDC  EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL

Query:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR
        NPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR

TrEMBL top hitse value%identityAlignment
A0A5D3CLM5 Homeobox-leucine zipper protein HAT45.9e-3974.29Show/hide
Query:  SWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRLTKDQSAV
        SW + ASSDRTSE           NR PPS  DC   EEEAA+SSPNSTVS+VSGKRSE E  NGEDLDG         DEED ETSRKKLRLTKDQSAV
Subjt:  SWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRLTKDQSAV

Query:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        LE+SFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

A0A6J1E9Y0 homeobox-leucine zipper protein HAT41.1e-4274.83Show/hide
Query:  VMLNLRPSWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRL
        + LNL PSW + ASSDRTSE           NR PPS  DC   EEEAA+SSPNSTVS+VSGKRSE ET NGEDLDG         DEED ETSRKKLRL
Subjt:  VMLNLRPSWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRL

Query:  TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

A0A6J1GJW6 homeobox-leucine zipper protein HAT4-like6.0e-6098.43Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
        + LNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNP

Query:  KQKLALAKQLGLRPRQVEVWFQNRRAR
        KQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  KQKLALAKQLGLRPRQVEVWFQNRRAR

A0A6J1HKW5 homeobox-leucine zipper protein HAT4-like1.5e-4274.83Show/hide
Query:  VMLNLRPSWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRL
        + LNL PSW + ASSDRTSE           NR PPS  DC   EEEAA+SSPNSTVS+VSGKRSE ET NGEDLDG         DEED ETSRKKLRL
Subjt:  VMLNLRPSWNDHASSDRTSE-----------NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDG---------DEEDAETSRKKLRL

Query:  TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  TKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

A0A6J1L2D1 homeobox-leucine zipper protein HAT4-like9.6e-5896.12Show/hide
Query:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
        + LNLRPSWNDHASSDRTSENR PPSPVDC  EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL
Subjt:  VMLNLRPSWNDHASSDRTSENRTPPSPVDC--EEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTL

Query:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR
        NPKQKLALAKQLGLRPRQVEVWFQNRRAR
Subjt:  NPKQKLALAKQLGLRPRQVEVWFQNRRAR

SwissProt top hitse value%identityAlignment
P46600 Homeobox-leucine zipper protein HAT16.1e-2551.79Show/hide
Query:  VMLNLRPS-----------WNDH--ASSDRTSEN-------RTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPETNN------GEDLD--------
        + LNL+P+           WN    +SSD+  +         + P+ VD    EEE  VSSPNST+ STVSGKR   E         G+DLD        
Subjt:  VMLNLRPS-----------WNDH--ASSDRTSEN-------RTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPETNN------GEDLD--------

Query:  ---GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
            DEE+    ET RKKLRL+KDQSAVLED+FKEHNTLNPKQKLALAK+LGL  RQVEVWFQNRRAR
Subjt:  ---GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

P46601 Homeobox-leucine zipper protein HAT24.6e-2554.79Show/hide
Query:  RPSWN---DHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPE------TNNGED--------------LDGDEEDAETSRKKLRLT
        R  WN   D  S  R  +  + PS V+C   EE+  VSSPNST+ ST+SGKRSE E        +G+D               D +E+  ETSRKKLRL+
Subjt:  RPSWN---DHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPE------TNNGED--------------LDGDEEDAETSRKKLRLT

Query:  KDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        KDQSA LE++FKEHNTLNPKQKLALAK+L L  RQVEVWFQNRRAR
Subjt:  KDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

Q05466 Homeobox-leucine zipper protein HAT43.2e-2656.43Show/hide
Query:  RPSWNDHASS------DRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETN---NGEDLDGDEEDAETSRKKLRLTKDQSAV
        R SWN+  +S          E RT         PPS    E  +E+A VSSPNSTVS+ +GKRSE E +    G     D+ED + SRKKLRL+KDQSA+
Subjt:  RPSWNDHASS------DRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETN---NGEDLDGDEEDAETSRKKLRLTKDQSAV

Query:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        LE++FK+H+TLNPKQK ALAKQLGLR RQVEVWFQNRRAR
Subjt:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

Q40691 Homeobox-leucine zipper protein HOX15.5e-2659.17Show/hide
Query:  NDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPE--TNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALA
        N   +++ T+     PS   C EE+EE   SSPNST+S++SGKR  P   T        DE+    SRKKLRL+KDQ+AVLED+FKEHNTLNPKQK ALA
Subjt:  NDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPE--TNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALA

Query:  KQLGLRPRQVEVWFQNRRAR
        +QL L+PRQVEVWFQNRRAR
Subjt:  KQLGLRPRQVEVWFQNRRAR

Q7XC54 Homeobox-leucine zipper protein HOX15.5e-2659.17Show/hide
Query:  NDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPE--TNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALA
        N   +++ T+     PS   C EE+EE   SSPNST+S++SGKR  P   T        DE+    SRKKLRL+KDQ+AVLED+FKEHNTLNPKQK ALA
Subjt:  NDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPE--TNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALA

Query:  KQLGLRPRQVEVWFQNRRAR
        +QL L+PRQVEVWFQNRRAR
Subjt:  KQLGLRPRQVEVWFQNRRAR

Arabidopsis top hitse value%identityAlignment
AT2G44910.1 homeobox-leucine zipper protein 41.3e-2254.4Show/hide
Query:  NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLD---------------GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQ
        NR   S    + EEE A VSSPNS VS++SG + +     G D +                D+ED    + SRKKLRL+KDQ+ VLE++FKEH+TLNPKQ
Subjt:  NRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLD---------------GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQ

Query:  KLALAKQLGLRPRQVEVWFQNRRAR
        KLALAKQL LR RQVEVWFQNRRAR
Subjt:  KLALAKQLGLRPRQVEVWFQNRRAR

AT3G60390.1 homeobox-leucine zipper protein 32.6e-2347.8Show/hide
Query:  LRPSW-NDHASSDRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTV-SGKRSEPET-------------------------NNGEDLDGDE
        ++ +W N   SS+R S+ R+          PS V  + E+E A VSSPNSTVS+V SGK+SE E                             +D DG  
Subjt:  LRPSW-NDHASSDRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTV-SGKRSEPET-------------------------NNGEDLDGDE

Query:  EDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
           ++SRKKLRL+K+Q+ VLE++FKEH+TLNPKQK+ALAKQL LR RQVEVWFQNRRAR
Subjt:  EDAETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

AT4G16780.1 homeobox protein 22.3e-2756.43Show/hide
Query:  RPSWNDHASS------DRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETN---NGEDLDGDEEDAETSRKKLRLTKDQSAV
        R SWN+  +S          E RT         PPS    E  +E+A VSSPNSTVS+ +GKRSE E +    G     D+ED + SRKKLRL+KDQSA+
Subjt:  RPSWNDHASS------DRTSENRT---------PPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETN---NGEDLDGDEEDAETSRKKLRLTKDQSAV

Query:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        LE++FK+H+TLNPKQK ALAKQLGLR RQVEVWFQNRRAR
Subjt:  LEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

AT4G17460.1 Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein4.3e-2651.79Show/hide
Query:  VMLNLRPS-----------WNDH--ASSDRTSEN-------RTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPETNN------GEDLD--------
        + LNL+P+           WN    +SSD+  +         + P+ VD    EEE  VSSPNST+ STVSGKR   E         G+DLD        
Subjt:  VMLNLRPS-----------WNDH--ASSDRTSEN-------RTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPETNN------GEDLD--------

Query:  ---GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
            DEE+    ET RKKLRL+KDQSAVLED+FKEHNTLNPKQKLALAK+LGL  RQVEVWFQNRRAR
Subjt:  ---GDEED---AETSRKKLRLTKDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR

AT5G47370.1 Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein3.3e-2654.79Show/hide
Query:  RPSWN---DHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPE------TNNGED--------------LDGDEEDAETSRKKLRLT
        R  WN   D  S  R  +  + PS V+C   EE+  VSSPNST+ ST+SGKRSE E        +G+D               D +E+  ETSRKKLRL+
Subjt:  RPSWN---DHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTV-STVSGKRSEPE------TNNGED--------------LDGDEEDAETSRKKLRLT

Query:  KDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR
        KDQSA LE++FKEHNTLNPKQKLALAK+L L  RQVEVWFQNRRAR
Subjt:  KDQSAVLEDSFKEHNTLNPKQKLALAKQLGLRPRQVEVWFQNRRAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGATTATCCTATAGAGTGGGATCTCACAGTCATGCTCAATCTCCGGCCTTCTTGGAATGATCATGCCTCATCAGATCGGACATCGGAAAACAGGACGCCGCCATC
GCCGGTGGATTGCGAGGAGGAGGAGGAGGAGGCGGCGGTGTCGAGTCCTAACAGTACGGTTTCGACTGTAAGTGGGAAGAGGAGTGAACCAGAGACCAACAACGGCGAGG
ATCTCGACGGCGATGAAGAAGACGCTGAAACTTCTAGAAAAAAACTTCGTCTCACTAAAGATCAGTCCGCCGTCTTGGAAGACAGCTTCAAAGAACACAACACTCTCAAT
CCGAAGCAAAAGTTAGCCTTGGCTAAACAATTGGGTCTCCGGCCACGACAAGTCGAAGTCTGGTTCCAAAACAGAAGGGCAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGATTATCCTATAGAGTGGGATCTCACAGTCATGCTCAATCTCCGGCCTTCTTGGAATGATCATGCCTCATCAGATCGGACATCGGAAAACAGGACGCCGCCATC
GCCGGTGGATTGCGAGGAGGAGGAGGAGGAGGCGGCGGTGTCGAGTCCTAACAGTACGGTTTCGACTGTAAGTGGGAAGAGGAGTGAACCAGAGACCAACAACGGCGAGG
ATCTCGACGGCGATGAAGAAGACGCTGAAACTTCTAGAAAAAAACTTCGTCTCACTAAAGATCAGTCCGCCGTCTTGGAAGACAGCTTCAAAGAACACAACACTCTCAAT
CCGAAGCAAAAGTTAGCCTTGGCTAAACAATTGGGTCTCCGGCCACGACAAGTCGAAGTCTGGTTCCAAAACAGAAGGGCAAGGTGAGGAAATTTCCTTAATCTCCAAGG
GTATTTTAGTCCAAATGATAAATTAAATCTAGAATCTAAAATCCCTCGTATTTGGACCCCTATACTTTTTTTTTTTACCAATTTAAACCCATGAATTTTCAGGTATTCAT
ATTTTGACCCATGATTTTGGTTTGAGCAGGACAAAGTTGAAGCAAACGGAAGTTGACTGTGAGTTTCTAAAGAGATGCTGTGAGAATCTAACGGAGGAGAACAGGCGGTT
GCAAAAAGAAGTTCAGGAACTGAGAGCACTGAAACTTTCCCCTCAGTTCTACATGCACATGGCCCCACCCACCACCCTCACCATGTGCCCGTCATGTGAGCGCGTGGCGG
TCCCATCCTCCACGTCGGCTCCACTCAACGTAACGAGAATGGGCCCGGCTCAAGCCCAAGCCCAAGCCCAAGCCCAAGCCCAATCCAAGGCCCTTCACTCTCGGCCTATC
CATGTCAACCCGTGGGCCTCCGCCATCCCGGCCCGGCCCTTTAACGCTCTCCACCCTCGCTCGTAAATACTCTCCGTTGGGCCGGATTTGGTGGGCCTCTTTTTTGTAAT
ATTGGGCCTATCTTGTCATATTAGGGCTCAGTAGTTTTATTTTTTTCTAAGATTTAGGAAATTGGAGACAAAAAATAAAAAAATAAAAAAAATTGAGTTGTAAAGTGAGA
ATGAGATGCCTAATTTATTACTATTATTATTATTTAATTGAAACTAATTTTAATTCAGGC
Protein sequenceShow/hide protein sequence
MVDYPIEWDLTVMLNLRPSWNDHASSDRTSENRTPPSPVDCEEEEEEAAVSSPNSTVSTVSGKRSEPETNNGEDLDGDEEDAETSRKKLRLTKDQSAVLEDSFKEHNTLN
PKQKLALAKQLGLRPRQVEVWFQNRRAR