CuGenDBv2

Clc08G14800 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G14800
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: zinc finger homeobox protein 4-like isoform X1
Genome location: ClcChr08:25729809..25735522
RNA-Seq Expression: Clc08G14800
Synteny: Clc08G14800
Gene Ontology terms: NA
InterPro domains: NA
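The genome location field can be split into a sequence name and a coordinate pair. The snippet below is a minimal sketch rather than part of the database record: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'ClcChr08:25729809..25735522'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("ClcChr08:25729809..25735522")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 5714 bp on ClcChr08.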


Homology
GenBank top hits: e value, % identity, alignment
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo] (e value: 6.0e-85, % identity: 70.19)
Query:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR
        +HF+A IL EL LLIQ+S  SLGL PSWP+RRKRSAV SPPD +SV+ Q PPPPSSSE  KESSPTTPLS N   L RSESDEN    KVSK+KA LDK+
Subjt:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR

Query:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN
        FQYLETIDKLTHQNQAL  DVEAMKQH+ HLK INS LKAKKQE  MILGGS NQSEIPEIGTS+       SN+EN+LH+ +PS+KNQTAP+A+QSN N
Subjt:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN

Query:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA
        QN QIP G IPL D    PMGIPDLNLTI+   +++++KYMAA+ARQ RIRIWKNK  NN+NG A
Subjt:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo] (e value: 6.0e-85, % identity: 64.45)
Query:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL
        S  HQC  S  +      E     F+AQIL +LPLLIQ+S+ SLGLSPSWP+RRKRSAV+SPPD+ S+I Q P PP     SSE  KESSPTTPLSLNSL
Subjt:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------
        PLSRSESDEN T AKVSKKKA +DK+ QYLETIDKLTHQ QALEGD+EAMK+H+ +LK INS LKAKKQE IL G  N S  PEIGTS+           
Subjt:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------

Query:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH
            SN+EN+  + EPS+KNQT P A+Q NSN+N+QIPIG IPLYDPS GPMGIPDLNL++++I    ++KY+AARARQ RI+IWKNKN NNNNGA KL 
Subjt:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH

Query:  S
        S
Subjt:  S

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus] (e value: 2.5e-70, % identity: 61.45)
Query:  HFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKRF
        H +A IL ELPLLIQ+S  SLGL PSWP+RRKRSAV SP   ++V+ Q PPPPSSSE  KE+SPTTPLSL+SL LSRSESDEN    KVSK+KA L K+F
Subjt:  HFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKRF

Query:  QYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSAS-------NMENHLHQREPSIKNQTAPMADQSNSNQ
        +  E++DKLTHQNQAL  + EA KQ + H K INS LKAKKQE  MILGGS N+SEIPE GTS S       NMEN+LH+ EPS KNQTAPMA+QSN NQ
Subjt:  QYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSAS-------NMENHLHQREPSIKNQTAPMADQSNSNQ

Query:  NFQIPIGAIPLYDPSFGPMGIPDLNLTIDE---IDFSKYMAARARQIRIRIWKNK-----------NKNNNNGAA
        N QIPI  IPL D     MGIPDLNLT+++   +++ K +AA+ARQ R RI KNK            +NN NG A
Subjt:  NFQIPIGAIPLYDPSFGPMGIPDLNLTIDE---IDFSKYMAARARQIRIRIWKNK-----------NKNNNNGAA

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus] (e value: 1.1e-86, % identity: 64.57)
Query:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVI--PQRPPPP--SSSENVKESSPTTPLSLNSL
        S  HQC  S  +      E    HF+AQIL +LPLLIQQSH SLGLSPSWP+RRKRSAV+SPPD++S+I  P  PPPP   SSE  KESSPTTPLSL+SL
Subjt:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVI--PQRPPPP--SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------
        PLSRSESDEN T AKVSKKKA +DK+ QYLETI+KLTHQ QALEGD+EAMK+H+ +LK INS LKAKKQE ILGG +N S  P+ GTS            
Subjt:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------

Query:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNK-NKNNNNGAAKL
            SN+EN+  + EPS+KNQT P+A+QSNS QN+QIPIG IPLYDPS GPMGIPDLNL++++I   +++KY+AA+ARQ RI+IWKNK N NNNNGA KL
Subjt:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNK-NKNNNNGAAKL

Query:  HS
         S
Subjt:  HS

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima] (e value: 1.2e-64, % identity: 61.51)
Query:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR
        +H +AQILLE     ++S + LG  P W LRRKRSA+ SPP+S++ +P  P     S+ VKESSPT+PL LNSLPLSRSESDE +T AK SKKKASLDK+
Subjt:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR

Query:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILG--GSNNQSEIPEIGTSASNME-NHLHQREPSIKNQTAPMADQSNS-NQNFQI
         Q++E ID+LT QNQ L+G+ EAMKQHYNHLKAINS LKAKKQEMILG   S N+S IPEIGTS+S ME   L   E S   Q APMA+QSN+ +QNFQI
Subjt:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILG--GSNNQSEIPEIGTSASNME-NHLHQREPSIKNQTAPMADQSNS-NQNFQI

Query:  PIGAIPLYDP-SFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLHS
        PIG IP YDP S  PMGIPDLN++++EI   ++S++MAARAR+ RI+I KNK    NNG  KL +
Subjt:  PIGAIPLYDP-SFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLHS

TrEMBL top hits: e value, % identity, alignment
A0A0A0LRP1 Uncharacterized protein (e value: 5.3e-87, % identity: 64.57)
Query:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVI--PQRPPPP--SSSENVKESSPTTPLSLNSL
        S  HQC  S  +      E    HF+AQIL +LPLLIQQSH SLGLSPSWP+RRKRSAV+SPPD++S+I  P  PPPP   SSE  KESSPTTPLSL+SL
Subjt:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVI--PQRPPPP--SSSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------
        PLSRSESDEN T AKVSKKKA +DK+ QYLETI+KLTHQ QALEGD+EAMK+H+ +LK INS LKAKKQE ILGG +N S  P+ GTS            
Subjt:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------

Query:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNK-NKNNNNGAAKL
            SN+EN+  + EPS+KNQT P+A+QSNS QN+QIPIG IPLYDPS GPMGIPDLNL++++I   +++KY+AA+ARQ RI+IWKNK N NNNNGA KL
Subjt:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNK-NKNNNNGAAKL

Query:  HS
         S
Subjt:  HS

A0A1S3BAR4 uncharacterized protein LOC103488049 (e value: 2.9e-85, % identity: 64.45)
Query:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL
        S  HQC  S  +      E     F+AQIL +LPLLIQ+S+ SLGLSPSWP+RRKRSAV+SPPD+ S+I Q P PP     SSE  KESSPTTPLSLNSL
Subjt:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------
        PLSRSESDEN T AKVSKKKA +DK+ QYLETIDKLTHQ QALEGD+EAMK+H+ +LK INS LKAKKQE IL G  N S  PEIGTS+           
Subjt:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------

Query:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH
            SN+EN+  + EPS+KNQT P A+Q NSN+N+QIPIG IPLYDPS GPMGIPDLNL++++I    ++KY+AARARQ RI+IWKNKN NNNNGA KL 
Subjt:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH

Query:  S
        S
Subjt:  S

A0A1S3BC34 uncharacterized protein LOC103488048 (e value: 2.9e-85, % identity: 70.19)
Query:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR
        +HF+A IL EL LLIQ+S  SLGL PSWP+RRKRSAV SPPD +SV+ Q PPPPSSSE  KESSPTTPLS N   L RSESDEN    KVSK+KA LDK+
Subjt:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR

Query:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN
        FQYLETIDKLTHQNQAL  DVEAMKQH+ HLK INS LKAKKQE  MILGGS NQSEIPEIGTS+       SN+EN+LH+ +PS+KNQTAP+A+QSN N
Subjt:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN

Query:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA
        QN QIP G IPL D    PMGIPDLNLTI+   +++++KYMAA+ARQ RIRIWKNK  NN+NG A
Subjt:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA

A0A5A7VA15 Uncharacterized protein (e value: 2.9e-85, % identity: 70.19)
Query:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR
        +HF+A IL EL LLIQ+S  SLGL PSWP+RRKRSAV SPPD +SV+ Q PPPPSSSE  KESSPTTPLS N   L RSESDEN    KVSK+KA LDK+
Subjt:  KHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKR

Query:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN
        FQYLETIDKLTHQNQAL  DVEAMKQH+ HLK INS LKAKKQE  MILGGS NQSEIPEIGTS+       SN+EN+LH+ +PS+KNQTAP+A+QSN N
Subjt:  FQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQE--MILGGSNNQSEIPEIGTSA-------SNMENHLHQREPSIKNQTAPMADQSNSN

Query:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA
        QN QIP G IPL D    PMGIPDLNLTI+   +++++KYMAA+ARQ RIRIWKNK  NN+NG A
Subjt:  QNFQIPIGAIPLYDPSFGPMGIPDLNLTID---EIDFSKYMAARARQIRIRIWKNKNKNNNNGAA

A0A5A7VHE1 Uncharacterized protein (e value: 2.9e-85, % identity: 64.45)
Query:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL
        S  HQC  S  +      E     F+AQIL +LPLLIQ+S+ SLGLSPSWP+RRKRSAV+SPPD+ S+I Q P PP     SSE  KESSPTTPLSLNSL
Subjt:  SHLHQCKPSIMNQTAPMAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPS----SSENVKESSPTTPLSLNSL

Query:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------
        PLSRSESDEN T AKVSKKKA +DK+ QYLETIDKLTHQ QALEGD+EAMK+H+ +LK INS LKAKKQE IL G  N S  PEIGTS+           
Subjt:  PLSRSESDENNTKAKVSKKKASLDKRFQYLETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSA-----------

Query:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH
            SN+EN+  + EPS+KNQT P A+Q NSN+N+QIPIG IPLYDPS GPMGIPDLNL++++I    ++KY+AARARQ RI+IWKNKN NNNNGA KL 
Subjt:  ----SNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIPDLNLTIDEI---DFSKYMAARARQIRIRIWKNKNKNNNNGAAKLH

Query:  S
        S
Subjt:  S

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
No hits found

Sequences
CDS sequence
ATGGCTTCCACTACTCCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGAGTTCACCCCTGACCAACACTTTGTTGCTCAAATCCTCATCGACTTACCTCTTCTCAT
TCAACAATCCGAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTCTCCGACGCAAGAGATCCGCCGTCAATTCCCCCCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAATGTCAAGGAGTCAAGCCCTACTACTCCTCTCTCGCTCAACTCTTTGCCTTTGTCGCGGAGTGAATCTGATGAGAATAATACCAAACCT
AAGGTCACCAAGAAGAAAGCCTCTCTCCATAAGAGATTTCAGTATTTGGAAACCATTGATAAATTGACCCACCGGAATCAAGCTCTGGAAGGGGACGTTGAGGCTATGAA
GCAACATTATAATCATCTGAAAGCTATCAATTCGACGTTGAAGGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAAAAATTCCAGAAATTGGAACCT
CAAGTTCGATCGCCATGGAAATGGCTAAGTTAACTGTGAAATCCTCAGCCTCAAATCTGGAGAGTCATCTTCATCAATGTAAACCGTCGATCATGAATCAGACGGCTCCG
ATGGCAGAACAGAGCAACAAACACTTCATCGCTCAAATCCTCCTCGAATTACCTCTTCTCATTCAACAATCCCACCTTTCTCTTGGCTTATCCCCTTCCTGGCCTCTCCG
ACGCAAGAGATCCGCCGTCAATTCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACGGCCGCCTCCTCCCTCGTCGTCCGAGAATGTCAAGGAGTCCAGTCCTACTACTC
CGCTTTCACTTAACTCTTTACCTCTGTCGCGGAGTGAATCTGATGAGAATAATACTAAGGCTAAGGTCTCCAAGAAGAAAGCCTCTCTCGATAAGAGATTTCAGTATTTG
GAAACCATTGACAAATTGACCCACCAGAATCAAGCTCTGGAAGGGGACGTTGAGGCTATGAAGCAACATTATAATCATCTGAAAGCTATCAATTCGACGTTGAAGGCCAA
GAAGCAAGAGATGATTTTGGGTGGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCTCAGCCTCAAATATGGAGAATCATCTTCATCAACGTGAACCGTCGATCA
AGAATCAGACGGCTCCGATGGCAGACCAGAGCAACAGTAATCAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTGTATGATCCTTCATTTGGTCCAATGGGGATTCCT
GATTTAAACCTCACTATTGACGAAATTGATTTCTCAAAATACATGGCTGCTAGAGCAAGACAGATAAGGATTCGGATCTGGAAGAACAAGAACAAAAACAACAACAATGG
AGCTGCCAAATTGCATTCCTAA
mRNA sequence
ATGGCTTCCACTACTCCCACTCATCAATGCTCCACCTCCTTCGATTCCGACGAGTTCACCCCTGACCAACACTTTGTTGCTCAAATCCTCATCGACTTACCTCTTCTCAT
TCAACAATCCGAGTTTTCTCTTGGCTTATCCCCTTCCTGGTCTCTCCGACGCAAGAGATCCGCCGTCAATTCCCCCCCGGACTCCGCCTCCGTCATCCCCCAACCGCCGC
CTCCTCCATCGTCGTCCGAGAATGTCAAGGAGTCAAGCCCTACTACTCCTCTCTCGCTCAACTCTTTGCCTTTGTCGCGGAGTGAATCTGATGAGAATAATACCAAACCT
AAGGTCACCAAGAAGAAAGCCTCTCTCCATAAGAGATTTCAGTATTTGGAAACCATTGATAAATTGACCCACCGGAATCAAGCTCTGGAAGGGGACGTTGAGGCTATGAA
GCAACATTATAATCATCTGAAAGCTATCAATTCGACGTTGAAGGCCAAGAAGCAAGAGATGATTCTGGGTGGTTCCAATAATCAATCAAAAATTCCAGAAATTGGAACCT
CAAGTTCGATCGCCATGGAAATGGCTAAGTTAACTGTGAAATCCTCAGCCTCAAATCTGGAGAGTCATCTTCATCAATGTAAACCGTCGATCATGAATCAGACGGCTCCG
ATGGCAGAACAGAGCAACAAACACTTCATCGCTCAAATCCTCCTCGAATTACCTCTTCTCATTCAACAATCCCACCTTTCTCTTGGCTTATCCCCTTCCTGGCCTCTCCG
ACGCAAGAGATCCGCCGTCAATTCCCCGCCGGACTCCGCCTCCGTCATCCCCCAACGGCCGCCTCCTCCCTCGTCGTCCGAGAATGTCAAGGAGTCCAGTCCTACTACTC
CGCTTTCACTTAACTCTTTACCTCTGTCGCGGAGTGAATCTGATGAGAATAATACTAAGGCTAAGGTCTCCAAGAAGAAAGCCTCTCTCGATAAGAGATTTCAGTATTTG
GAAACCATTGACAAATTGACCCACCAGAATCAAGCTCTGGAAGGGGACGTTGAGGCTATGAAGCAACATTATAATCATCTGAAAGCTATCAATTCGACGTTGAAGGCCAA
GAAGCAAGAGATGATTTTGGGTGGTTCCAATAATCAATCAGAAATTCCAGAAATTGGGACCTCAGCCTCAAATATGGAGAATCATCTTCATCAACGTGAACCGTCGATCA
AGAATCAGACGGCTCCGATGGCAGACCAGAGCAACAGTAATCAGAATTTCCAAATTCCAATTGGGGCAATTCCTTTGTATGATCCTTCATTTGGTCCAATGGGGATTCCT
GATTTAAACCTCACTATTGACGAAATTGATTTCTCAAAATACATGGCTGCTAGAGCAAGACAGATAAGGATTCGGATCTGGAAGAACAAGAACAAAAACAACAACAATGG
AGCTGCCAAATTGCATTCCTAA
Protein sequence
MASTTPTHQCSTSFDSDEFTPDQHFVAQILIDLPLLIQQSEFSLGLSPSWSLRRKRSAVNSPPDSASVIPQPPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKP
KVTKKKASLHKRFQYLETIDKLTHRNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSKIPEIGTSSSIAMEMAKLTVKSSASNLESHLHQCKPSIMNQTAP
MAEQSNKHFIAQILLELPLLIQQSHLSLGLSPSWPLRRKRSAVNSPPDSASVIPQRPPPPSSSENVKESSPTTPLSLNSLPLSRSESDENNTKAKVSKKKASLDKRFQYL
ETIDKLTHQNQALEGDVEAMKQHYNHLKAINSTLKAKKQEMILGGSNNQSEIPEIGTSASNMENHLHQREPSIKNQTAPMADQSNSNQNFQIPIGAIPLYDPSFGPMGIP
DLNLTIDEIDFSKYMAARARQIRIRIWKNKNKNNNNGAAKLHS
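
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes the standard nuclear genetic code (NCBI translation table 1), the names `CODON_TABLE` and `translate` are hypothetical, and unrecognized codons are reported as `X`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from the wrapped sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS sequence listed above.
print(translate("ATGGCTTCCACTACTCCCACTCATCAATGC"))
print(translate("AAATTGCATTCCTAA"))
```

The two fragments translate to the first ten residues (MASTTPTHQC) and the last four residues (KLHS) of the listed protein sequence, with the final TAA read as the stop codon. Passing the full CDS text to `translate` would allow the same comparison over the whole sequence.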