CuGenDBv2

Clc09G05670 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G05670
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: GATA transcription factor-like protein
Genome location: ClcChr09:4474452..4476101
RNA-Seq Expression: Clc09G05670
Synteny: Clc09G05670
Gene Ontology terms: NA
InterPro domains: NA
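The genome location field packs a chromosome name and a coordinate range into one string. A minimal Python sketch of splitting it apart is shown here; the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location."""
    chromosome: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'ClcChr09:4474452..4476101'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GenomeLocation(chromosome, start, end)


loc = parse_location("ClcChr09:4474452..4476101")
print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1650 bp on ClcChr09.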


Homology
GenBank top hits
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.4e-103 | % identity: 81.93
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        M SRLTAIA K NW FSLAQFQRLRR GLTTCRTADPSVHANDDN PAV SGE E+SQDNLEPD+AK+NYER+DSKQ DSNGPF P KAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRA STV+DDVSC+G  GGP    + NR  + K+QE+D ++YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSL+RATEIWKQNAMRGDPDAPQSRVLRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

XP_022991237.1 uncharacterized protein LOC111487953 [Cucurbita maxima] | e value: 3.2e-103 | % identity: 82.77
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        M SRLTAIA K NWAFSLAQFQRLRR GLTTCRTADPSVHANDDN PAV SGE E+SQDNLEPD AKANY  +DSKQ DSNGPF P KAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRA STV+ DVSC+G  GGP  E + NR  + K+QEED ++YYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVI WLPEQ D
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo] | e value: 1.9e-103 | % identity: 82.35
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        M SRLTAIA K NW+FSLAQFQRLRR GLTTCRTADPSVHANDDN PAV SGE E+SQDNLEPD+AKANYER+DSKQ DSNGPF P KAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRA STV+ DVSC+G  GGP    + NR  + K+QEED ++YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKDVIGWLPEQ D
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSLRRA EIWKQNAMRGDPDAPQSRVLRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

XP_038899333.1 uncharacterized protein LOC120086662 isoform X1 [Benincasa hispida] | e value: 1.6e-110 | % identity: 86.97
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        MQSRLTAIAPKSNWA SLAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGE E SQDNLEPDN KANYER+D K  DSNGPFG  KAQ+ASSPRLET 
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         VGQASKPITQQKR QSTV D+VSC+GVYGGPLE+ K NR TE K+QEEDN+DYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSLRRATEIWKQNAMRGDPDAPQSR+LRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

XP_038899334.1 uncharacterized protein LOC120086662 isoform X2 [Benincasa hispida] | e value: 4.3e-108 | % identity: 86.13
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        MQSRLTAIAPKSNWA SLAQFQRLRR  LTT RTADPSVHANDDNDPAVLSGE E   DNLEPDN KANYER+D K  DSNGPFG  KAQ+ASSPRLET 
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         VGQASKPITQQKR QSTV D+VSC+GVYGGPLE+ K NR TE K+QEEDN+DYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIGWLPEQLD
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSLRRATEIWKQNAMRGDPDAPQSR+LRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

TrEMBL top hits
A0A0A0K9G7 Uncharacterized protein | e value: 4.2e-101 | % identity: 81.56
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASS
        MQS L AIAPKSNWAF + QFQ LRRGG  LTT RTADPS+HAN   DDNDPAVLSGE ERSQDNLEPDNAKANY+ R+D KQ DS GPFG   AQ+ASS
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASS

Query:  PRLETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW
        PRLETT VGQASKPITQQKRA S  IDDVSC+GVYGGPLE+ K NR TE K++EEDN+DYYKHHKASPLAEIEFADTRKPITRATDGTAYDG    VIGW
Subjt:  PRLETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGW

Query:  LPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        LPEQ+DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  LPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

A0A1S3BR22 uncharacterized protein LOC103492778 | e value: 1.7e-102 | % identity: 80.8
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASSPRL
        MQSRL AIAP+SNWA  + QFQ LRRGGLTT RTADPSVHANDD  NDP+VLSGE ERSQDNLEPDNAKANYE R+D KQ DSNGPFGP KAQ+ASSPRL
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASSPRL

Query:  ETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQE---------EDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG
        ETT VGQASKPITQQKRA S  IDDVSC+GVYGGPLEE K +R TE KD+E         EDN+DYYKHHKASPLAEIEF DTRKPITRATDGTA  G G
Subjt:  ETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQE---------EDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAG

Query:  KDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        K VIGWLPEQ+DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE F
Subjt:  KDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

A0A5D3D3D5 Uncharacterized protein | e value: 4.2e-101 | % identity: 80.32
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASSPRL
        MQSRL AIAP+SNWA  + Q Q LRRGGLTT RTADPSVHANDD  NDP+VLSGE ERSQDNLEPDNAKANYE R+D KQ  SNGPFGP KAQ+ASSPRL
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDD--NDPAVLSGEHERSQDNLEPDNAKANYE-REDSKQEDSNGPFGPHKAQYASSPRL

Query:  ETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQE--------EDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK
        ETT VGQASKPITQQKRA S  IDDVSC+GVYGGPLEE K +R TE KD+E        EDN+DYYKHHKASPLAEIEF DTRKPITRATDGTA  G GK
Subjt:  ETTAVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQE--------EDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGK

Query:  DVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
         VIGWLPEQ+DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE F
Subjt:  DVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

A0A6J1GPT4 uncharacterized protein LOC111456388 | e value: 1.7e-102 | % identity: 80.67
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        M SRLTAIA K NW FSLAQFQRLRR GLTTCRTADPSVHANDDN PAV SGE E+SQDNLEPD+AK+NYER+DSKQ DSNGPF P KAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRA STV+ DVSC+G  GGP    + NR  + K+Q++D ++YYKHHKASPLAEIEF DTRKPITRATDGTAYDG GKD+IGWLPEQ D
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSL+RATEIWKQNAMRGDPDAPQSRVLRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

A0A6J1JL82 uncharacterized protein LOC111487953 | e value: 1.5e-103 | % identity: 82.77
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT
        M SRLTAIA K NWAFSLAQFQRLRR GLTTCRTADPSVHANDDN PAV SGE E+SQDNLEPD AKANY  +DSKQ DSNGPF P KAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETT

Query:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD
         V QASKPITQQKRA STV+ DVSC+G  GGP  E + NR  + K+QEED ++YYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVI WLPEQ D
Subjt:  AVGQASKPITQQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
        T +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF
Subjt:  TAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEQF

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G02700.1 unknown protein | e value: 1.5e-50 | % identity: 48.78
Query:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLET
        MQSRL A A  +         +RL  G  T+ RTADP +HA ND  DPA+   + E   D   P  A      +  +      P  P K+  A++ +LE+
Subjt:  MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLET

Query:  TAVGQASKPITQQKRAQSTV----IDDVSCVGVYGG--PLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIG
        T VG  S+P  QQKR  ST     +D VSC G+ G   P +E +   +   +D+ E ++++YKHHKASPL+EIEFADTRKPIT+ATDGTAY  AGKDVIG
Subjt:  TAVGQASKPITQQKRAQSTV----IDDVSCVGVYGG--PLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEQF
        WLPEQLDTAE+SL +AT I+K+NA RGDP+  P SR+LR +RGE F
Subjt:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEQF


Sequences
CDS sequence
ATGCAGTCGAGACTGACGGCGATCGCACCGAAATCGAATTGGGCCTTTTCTCTGGCCCAATTCCAACGCCTCCGGCGAGGTGGTCTGACGACATGTCGTACTGCTGACCC
TTCCGTTCACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACACGAGAGATCACAGGATAATTTAGAGCCGGATAATGCGAAGGCCAATTACGAAAGGGAGG
ACTCTAAACAGGAAGATTCAAATGGGCCATTTGGGCCACATAAGGCCCAATACGCTTCCTCCCCTCGGTTAGAAACCACTGCAGTGGGCCAGGCCTCAAAGCCCATTACT
CAGCAAAAGAGAGCCCAGAGTACGGTGATCGACGACGTGAGTTGCGTCGGCGTTTACGGCGGGCCTTTGGAGGAGGCGAAAACGAACAGAAGAACTGAAACGAAAGATCA
GGAGGAAGACAATAAAGACTATTACAAGCACCACAAGGCGTCTCCGTTGGCGGAGATCGAGTTTGCTGATACGCGTAAGCCGATAACCAGAGCGACGGACGGGACGGCGT
ACGATGGGGCCGGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCGCTTCGGAGAGCGACGGAGATTTGGAAACAAAATGCAATGCGTGGA
GATCCGGATGCTCCACAGTCGAGGGTTCTTAGGGCTTTGCGTGGTGAACAGTTTTAA
mRNA sequence
TTTTCCTTTTAAGACCTCGCCGCAAACTAACATTTGGAACTGAACCATGCAGTCGAGACTGACGGCGATCGCACCGAAATCGAATTGGGCCTTTTCTCTGGCCCAATTCC
AACGCCTCCGGCGAGGTGGTCTGACGACATGTCGTACTGCTGACCCTTCCGTTCACGCCAACGACGACAACGACCCCGCCGTTTTATCCGGTGAACACGAGAGATCACAG
GATAATTTAGAGCCGGATAATGCGAAGGCCAATTACGAAAGGGAGGACTCTAAACAGGAAGATTCAAATGGGCCATTTGGGCCACATAAGGCCCAATACGCTTCCTCCCC
TCGGTTAGAAACCACTGCAGTGGGCCAGGCCTCAAAGCCCATTACTCAGCAAAAGAGAGCCCAGAGTACGGTGATCGACGACGTGAGTTGCGTCGGCGTTTACGGCGGGC
CTTTGGAGGAGGCGAAAACGAACAGAAGAACTGAAACGAAAGATCAGGAGGAAGACAATAAAGACTATTACAAGCACCACAAGGCGTCTCCGTTGGCGGAGATCGAGTTT
GCTGATACGCGTAAGCCGATAACCAGAGCGACGGACGGGACGGCGTACGATGGGGCCGGGAAGGATGTGATTGGGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTC
GCTTCGGAGAGCGACGGAGATTTGGAAACAAAATGCAATGCGTGGAGATCCGGATGCTCCACAGTCGAGGGTTCTTAGGGCTTTGCGTGGTGAACAGTTTTAAGTAAAAT
GGATTAAAAATTTTACAAGTATGTAAGCAAATTTTAGCGAGAATAGTTTAAAATGAAGGTGTTTCTGTTGATGACCCTACAAAGAATCGATCCTGCACTATTTCTTTTTT
TTTTAGAAGAAAAAAAGAGACTTTTTGAGAGTAGCTGGCAGGGCTCCTTTATACCCACCGTCTTCCATATACCTGATTCCGATGCTCGACCGATGTGGGCCAAGTTGATT
CCTCCACAGCCGCACCATTGTTTCTAGGTGGCTTCAAAAGAAATAAAAACCTTGATGTAAATGTGAGTCTTACCCTATTTCTAATAGTGCGTTGTGTTGATATGATGATA
AAAGCGAGTTGGTGTGATTATAACAAATAAGGGTAGGTAGTATTTGTAGATTTTATATTTTAATATTAGTGAGTCTTATTTTATTTTTAGTAGTACATGACCAAGTTATG
CATACTTTACTGGAATGGTAAATTTTCCAAATATGAAGATAAAACAAACAAATCACTAATTCAATCATTTTATTAATCTTT
Protein sequence
MQSRLTAIAPKSNWAFSLAQFQRLRRGGLTTCRTADPSVHANDDNDPAVLSGEHERSQDNLEPDNAKANYEREDSKQEDSNGPFGPHKAQYASSPRLETTAVGQASKPIT
QQKRAQSTVIDDVSCVGVYGGPLEEAKTNRRTETKDQEEDNKDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGAGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMRG
DPDAPQSRVLRALRGEQF
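The CDS and protein sequences above should be related by translation. The Python sketch below is one way to check that, assuming the standard genetic code applies to this nuclear gene. `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the conventional TCAG codon ordering.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
prefix = translate("ATGCAGTCGAGACTGACGGCGATCGCACCG")
print(prefix)
```

Pasting the full CDS, with its line breaks, into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.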