; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G008630 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G008630
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionZinc/iron-chelating domain protein
Genome locationCG_Chr04:23540098..23549763
RNA-Seq ExpressionClCG04G008630
SyntenyClCG04G008630
Gene Ontology termsNA
InterPro domainsIPR005358 - Putative zinc- or iron-chelating domain containing protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142095.1 uncharacterized protein LOC101220204 [Cucumis sativus]2.3e-7887.73Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MY AVAPP+VTVTA  RP +   TKD K TQGRNINVGFGGKRKE+LWQC+EGC ACCKLAKGPSFA+PEEIFQNTSDIELYKSLIG DGWCIHYEK++R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSS

XP_008447323.1 PREDICTED: uncharacterized protein LOC103489795 [Cucumis melo]1.9e-8089.63Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MY AVAPPQVTVTA R+PQ+I  TKD+K TQGRN NVGFGGKRKEQLWQC+EGC ACCKLAKG SFASPEEIFQNTSDIELYKSLIG DGWCIHYEK++R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSSE
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE

XP_022143803.1 uncharacterized protein LOC111013629 isoform X1 [Momordica charantia]4.0e-7887.65Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        M QAVAPP+VTVTA RRPQQI  TKDNK  +GR+INVGFGGKRKEQLWQCVEGC ACCKLA GPSFA+PEEIF+N+SDIELYKSLIGADGWCIHYEKS+R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNK ACSSCRDTIKA+YGF SKEL+NFNKAVQS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS

XP_022143805.1 uncharacterized protein LOC111013629 isoform X2 [Momordica charantia]2.9e-7687.04Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        M QAVAPP+VTVTA RRPQQI  TKDNK  +GR+INVGFGGKRKEQLWQCVEGC ACCKLA GPSFA+PEEIF+N+SDIELYKSLIGADGWCIHYEKS+R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNK AC SCRDTIKA+YGF SKEL+NFNKAVQS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS

XP_038883478.1 uncharacterized protein LOC120074431 [Benincasa hispida]6.3e-8493.9Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MYQAVAPPQV++TA RRPQQI TTKD KATQGRNINVGFG KRKEQLWQCVEGC ACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKS+R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSSE
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE

TrEMBL top hitse value%identityAlignment
A0A0A0KX40 Uncharacterized protein1.1e-7887.73Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MY AVAPP+VTVTA  RP +   TKD K TQGRNINVGFGGKRKE+LWQC+EGC ACCKLAKGPSFA+PEEIFQNTSDIELYKSLIG DGWCIHYEK++R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSS

A0A1S3BH68 uncharacterized protein LOC1034897959.2e-8189.63Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MY AVAPPQVTVTA R+PQ+I  TKD+K TQGRN NVGFGGKRKEQLWQC+EGC ACCKLAKG SFASPEEIFQNTSDIELYKSLIG DGWCIHYEK++R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSSE
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE

A0A5A7TWI1 Flagellin N-methylase9.2e-8189.63Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        MY AVAPPQVTVTA R+PQ+I  TKD+K TQGRN NVGFGGKRKEQLWQC+EGC ACCKLAKG SFASPEEIFQNTSDIELYKSLIG DGWCIHYEK++R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNKAACSSCRDTIKAIYGFSSKEL+NFNKAVQSSE
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSE

A0A6J1CPU4 uncharacterized protein LOC111013629 isoform X21.4e-7687.04Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        M QAVAPP+VTVTA RRPQQI  TKDNK  +GR+INVGFGGKRKEQLWQCVEGC ACCKLA GPSFA+PEEIF+N+SDIELYKSLIGADGWCIHYEKS+R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNK AC SCRDTIKA+YGF SKEL+NFNKAVQS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS

A0A6J1CQF1 uncharacterized protein LOC111013629 isoform X11.9e-7887.65Show/hide
Query:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR
        M QAVAPP+VTVTA RRPQQI  TKDNK  +GR+INVGFGGKRKEQLWQCVEGC ACCKLA GPSFA+PEEIF+N+SDIELYKSLIGADGWCIHYEKS+R
Subjt:  MYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKSSR

Query:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS
        KCSIYADRPYFCRVESPVFEKLYGI+ENKFNK ACSSCRDTIKA+YGF SKEL+NFNKAVQS
Subjt:  KCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G02710.1 unknown protein9.2e-4955.36Show/hide
Query:  AVAPPQVTVTAGRRPQQIPTTKDNKATQGRN----INVGF-GGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKS
        + AP   T+ +  R  Q+   K  K             GF GG  KE  W+CVEGC ACCK+AK  SFA+P+EIF N  D+ELY+S+IG DGWC++Y+K+
Subjt:  AVAPPQVTVTAGRRPQQIPTTKDNKATQGRN----INVGF-GGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIFQNTSDIELYKSLIGADGWCIHYEKS

Query:  SRKCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSEVS
        +RKCSIYADRPYFCRVE  VF+ LYGI E KFNK A S C DTIK IYG  SKELD+FN+A++S+  S
Subjt:  SRKCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSEVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTGGCTAGAATAATTTCTGAAACATGTTTGCTATTGAAATCCTTTTCTGTCTTTCTACGTTTGAGGCCATGTTCAGTTGCGACGCTTATCCAAAACTCCAAGAC
AATGTACCAGGCGGTGGCTCCGCCGCAAGTCACCGTAACCGCCGGTCGCCGACCGCAGCAGATTCCAACGACGAAGGATAATAAGGCAACACAAGGCCGGAACATCAATG
TAGGGTTTGGGGGTAAACGAAAGGAGCAATTATGGCAGTGCGTCGAGGGATGTAGCGCCTGCTGCAAGCTCGCCAAGGGGCCGTCCTTCGCCTCGCCGGAGGAAATCTTC
CAAAATACTTCCGATATTGAGCTCTATAAAAGCTTGATTGGCGCAGATGGATGGTGCATTCACTACGAGAAAAGCTCGCGTAAATGCTCCATTTATGCCGATCGCCCTTA
TTTTTGCCGCGTAGAGTCTCCTGTATTTGAGAAGTTGTATGGAATCAGAGAAAACAAATTCAATAAGGCTGCTTGCAGTAGCTGCAGGGACACTATAAAAGCAATTTATG
GTTTCTCGTCCAAGGAATTGGACAACTTCAACAAAGCAGTTCAAAGCTCCGAGGTCTCAGAGAGTAAGCTTCAACATAACATCGCTTGTACTTCAGAAGATAAGAAAAAT
GGTGAATGGCACATGATAACCACTATGGGGACGGTTTGCTCAAAGGATATCTCTACCAGAAAGCTAAGAGGAAAAAGAATTCTTATCCTGCAGTTGGAGGCAAACTACTT
TCTTCTTCTTCACTCTCTCTCAGGGAAAGAAATTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATTCTTCACCCGCCCCCCTTCCCCATCTATGTTCTAACCGTCCTATATTCGTTCTCTCTTTCTTCTCGAACCTCGCCGCTGAACTCTTCGCTCGGGACGATCTCCGGCGG
CATCATAATGGCGAATTCACCCTTCCGCCTCCCTTTCCTCCTCCTCCTCCTCGGCCTCCTCGCCACCTCCTCCACCGCCGAGATCAAATCCCTCAAGATCTCCTCCGACA
ACCGTCCCATGATTCTCTTCGAGAAATTTGGATTCACTCACACCGGCCAGGTCTCCATTTCCGTCAAATCCGTCTCGGTTACTACCTCCCTTGCTGAAACCGATCCTTCT
CGCCTTGGATTCTTCCTCTTATCTGAAGAATCGCTTCTTCAAGTTCTTCTCGAGATCCAGCAAAACCCTCAGTTTTGCGTTCTTGATTCTCGCTACATCCATCGCCTCTT
TACCTTCCGAGATCTCTCTCCGCCGCCTCAGAGCTCCTTCAGCCATTCCTACCCTGTTACTGCCCCCAATGAATATAGCCTTTTCTTTGCGAATTGCGCCCCGGAATCCG
CAGTTTCTATGCAGGTGCGAACGGAGGTTTTCAATTTGGACCGCGATGGTTCCAAGGATTATCTCTCCGCTGGTCTTACTCAGCTCCCTTCGCTCTATTTTGTGTTCTCT
CTTGCCTATCTTGCCTTTTTAGGGCTTTGGATCTACGCGGGTATCACGAATAAGCGCAGCGTTCATAGGATCCACTTGTTGATGGGTGGATTGTTGTTGATGAAAGCGTT
GAATCTCATTTGCGCTGCTGAGGATAAGCATTACGTCAAAATCACCGGAACGCCTCATGGTTGGGATGTGTTGTTTTACATCTTTCAGTTTATTCGGGTTGTTTTGCTTT
TCACGGTGATTGTTTTGATCGGAACTGGGTGGTCATTTTTGAAGCCCTTCTTGCAAGAGAAGGAGAAGAAGGTGTTGATGATTGTGATCCCACTTCAAGTCTTGGCTAAT
GTGGCCTCGGTCGTGATCGGCGAGACTGGGCCATTTATTAAGGATTGGGTCACTTGGAATCAGGTTTTCTTGTTGGTGGATATCATATGTTGCTGTGCCATAATTTTCCC
CATTGTTTGGTCAATTCGATCTCTGAGAGAGACATCGAAAACCGACGGGAAGGCTGCAAGGAATTTGGCAAAGCTCACTCTTTTCAGGCAGTTCTACATTGTTGTGATTG
GGTATTTGTATTTTACTCGGATTGTTGTTTTCGCGCTCAAAACCATTGCAGCATACAAGTATCAGTGGGTGAGTAATGCAGCTGAAGAGATTGCAAGCCTTGTCTTTTAC
ATGGTGATGTTCTATATGTTTAGGCCTGTGGAGAGAAATGAATACTTTGTTCTCGATGAGGAAGAAGAAGAGGCTGCAGAGTTGGCTCTCAGGGACGAAGAATTTGAGCT
TTGAAATGGGCATTGAAGAGTATTTTCTAGTTCTTATTGTCATCTTTTGCTTCTGCCATACAAATGCAAGTGATATAGCTGATTTACAAATAGAAATTATTGAAGTCGGG
TCTGCTGCTCATTTTTCTGATGTTTTTTTTTTGCCCTCCTTTCAACTCGTGTAGAAACTTGAAATTGTTCTCTGCTTTTTGTTATCCATGTTTATTTTTCACGTCATGCT
ATTAGTGACTGCCATATATAGAGTTCGCTCGAACATGAATTGTTGAAAGCCTCTTTGTTCTGTTCCTCTGTCATTATCTTGCTCTCCTGATCCTTCTTATTGTTGGAGAT
CATGAGTTAGCAGGCAATTTGGTTGAATTCATTGCAAAATGATTATTAAGTTTGTAGTTGCAGGGGAAGTTACAGAGCCAAATGGCCCTTCTGTTGCTTGCTGTAGGAAG
TGGCATCTATATAATCCTTTAACTCGTATCTTTAGCTTTTCTTTTGTTATACAATCTTATATGTGTGCTTGTTATTCCTAGCTTGGTTTTTCATTTTTACTCTTTGGTTA
CTCTTTGAAGGCATGTTCAAGAGTAATTTTGAAACAGTTAAAATCACTTCTATCATATTCAAAATCACTTTCAAACATAGTTTTAATATTTCAAAATCAATATAATGTTT
GATTTTACACTTTTTAATACGATTTGCATACTATCTAATTTAAAGTATGTTTCAAAGTAATGTTAATCATGACAACTGATTTAAAGTATGTTTCAAAATAATGTTGAACA
TGACAACTGATTTTAACTATTTCATTCAAGCCACTCTCAAACATTGCCTCAAATCTCTTTCTGGTGCAGATTCATGGTGGTTCCATGGAGTATTGTTGTTAGGTCTCCAA
AGAGTTTTTCCCATTTTGCAGAGTTAATTTGATTATATTATGTTTGAGCTGAAAATTGTTCTTGTTTCTTCAGAAAATTGTCGGTGAGACTATGGAGTTGGCTAGAATAA
TTTCTGAAACATGTTTGCTATTGAAATCCTTTTCTGTCTTTCTACGTTTGAGGCCATGTTCAGTTGCGACGCTTATCCAAAACTCCAAGACAATGTACCAGGCGGTGGCT
CCGCCGCAAGTCACCGTAACCGCCGGTCGCCGACCGCAGCAGATTCCAACGACGAAGGATAATAAGGCAACACAAGGCCGGAACATCAATGTAGGGTTTGGGGGTAAACG
AAAGGAGCAATTATGGCAGTGCGTCGAGGGATGTAGCGCCTGCTGCAAGCTCGCCAAGGGGCCGTCCTTCGCCTCGCCGGAGGAAATCTTCCAAAATACTTCCGATATTG
AGCTCTATAAAAGCTTGATTGGCGCAGATGGATGGTGCATTCACTACGAGAAAAGCTCGCGTAAATGCTCCATTTATGCCGATCGCCCTTATTTTTGCCGCGTAGAGTCT
CCTGTATTTGAGAAGTTGTATGGAATCAGAGAAAACAAATTCAATAAGGCTGCTTGCAGTAGCTGCAGGGACACTATAAAAGCAATTTATGGTTTCTCGTCCAAGGAATT
GGACAACTTCAACAAAGCAGTTCAAAGCTCCGAGGTCTCAGAGAGTAAGCTTCAACATAACATCGCTTGTACTTCAGAAGATAAGAAAAATGGTGAATGGCACATGATAA
CCACTATGGGGACGGTTTGCTCAAAGGATATCTCTACCAGAAAGCTAAGAGGAAAAAGAATTCTTATCCTGCAGTTGGAGGCAAACTACTTTCTTCTTCTTCACTCTCTC
TCAGGGAAAGAAATTAGATAAAGATACTTTGACAACTAACCTGGTGATACCTTTCCCCTTTTTTATTTTTTTGCTGCTTTTCATCCTGCCCCCAAAGAAAGCTGTTAAGA
ATATTCTCTTATGCAAGCTAAAAATGTGAGATATCTAAATTCATTTCTTACTTCAAAAAGATGTTAGAAGTTACCATTTCAACTGTTAATCACTAATTTACCCAAAAGTT
TAAACTTATGGGTGAAGGTAAATTTAATATTGTAGTAGAAATTAATATTAATTGGAAAGGAAATAATATGGTAGAGGTTTGAACACATGACCTCCTAAATCACCTATTCT
AGTACCATGTTGAATCACTGATTGACCCAAAAGGTTAAAAGGTTAAACTTATGATTGAAGGTAAATTTAATATTATATCATCTAATACTGACCTATTTGTAGAACCATCC
TTTGTAAGGAGGTTGTGTGAAGGTTTTGAGTGGAAATGAGCAATATTTCCTCGATAAATGGTTGAAGGAAGAAAAAGAAAACAAGCTACAACTATCCTACAGCGGAGTTC
GGTTTCATGAGCCAACACCAGTCTGATGGAGATCCAATTGGAGACAATCCTGAGGGTGAACAGACTTACCACAGTTATTATGGATGGTTGTTGAGTGAACTCGACTCCAC
TGATCTCTTCGATTGCCAAGACCCTTATGGCAATCAAAATGGCTGCAGAATGGAGAATACAGAACTCATTCTACTCCAGAATGAGGATCAAGTTGAGGATTATAATGGTG
ATTCTAGAAACTTGTACTATGAATTTACTTGGGATTGCTTTCAGTTATGGTTTGGACTTGGGGAAGATGAGCAAGGTCAAGGCTGGAATGGGCAAGAATTGAAACCTGAA
AACGACTGCAATTCTGATGCTTTGGCACTCCTAGAAGCCGTCCTGGGCTATTGACATTGGCATATCACCTTTTACTTTTCATCAAACACCAATTGGCTTACTGCAATCTA
ATGGATAGTTCTTTATTTCTTTTGAAGTGTTATTTTACCAGAAATAATAACAAATTTTCCCTAATTTGTTACATTAAAGTAAACAAAATTCAGACAGCTATCCTGTGTGA
TCAAAGAATAATGCAG
Protein sequenceShow/hide protein sequence
MELARIISETCLLLKSFSVFLRLRPCSVATLIQNSKTMYQAVAPPQVTVTAGRRPQQIPTTKDNKATQGRNINVGFGGKRKEQLWQCVEGCSACCKLAKGPSFASPEEIF
QNTSDIELYKSLIGADGWCIHYEKSSRKCSIYADRPYFCRVESPVFEKLYGIRENKFNKAACSSCRDTIKAIYGFSSKELDNFNKAVQSSEVSESKLQHNIACTSEDKKN
GEWHMITTMGTVCSKDISTRKLRGKRILILQLEANYFLLLHSLSGKEIR