; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G004800 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G004800
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionARM repeat superfamily protein
Genome locationCG_Chr07:6187904..6190623
RNA-Seq ExpressionClCG07G004800
SyntenyClCG07G004800
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146451.1 uncharacterized protein LOC101212969 isoform X1 [Cucumis sativus]9.8e-9090.1Show/hide
Query:  MRIIASKLTT--HLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKR
        MRIIASKLTT  HLCRREPVRTLQFRTFSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKR
Subjt:  MRIIASKLTT--HLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKR

Query:  MGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV
        MGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEA GALHKAGAILVIKSTPDS EDMKVNEYKS+LMKRFRDLRYDV
Subjt:  MGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV

Query:  SS
        SS
Subjt:  SS

XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus]3.0e-9191Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRIIASKLTTHLCRREPVRTLQFRTFSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEA GALHKAGAILVIKSTPDS EDMKVNEYKS+LMKRFRDLRYDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

XP_022941868.1 uncharacterized protein LOC111447100 isoform X2 [Cucurbita moschata]1.6e-8788Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRII SKLT HLCRREP RTLQFR FSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS ED +VNEYKS+LMKRFRDL YDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

XP_023526983.1 uncharacterized protein LOC111790336 isoform X2 [Cucurbita pepo subsp. pepo]3.2e-8888.5Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRII SKLT HLCRREP RTLQFR FSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS EDM+VNEYKS+LMKRFRDL YDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida]1.4e-9191.5Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRIIASKLTTHLCRREPVRTLQFRTFSAYDER            EIEKEAERKVGWLLKLIFAGTATF+GYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS EDMKVNEYKS+LMKRFRDL YDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KM85 Uncharacterized protein1.5e-9191Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRIIASKLTTHLCRREPVRTLQFRTFSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLL+VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEA GALHKAGAILVIKSTPDS EDMKVNEYKS+LMKRFRDLRYDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

A0A1S3C497 uncharacterized protein LOC103496719 isoform X29.9e-8888Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRIIASKLT HLCRREPVRTLQFRTFSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSV+LL+VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDD+RRMKIVE GGAQELLNML  AKDDRTRKEALKAL+AIS SDEAVG LHKAGAILVIKSTPDS EDMKVNEYKS+LMKRFRDLRYDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

A0A6J1FNN3 uncharacterized protein LOC111447100 isoform X12.4e-8687.13Show/hide
Query:  MRIIASKLTT--HLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKR
        MRII SKLT   HLCRREP RTLQFR FSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKR
Subjt:  MRIIASKLTT--HLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKR

Query:  MGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV
        MGASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS ED +VNEYKS+LMKRFRDL YDV
Subjt:  MGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV

Query:  SS
        SS
Subjt:  SS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X27.6e-8888Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRII SKLT HLCRREP RTLQFR FSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS ED +VNEYKS+LMKRFRDL YDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

A0A6J1K968 uncharacterized protein LOC111491787 isoform X21.3e-8788Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRII SKLT HLCRREP RTLQFR FSAYDER            EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNLLQQSVSLLQVKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRLARFSIDDE+RMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAIS SDEAVGALHKAGAILVIKSTPDS ED +VNEYKS+LMKRFRDL YDVSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56210.1 ARM repeat superfamily protein1.6e-5856.5Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRI+ +++  H CR       + R FS+ +++DD         L +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S+S EA   L   GA+ ++KSTP+S ED  ++ YKS+++++  +    VSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein1.6e-5856.5Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRI+ +++  H CR       + R FS+ +++DD         L +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S+S EA   L   GA+ ++KSTP+S ED  ++ YKS+++++  +    VSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein1.6e-5856.5Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRI+ +++  H CR       + R FS+ +++DD         L +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS
        ASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S+S EA   L   GA+ ++KSTP+S ED  ++ YKS+++++  +    VSS
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein5.2e-5755.94Show/hide
Query:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG
        MRI+ +++  H CR       + R FS+ +++DD         L +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS+SLL VKDPLFKRMG
Subjt:  MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMG

Query:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRS--DEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV
        ASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+TRKEALKAL A+S+S   EA   L   GA+ ++KSTP+S ED  ++ YKS+++++  +    V
Subjt:  ASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRS--DEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDV

Query:  SS
        SS
Subjt:  SS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCATAATCGCATCGAAGCTAACCACTCATCTGTGCAGAAGGGAACCTGTGCGAACCCTGCAATTTCGCACCTTTTCAGCTTACGACGAAAGAGATGATGCA
CTTAAAATGGGATTTGAAGGTTCCTTAGAGATCGAGAAGGAGGCTGAAAGAAAAGTAGGATGGTTATTAAAACTAATCTTTGCTGGGACTGCGACATTTCTGGGT
TACCAGATTTTTCCATACATGGGAGATAACTTGCTGCAACAATCTGTGTCGCTCTTGCAAGTCAAGGATCCACTGTTTAAGAGGATGGGAGCATCTAGATTAGCT
CGCTTTTCGATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGTGCCCAAGAGCTCTTAAACATGCTCGGGGCTGCCAAAGATGACCGGACACGTAAG
GAAGCTTTGAAGGCTTTACATGCCATCTCACGTTCAGATGAAGCTGTTGGTGCGTTGCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACTCCAGATTCGGGT
GAAGATATGAAAGTGAATGAGTACAAGTCAGACCTAATGAAGAGATTTAGAGATCTTAGATATGATGTTTCATCTTGA
mRNA sequenceShow/hide mRNA sequence
TAATTTTTTTTTAAATGAGTCATTTCATAGTTGGCCCGAAACTCAAAGCCACAAATCTATAATAAGCGACGGAGCGGGTGGCAGTTCACAAACGGGGTTAGACCC
ACAGTGGCGGAGGCCGGAGCCGGTGGGTGATTCTTTGTTCTTCACCATGCGCATAATCGCATCGAAGCTAACCACTCATCTGTGCAGAAGGGAACCTGTGCGAAC
CCTGCAATTTCGCACCTTTTCAGCTTACGACGAAAGAGATGATGCACTTAAAATGGGATTTGAAGGTTCCTTAGAGATCGAGAAGGAGGCTGAAAGAAAAGTAGG
ATGGTTATTAAAACTAATCTTTGCTGGGACTGCGACATTTCTGGGTTACCAGATTTTTCCATACATGGGAGATAACTTGCTGCAACAATCTGTGTCGCTCTTGCA
AGTCAAGGATCCACTGTTTAAGAGGATGGGAGCATCTAGATTAGCTCGCTTTTCGATTGATGATGAAAGAAGGATGAAAATAGTGGAGATAGGTGGTGCCCAAGA
GCTCTTAAACATGCTCGGGGCTGCCAAAGATGACCGGACACGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCACGTTCAGATGAAGCTGTTGGTGCGTTGCA
TAAAGCAGGGGCAATCTTGGTTATTAAATCTACTCCAGATTCGGGTGAAGATATGAAAGTGAATGAGTACAAGTCAGACCTAATGAAGAGATTTAGAGATCTTAG
ATATGATGTTTCATCTTGACAAGTAAAAATATTATTTCTTTCTATATTATTATTATTATTTTGAGAGAGAGAACAACACTTACTAGTGACGTGAGGAGGACGGTA
CTGCAGCTTAATGTCGCCGGAGGGTGGAGGTAGGTTGGGGGTGAGATGGATTGAAGGAGATGGGATGGAGAGAGGAATCGTGAGAGAGGCCTAGGAGGGAGAGGA
GGGTCGGGGGTGTTTGAAAGAGAAACCTTTTCCTTTTTTGAAATTTAATCATAATTTTGCTCATA
Protein sequenceShow/hide protein sequence
MRIIASKLTTHLCRREPVRTLQFRTFSAYDERDDALKMGFEGSLEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLLQQSVSLLQVKDPLFKRMGASRLA
RFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTRKEALKALHAISRSDEAVGALHKAGAILVIKSTPDSGEDMKVNEYKSDLMKRFRDLRYDVSS