CuGenDBv2

Clc04G08990 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G08990
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein MODIFIER OF SNC1 11-like
Genome location: ClcChr04:22655570..22659332
RNA-Seq Expression: Clc04G08990
Synteny: Clc04G08990
Gene Ontology terms: GO:0016973 - poly(A)+ mRNA export from nucleus (biological process)
                     GO:0005634 - nucleus (cellular component)
InterPro domains: IPR040746 - THO1_MOS11, C-terminal domain
                  IPR044209 - Protein MODIFIER OF SNC1 11
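
The genome location string can be split into a sequence name and coordinates for downstream use. The following is a minimal sketch, assuming the `name:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'name:start..end' string into (name, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr04:22655570..22659332")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 3763 bp on ClcChr04.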


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008447291.1 PREDICTED: protein MODIFIER OF SNC1 11-like [Cucumis melo] (e-value: 5.3e-91; identity: 93.63%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA
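
The % identity column can be checked against the alignment itself. In BLAST-style output the middle row shows a letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap, and identity is the number of identical columns divided by the alignment length. The sketch below counts letters in the middle rows; `percent_identity` is a hypothetical helper, and it takes the alignment length from the query rows because leading or trailing spaces in a middle row are easily lost when text is copied.

```python
def percent_identity(query_rows: list[str], midline_rows: list[str]) -> float:
    """Percent identity = identical columns / alignment length.

    query_rows   : the 'Query:' sequence rows (gaps included), label stripped
    midline_rows : the matching middle rows; letters mark identical columns
    """
    aligned_length = sum(len(row) for row in query_rows)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / aligned_length, 2)


# Short illustration using the first 15 columns of the alignment above.
print(percent_identity(["MATVASAGLNDINNN"], ["MATVA AGLNDI+N+"]))
```

For the full Cucumis melo alignment above, the middle rows appear to contain 191 letters over 204 columns, which is consistent with the reported 93.63%.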

XP_022949856.1 protein MODIFIER OF SNC1 11-like [Cucurbita moschata] (e-value: 4.6e-87; identity: 90.2%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KNPLGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SV  DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPTSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_022977777.1 protein MODIFIER OF SNC1 11-like [Cucurbita maxima] (e-value: 3.0e-86; identity: 89.71%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KN LGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SV  DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPTSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_031740008.1 protein MODIFIER OF SNC1 11 [Cucumis sativus] (e-value: 1.7e-89; identity: 92.65%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+ TISSSAQ PASVPPSSTDGGSSKE DESK +GK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NSLTQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_038882639.1 protein MODIFIER OF SNC1 11 [Benincasa hispida] (e-value: 3.1e-91; identity: 94.12%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAGLNDI+N+K+PLGENPSQTLDPTISSSAQ PASVP SSTDGGSSK+RDESKGSGKASV  DGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGT TNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNS TQ++GKGNVETIADVAGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L0R4 Tho1_MOS11_C domain-containing protein (e-value: 8.3e-90; identity: 92.65%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+ TISSSAQ PASVPPSSTDGGSSKE DESK +GK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NSLTQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A1S3BHQ8 protein MODIFIER OF SNC1 11-like (e-value: 2.6e-91; identity: 93.63%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A5A7TZD8 Protein MODIFIER OF SNC1 11-like (e-value: 2.6e-91; identity: 93.63%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1GD78 protein MODIFIER OF SNC1 11-like (e-value: 2.2e-87; identity: 90.2%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KNPLGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SV  DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPTSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1IR28 protein MODIFIER OF SNC1 11-like (e-value: 1.5e-86; identity: 89.71%)
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KN LGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SV  DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPTSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

SwissProt top hits (e-value, % identity, alignment)
P82979 SAP domain-containing ribonucleoprotein (e-value: 3.0e-04; identity: 37.3%)
Query:  SGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF-
        + +  VV   + +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S  ++D+ K K R  RF 
Subjt:  SGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF-

Query:  -----SSTSKPDPQEEEKRKARAIRF
             + T   +  E +KRK RA RF
Subjt:  -----SSTSKPDPQEEEKRKARAIRF

Q5R4V4 SAP domain-containing ribonucleoprotein (e-value: 3.0e-04; identity: 37.3%)
Query:  SGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF-
        + +  VV   + +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S  ++D+ K K R  RF 
Subjt:  SGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF-

Query:  -----SSTSKPDPQEEEKRKARAIRF
             + T   +  E +KRK RA RF
Subjt:  -----SSTSKPDPQEEEKRKARAIRF

Q9LZ08 Protein MODIFIER OF SNC1 11 (e-value: 3.0e-28; identity: 46.77%)
Query:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVVDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-
        +N      GENP + +D   +   +T   +       +D G  KE  +S G G  + VD G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG 
Subjt:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVVDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-

Query:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTS-NSLTQVEGKGNVETIADVAGK
              NG   + K EE+KRKARA+RFG+ SA+ TTD   +EAKKKARLARF   +K D  EE KRKARA+RFS   S ++ + +  K  +   A V+G 
Subjt:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTS-NSLTQVEGKGNVETIADVAGK

Query:  A
        A
Subjt:  A

Arabidopsis top hits (e-value, % identity, alignment)
AT5G02770.1 unknown protein (e-value: 2.1e-29; identity: 46.77%)
Query:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVVDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-
        +N      GENP + +D   +   +T   +       +D G  KE  +S G G  + VD G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG 
Subjt:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVVDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-

Query:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTS-NSLTQVEGKGNVETIADVAGK
              NG   + K EE+KRKARA+RFG+ SA+ TTD   +EAKKKARLARF   +K D  EE KRKARA+RFS   S ++ + +  K  +   A V+G 
Subjt:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTS-NSLTQVEGKGNVETIADVAGK

Query:  A
        A
Subjt:  A


Sequences
CDS sequence
ATGGCGACGGTAGCATCTGCAGGACTAAATGACATTAACAATAACAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGACCCGACGATTTCTTCCTCGGCCCAGAC
TCCTGCCTCCGTTCCGCCTTCGTCAACTGATGGCGGGTCATCCAAGGAAAGGGATGAATCCAAGGGCTCCGGTAAGGCGTCGGTTGTAGATGATGGTGCACCGGTTTCTG
ATGTCCAGAGGAAAATGCGCCGCGCCGAGCGTTTTGGGATTTCAGTGCAATTGTCCGAAGAAGAAAAGCGTAATTCTCGAGCAGAACGGTTTGGTATGGGTACTACAACT
AATGGATTGGGAGCATCTAATAAGACGGAGGAGGTGAAGAGAAAGGCTAGAGCTGAGAGATTTGGACTTTCTGCATCTGTGACTACTGATGATGAGGCAAAGAAGAAAGC
TCGTCTTGCTAGGTTTTCATCAACATCAAAGCCAGACCCCCAGGAGGAAGAAAAGAGGAAAGCAAGGGCAATCAGGTTTTCTAATCCAACATCAAACTCTCTCACACAAG
TGGAAGGGAAAGGAAATGTTGAGACGATTGCAGATGTTGCAGGCAAGGCTGGAGGTGGTGCCTGA
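
The CDS above can be translated to check that it encodes the protein listed in this record. The sketch below is self-contained and assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not a function from any particular library. Whitespace is removed first because the sequence is wrapped across lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons of the CDS above.
print(translate("ATGGCGACGGTAGCATCTGCAGGACTAAATGACATT"))  # MATVASAGLNDI
```

The CDS is 615 nt long, that is 204 codons plus the terminal TGA stop, which agrees with the 204-residue alignments in the homology section.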
mRNA sequence
TATACCACTAAAACGAGATGTCCCAATGAATTTATAAAAGTAACAAAAACAAAAGAAAGAACCAATGAAAAACTAGCACGTATACACAGGACCGGAAAATATCCCTCGTT
CTCCTTCTGATAGGAATTTACATTATTAACCCTGCCCTTCTGATCCACGATGTGAAATATCTGACCAGAACCAAAAACGAATAGAAAGCTGTGCCCTCCCTTTCACGCGG
CAATTTCCCTTGCAGCTTTCGTGGTTCGCAAGCAGTTCCCGATTAGGTTTCCGGTCTCAAATCTGCTCCTTCAATGGCGACGGTAGCATCTGCAGGACTAAATGACATTA
ACAATAACAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGACCCGACGATTTCTTCCTCGGCCCAGACTCCTGCCTCCGTTCCGCCTTCGTCAACTGATGGCGGG
TCATCCAAGGAAAGGGATGAATCCAAGGGCTCCGGTAAGGCGTCGGTTGTAGATGATGGTGCACCGGTTTCTGATGTCCAGAGGAAAATGCGCCGCGCCGAGCGTTTTGG
GATTTCAGTGCAATTGTCCGAAGAAGAAAAGCGTAATTCTCGAGCAGAACGGTTTGGTATGGGTACTACAACTAATGGATTGGGAGCATCTAATAAGACGGAGGAGGTGA
AGAGAAAGGCTAGAGCTGAGAGATTTGGACTTTCTGCATCTGTGACTACTGATGATGAGGCAAAGAAGAAAGCTCGTCTTGCTAGGTTTTCATCAACATCAAAGCCAGAC
CCCCAGGAGGAAGAAAAGAGGAAAGCAAGGGCAATCAGGTTTTCTAATCCAACATCAAACTCTCTCACACAAGTGGAAGGGAAAGGAAATGTTGAGACGATTGCAGATGT
TGCAGGCAAGGCTGGAGGTGGTGCCTGAAAAGAACATTCCCTTTTGATGGCAAGATAGCTTTTGCTTGTAAATTTGTTTTGTGGTTCGTTTTTCTTTTTCTTTTTCTTTT
TAAAGTCTGATTGGCATTCCACCTTCAGAACTTTACATATCTTATCCAAATCTATATTGTTTCTTGTTGTACCTGCGTTATTGAATCTGATGATTGATTTCTTTCTCTGG
CAGGTCTAGTATCCTGGTTAGGATTCCCTGTAATGGCTTCCATACCCATGTGCATTCCTTTATAATATTATCTAGTTCAAAAGTCTTCCATTCTGAGGGCAGATTCCATA
CTGCAAGGTGCTGTACTACGTGCTGCTGTGAGACAAGGATAGGCCGCGTCTTTGGAGGTAGATAAATGAGGTAATAGAAGTCAGACTTCTCTGCAGGCCTCATACTCTCT
TAGAAGTCTGCATGTATGGAGTTCTGTAGCTGAGAATGGATCGTACTCTACAAATAGGTTTCTCGTCAAGTTTTTTTCTCTTTCTTTTCCTTTTATCTTTAATTGCTTAG
CTACTTAGCAGCTACATGTACTTAGTCTCTTTTGGATTTTTGGTTGGATGTAATGGTATGGAAGGACCGCAACATTTATCAAGGGAATCAATATTTTGATCAAAATTTCT
TTGTG
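
Since the mRNA contains the CDS plus untranslated regions, the UTR lengths can be obtained by locating the CDS inside the mRNA. This is a minimal sketch, assuming the CDS occurs in the mRNA as one exact, contiguous match; `utr_lengths` is a hypothetical helper name.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found within an mRNA."""
    mrna_seq = "".join(mrna.split()).upper()  # drop line breaks and spaces
    cds_seq = "".join(cds.split()).upper()
    start = mrna_seq.find(cds_seq)
    if start < 0:
        raise ValueError("CDS not found as a contiguous match in the mRNA")
    five_prime = start
    three_prime = len(mrna_seq) - start - len(cds_seq)
    return five_prime, three_prime


# Toy example: 4 nt of 5' UTR, a 9 nt CDS, 3 nt of 3' UTR.
print(utr_lengths("CCTT ATGGCGTGA AAG", "ATGGCGTGA"))  # (4, 3)
```

Passing the full mRNA and CDS strings from this record in place of the toy sequences gives the UTR lengths for Clc04G08990.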
Protein sequence
MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVVDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTTT
NGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTSNSLTQVEGKGNVETIADVAGKAGGGA