CuGenDBv2

CSPI04G14090 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G14090
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: protein MODIFIER OF SNC1 11-like
Genome location: Chr4:11952151..11955434
RNA-Seq Expression: CSPI04G14090
Synteny: CSPI04G14090
Gene Ontology terms: GO:0016973 - poly(A)+ mRNA export from nucleus (biological process)
                     GO:0005634 - nucleus (cellular component)
InterPro domains: IPR040746 - THO1_MOS11, C-terminal domain
                  IPR044209 - Protein MODIFIER OF SNC1 11
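
The genome location is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span covered by the gene model can be computed as follows. `parse_location` is a hypothetical helper name used only for illustration and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr4:11952151..11955434' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr4:11952151..11955434")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 3284 bp on Chr4, which includes any introns and untranslated regions.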


Homology
GenBank top hits (e value, % identity, alignment)
XP_008447291.1 PREDICTED: protein MODIFIER OF SNC1 11-like [Cucumis melo] | e value: 1.1e-96 | % identity: 97.55
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVAPAGLNDISNSKNPLGENPSQTLE TISSSAQ PASVPPSSTDGGSSKEGDESKC+GKPSA DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNS+TQVDGKGNVETIADVAGKS
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_022949856.1 protein MODIFIER OF SNC1 11-like [Cucurbita moschata] | e value: 4.0e-86 | % identity: 89.22
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AG+NDISNSKNPLGENPSQTL+ TISSSAQPP SVP SSTDGGSSKEGDESK + K S E DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPT NSLTQVDGKGN+ETIAD+AGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_022977777.1 protein MODIFIER OF SNC1 11-like [Cucurbita maxima] | e value: 3.7e-84 | % identity: 88.24
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AG+NDISNSKN LGENPSQTL+ TISSSAQPP SVP SSTDGGSSKE DESK + K S E DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPT NSLTQVDGKGN+ETIAD+AGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_031740008.1 protein MODIFIER OF SNC1 11 [Cucumis sativus] | e value: 8.2e-100 | % identity: 100
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_038882639.1 protein MODIFIER OF SNC1 11 [Benincasa hispida] | e value: 4.2e-88 | % identity: 91.67
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDISNSK+PLGENPSQTL+ TISSSAQPPASVP SSTDGGSSK+ DESK +GK S E DGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGT TNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPT NS TQ+DGKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0R4 Tho1_MOS11_C domain-containing protein | e value: 4.0e-100 | % identity: 100
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A1S3BHQ8 protein MODIFIER OF SNC1 11-like | e value: 5.4e-97 | % identity: 97.55
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVAPAGLNDISNSKNPLGENPSQTLE TISSSAQ PASVPPSSTDGGSSKEGDESKC+GKPSA DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNS+TQVDGKGNVETIADVAGKS
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A5A7TZD8 Protein MODIFIER OF SNC1 11-like | e value: 5.4e-97 | % identity: 97.55
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVAPAGLNDISNSKNPLGENPSQTLE TISSSAQ PASVPPSSTDGGSSKEGDESKC+GKPSA DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNS+TQVDGKGNVETIADVAGKS
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1GD78 protein MODIFIER OF SNC1 11-like | e value: 1.9e-86 | % identity: 89.22
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AG+NDISNSKNPLGENPSQTL+ TISSSAQPP SVP SSTDGGSSKEGDESK + K S E DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPT NSLTQVDGKGN+ETIAD+AGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1IR28 protein MODIFIER OF SNC1 11-like | e value: 1.8e-84 | % identity: 88.24
Query:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AG+NDISNSKN LGENPSQTL+ TISSSAQPP SVP SSTDGGSSKE DESK + K S E DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV TDDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSNPT NSLTQVDGKGN+ETIAD+AGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKS

Query:  GGGA
        GGGA
Subjt:  GGGA

SwissProt top hits (e value, % identity, alignment)
P82979 SAP domain-containing ribonucleoprotein | e value: 3.0e-04 | % identity: 39.47
Query:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF------SSTSKPD
        +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S  ++D+ K K R  RF      + T   +
Subjt:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF------SSTSKPD

Query:  PQEEEKRKARAIRF
          E +KRK RA RF
Subjt:  PQEEEKRKARAIRF

Q5R4V4 SAP domain-containing ribonucleoprotein | e value: 3.0e-04 | % identity: 39.47
Query:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF------SSTSKPD
        +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S  ++D+ K K R  RF      + T   +
Subjt:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTTDDEAKKKARLARF------SSTSKPD

Query:  PQEEEKRKARAIRF
          E +KRK RA RF
Subjt:  PQEEEKRKARAIRF

Q9LZ08 Protein MODIFIER OF SNC1 11 | e value: 8.1e-26 | % identity: 50.6
Query:  GENPSQTLE---TTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG--MGTTTNG
        GENP + ++   T +  +           +D G  KE  +S   G  +  D G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG       NG
Subjt:  GENPSQTLE---TTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG--MGTTTNG

Query:  LGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFS
           + K EE+KRKARA+RFG+ SA+ TTD   +EAKKKARLARF   +K D  EE KRKARA+RFS
Subjt:  LGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFS

Arabidopsis top hits (e value, % identity, alignment)
AT5G02770.1 unknown protein | e value: 5.8e-27 | % identity: 50.6
Query:  GENPSQTLE---TTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG--MGTTTNG
        GENP + ++   T +  +           +D G  KE  +S   G  +  D G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG       NG
Subjt:  GENPSQTLE---TTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG--MGTTTNG

Query:  LGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFS
           + K EE+KRKARA+RFG+ SA+ TTD   +EAKKKARLARF   +K D  EE KRKARA+RFS
Subjt:  LGASNKTEEVKRKARAERFGL-SASVTTD---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFS
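
The % identity values in the hit tables are consistent with counting identical columns in the alignment midline, where a letter marks an identity, '+' marks a conservative substitution and a space marks a mismatch. For the Cucumis melo hit, for example, 199 identical columns out of 204 gives 97.55. The following is a minimal sketch under that reading of the midline; `percent_identity` is a hypothetical helper, and the actual search tool may treat gaps and alignment length differently.

```python
def percent_identity(midline):
    """Percentage of alignment columns whose midline character is a letter.

    The midline must keep its internal spaces, since each space is a
    mismatched column and still counts toward the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)

# Short 14-column fragment of the Cucumis melo midline (not the full alignment).
fragment = "ESKC+GKPSA DDG"
print(percent_identity(fragment))

# Full-length arithmetic for the same hit: 199 identical columns out of 204.
print(round(100.0 * 199 / 204, 2))
```

The fragment contains 12 identities in 14 columns, so its value is lower than the full-length figure reported for the hit.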


Sequences
CDS sequence
ATGGCCACGGTAGCACCCGCAGGACTCAATGACATTAGCAATAGCAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGAAACGACGATTTCTTCATCTGCTCAGCC
TCCTGCCTCCGTTCCGCCTTCGTCAACTGATGGCGGGTCATCCAAGGAAGGGGATGAATCCAAATGCGCCGGTAAGCCGTCGGCTGAAGATGACGGTGCCCCGGTTTCTG
ATGTTCAGAGGAAAATGCGCCGTGCTGAGCGTTTTGGAATTTCGGTGCAATTGTCTGAAGAGGAAAAGCGTAATTCTCGAGCGGAACGGTTTGGTATGGGTACCACAACT
AATGGATTGGGAGCATCTAACAAGACAGAGGAGGTGAAGAGAAAGGCTAGAGCTGAGAGATTTGGGCTTTCTGCATCTGTGACCACTGATGATGAGGCAAAGAAGAAAGC
TCGTCTTGCTAGGTTTTCATCAACATCAAAGCCAGACCCTCAGGAGGAAGAAAAAAGGAAAGCAAGGGCAATCAGGTTTTCTAATCCAACACCAAACTCTCTTACACAAG
TGGATGGGAAAGGAAACGTTGAGACGATTGCAGATGTTGCAGGCAAGTCTGGAGGTGGTGCCTGA
mRNA sequence
CAAAAAACGAATAGAAAAGCTGTGCTCTCCCTCTCTTTCACGCGGCAATTCCCCTTTGCAGCTTTCGTGGTTCGCAAGCAGCTCAGATTAGGTTTCCGGTTTCAAATCTG
CTCCTCCAATGGCCACGGTAGCACCCGCAGGACTCAATGACATTAGCAATAGCAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGAAACGACGATTTCTTCATCT
GCTCAGCCTCCTGCCTCCGTTCCGCCTTCGTCAACTGATGGCGGGTCATCCAAGGAAGGGGATGAATCCAAATGCGCCGGTAAGCCGTCGGCTGAAGATGACGGTGCCCC
GGTTTCTGATGTTCAGAGGAAAATGCGCCGTGCTGAGCGTTTTGGAATTTCGGTGCAATTGTCTGAAGAGGAAAAGCGTAATTCTCGAGCGGAACGGTTTGGTATGGGTA
CCACAACTAATGGATTGGGAGCATCTAACAAGACAGAGGAGGTGAAGAGAAAGGCTAGAGCTGAGAGATTTGGGCTTTCTGCATCTGTGACCACTGATGATGAGGCAAAG
AAGAAAGCTCGTCTTGCTAGGTTTTCATCAACATCAAAGCCAGACCCTCAGGAGGAAGAAAAAAGGAAAGCAAGGGCAATCAGGTTTTCTAATCCAACACCAAACTCTCT
TACACAAGTGGATGGGAAAGGAAACGTTGAGACGATTGCAGATGTTGCAGGCAAGTCTGGAGGTGGTGCCTGAAAAGAACCTTCCCTTTTGATGGCAAGATAGCTTTTGC
TTGTAAATTTTTTTTTTGTGGTCTAGTATCCTGGCTAGGATTCCCCGTAATGGCTTCCATACCCTTGTGCATTCCTTTATAATGTTATCTAGTTCAAAAGTCATCCATTC
TGAGGGCAGATTTCATACTGCGAGGTGCTGTACTATGTGCTGAAGTGAGACAGGTTCCCTAGATTCAGTTGTGAGGATAATAACGCTCTGGTTAGGTGTTATGTAGTAGC
AAATACCTCCCTACGTCAAACCTATATTACAAGTTTTTTAAAGAATATCAGAAATTTCTGAAGTTTTTAGCCCTCCAATGTTCCGGTTATCATCTTCAGTTTATAACTCC
CTTATGATGACAAGTTGGATGTTCATGATATCATTTGAGCGGCAGTGTTAAGTTTCCATGTGCATTGGGTACTAGCTGATTCTGACCGGTTGAAATTTACTAACTTGTCA
AATTGACCTTTAGGATTAGAAAGTAAACATATCTTGCTTTTCTTTTAATTAAATATCATTAGGTTTTTAGTTTCTAACTATAATTTAGTTTGGATTTTTATGTTGTTTCT
TTTGAATGATAGGTTTTTACTTCGCTGACGGTAATAGTGTAAGACTAATGGCAAACTTTCAATATTAGTAAACTTGCATGTATATCTTTTGCCTTTATTATCCAATCACT
TACGTTTAAGGCTTGGTCTCTTTATATGATGCCAGCACGTCTCATCTAATGGCATCACTTATACTACTTATCATGGGAATGATTGCTACACTCTTTAGATGCCTAGTTTT
ACTATAACCTGCAGAATGCTTCGATAAGAATGGCCATGTGTTGAATGTTATTTAGAACTGCCGTTCAAAATGTCATCATGAAATAATAACTCCAACTTGTGTGAGAATAA
ACTGAAGATAAGTCTTGGAAGGGGAAGAAATTATCAATTCCATTGGTAAGTGGTCGTGTGGGAGAAGCTGAATTCTCTCTTCCAACTTGGGAGCTCTCTGATATTCTTGT
TTGGTAATATAGGATAGGGCGTCTTGGGAGGTATTTGTTGATCTCTTAAAAGTCTGTATCGAGGTCTGTAGCCGAGAATGGATCATACTCAAAAAGTAGGTTTCTCGTTA
AGTTTTTGTTGTTTCTTTCTTCTTTTCCTTTTATCTTTATTTGCTTAGCTACTTAGCAGCTACATGTATTTAGTTTCTTTTGGATTTTTGGTTGGATGTAATGGTATGGA
AGGACCGCAACATTTATTGAGGGAATCAATATTTTGATCAAAATTTCTTTTGCTACACACACACAACTCTTGCTCCTTGCAAACGATTACCAGAACGAAGTAACAATTGT
ATTACCA
Protein sequence
MATVAPAGLNDISNSKNPLGENPSQTLETTISSSAQPPASVPPSSTDGGSSKEGDESKCAGKPSAEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTTT
NGLGASNKTEEVKRKARAERFGLSASVTTDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNPTPNSLTQVDGKGNVETIADVAGKSGGGA
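
The protein sequence is the conceptual translation of the CDS sequence. As a minimal sketch, assuming the standard genetic code (the record does not name a translation table), the first and last codons of the CDS can be checked against the protein. `translate` is a hypothetical helper written for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 12 nucleotides of the CDS sequence in this record.
print(translate("ATGGCCACGGTAGCACCCGCAGGACTCAAT"))
print(translate("GGTGGTGCCTGA"))
```

The two fragments translate to the start (MATVAPAGLN) and the end (GGA, followed by the stop codon) of the 204-residue protein; passing the full CDS with line breaks removed would allow the whole sequence to be compared.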