CuGenDBv2

CaUC04G074280 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC04G074280
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: protein MODIFIER OF SNC1 11-like
Genome location: Ciama_Chr04:23395523..23399172
RNA-Seq Expression: CaUC04G074280
Synteny: CaUC04G074280
Gene Ontology terms: GO:0016973 - poly(A)+ mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domains: IPR040746 - THO1_MOS11, C-terminal domain
IPR044209 - Protein MODIFIER OF SNC1 11
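
The genome location field uses a `chromosome:start..end` layout. A minimal sketch of how such a string could be split and the locus span computed is shown here; the function name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption not stated on this page.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Ciama_Chr04:23395523..23399172")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```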


Homology
GenBank top hits (e-value, % identity, alignment)
XP_008447291.1 PREDICTED: protein MODIFIER OF SNC1 11-like [Cucumis melo] | e-value: 3.8e-89 | identity: 92.65%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN T NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_022949856.1 protein MODIFIER OF SNC1 11-like [Cucurbita moschata] | e-value: 6.7e-86 | identity: 89.71%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KNPLGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SVE DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV  DDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSN TSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_022977777.1 protein MODIFIER OF SNC1 11-like [Cucurbita maxima] | e-value: 4.3e-85 | identity: 89.22%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KN LGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SVE DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV  DDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSN TSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_031740008.1 protein MODIFIER OF SNC1 11 [Cucumis sativus] | e-value: 2.5e-88 | identity: 92.16%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+ TISSSAQ PASVPPSSTDGGSSKE DESK +GK S EDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN T NSLTQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

XP_038882639.1 protein MODIFIER OF SNC1 11 [Benincasa hispida] | e-value: 4.5e-90 | identity: 93.63%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAGLNDI+N+K+PLGENPSQTLDPTISSSAQ PASVP SSTDGGSSK+RDESKGSGKASVE DGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGT TNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN TSNS TQ++GKGNVETIADVAGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L0R4 Tho1_MOS11_C domain-containing protein | e-value: 1.2e-88 | identity: 92.16%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+ TISSSAQ PASVPPSSTDGGSSKE DESK +GK S EDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN T NSLTQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A1S3BHQ8 protein MODIFIER OF SNC1 11-like | e-value: 1.8e-89 | identity: 92.65%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN T NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A5A7TZD8 Protein MODIFIER OF SNC1 11-like | e-value: 1.8e-89 | identity: 92.65%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVA AGLNDI+N+KNPLGENPSQTL+PTISSSAQTPASVPPSSTDGGSSKE DESK SGK S  DDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVT DDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSN T NS+TQV+GKGNVETIADVAGK+
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1GD78 protein MODIFIER OF SNC1 11-like | e-value: 3.2e-86 | identity: 89.71%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KNPLGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SVE DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV  DDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSN TSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

A0A6J1IR28 protein MODIFIER OF SNC1 11-like | e-value: 2.1e-85 | identity: 89.22%
Query:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR
        MATVASAG+NDI+N+KN LGENPSQTLDPTISSSAQ P SVP SSTDGGSSKE DESKGS K SVE DG PVSDVQRKMRRAERFGISVQLSEEEKRNSR
Subjt:  MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSR

Query:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA
        AERFGMGTTTNGLGASNK+EEVKRKARAERFGL ASV  DDEAKKKARL RFSSTSKPD QEEEKRKARAIRFSN TSNSLTQV+GKGN+ETIAD+AGKA
Subjt:  AERFGMGTTTNGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKA

Query:  GGGA
        GGGA
Subjt:  GGGA

SwissProt top hits (e-value, % identity, alignment)
P82979 SAP domain-containing ribonucleoprotein | e-value: 3.9e-04 | identity: 39.47%
Query:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTIDDEAKKKARLARF------SSTSKPD
        +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S   +D+ K K R  RF      + T   +
Subjt:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTIDDEAKKKARLARF------SSTSKPD

Query:  PQEEEKRKARAIRF
          E +KRK RA RF
Subjt:  PQEEEKRKARAIRF

Q5R4V4 SAP domain-containing ribonucleoprotein | e-value: 3.9e-04 | identity: 39.47%
Query:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTIDDEAKKKARLARF------SSTSKPD
        +   +R  +RAERF + V L  E K+ +RA RFG+ +  T GL + NK      K K RA+RFGL   S S   +D+ K K R  RF      + T   +
Subjt:  VSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTT-TNGLGASNK--TEEVKRKARAERFGL---SASVTIDDEAKKKARLARF------SSTSKPD

Query:  PQEEEKRKARAIRF
          E +KRK RA RF
Subjt:  PQEEEKRKARAIRF

Q9LZ08 Protein MODIFIER OF SNC1 11 | e-value: 9.6e-27 | identity: 45.77%
Query:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-
        +N      GENP + +D   +   +T   +       +D G  KE  +S G G  +  D G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG 
Subjt:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-

Query:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTID---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTS-NSLTQVEGKGNVETIADVAGK
              NG   + K EE+KRKARA+RFG+ SA+ T D   +EAKKKARLARF   +K D  EE KRKARA+RFS   S ++ + +  K  +   A V+G 
Subjt:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTID---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTS-NSLTQVEGKGNVETIADVAGK

Query:  A
        A
Subjt:  A

Arabidopsis top hits (e-value, % identity, alignment)
AT5G02770.1 unknown protein | e-value: 6.8e-28 | identity: 45.77%
Query:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-
        +N      GENP + +D   +   +T   +       +D G  KE  +S G G  +  D G  +PV D+Q+K+RRAERFG+SV+L+EEEKRNSRAERFG 
Subjt:  INNNKNPLGENPSQTLDPTISSSAQTPASVP---PSSTDGGSSKERDESKGSGKASVEDDG--APVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFG-

Query:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTID---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTS-NSLTQVEGKGNVETIADVAGK
              NG   + K EE+KRKARA+RFG+ SA+ T D   +EAKKKARLARF   +K D  EE KRKARA+RFS   S ++ + +  K  +   A V+G 
Subjt:  -MGTTTNGLGASNKTEEVKRKARAERFGL-SASVTID---DEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTS-NSLTQVEGKGNVETIADVAGK

Query:  A
        A
Subjt:  A
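
In BLAST-style alignments such as those listed under Homology, the middle row of each block marks an identical residue with its letter, a conservative substitution with `+`, and a mismatch or gap with a space. The sketch below counts these marks for one block. The helper name `midline_counts` is hypothetical, and the example rows are the final block of the P82979 alignment with the 8-character `Query:  ` prefix removed. The page's % identity is computed over the full alignment, so a single block will not reproduce it.

```python
def midline_counts(query_row: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment block.

    Letters in the midline are identities; '+' marks a conservative
    substitution; spaces are mismatches or gaps.
    """
    columns = len(query_row)
    # Trailing spaces are often stripped from scraped text, so pad the midline.
    padded = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in padded)
    positive = identical + padded.count("+")
    return identical, positive, columns


query_row = "PQEEEKRKARAIRF"
midline = "  E +KRK RA RF"

identical, positive, columns = midline_counts(query_row, midline)
print(f"identities {identical}/{columns}, positives {positive}/{columns}")
```

Summing the counts across all blocks of a hit and dividing by the total number of alignment columns gives the overall percentage.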


Sequences
CDS sequence
ATGGCGACGGTAGCATCTGCAGGACTAAATGACATTAACAATAACAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGACCCGACGATTTCTTCCTCGGCCCAGAC
TCCTGCCTCCGTTCCGCCTTCGTCAACTGATGGCGGGTCATCCAAGGAAAGGGATGAATCCAAGGGCTCCGGTAAGGCGTCGGTTGAAGATGATGGTGCACCGGTTTCTG
ATGTCCAGAGGAAAATGCGCCGCGCCGAGCGTTTTGGGATTTCAGTGCAATTGTCCGAAGAAGAAAAGCGTAATTCTCGAGCAGAACGGTTTGGTATGGGTACTACAACT
AATGGATTGGGAGCATCTAATAAGACGGAGGAGGTGAAGAGAAAGGCTAGAGCTGAGAGATTTGGACTTTCTGCATCTGTGACTATTGATGATGAGGCAAAGAAGAAAGC
TCGTCTTGCTAGGTTTTCATCAACATCAAAGCCAGACCCCCAGGAGGAAGAAAAGAGGAAAGCAAGGGCAATCAGGTTTTCTAATCGAACATCAAACTCTCTCACACAAG
TGGAAGGGAAAGGAAATGTTGAGACGATTGCAGATGTTGCAGGCAAGGCTGGAGGTGGTGCCTGA
mRNA sequence
GTATACACAAGACCGGAAAATATCCCTCGTTCTCATTCTGATAGGAATTTACATTATTAACCCTGCCCTTCTGATCCACGATGTGAAATATCTGACCAGAACCAAAACCG
AATAGAAAGCTGTGCCCTCCCTTTCACGCGGCAATTTCCCTTGCAGCTTTCGTGGTTCGCAAGCAGTTCCCGATTAGGTTTCCGGTCTCAAATCTGCTCCTTCAATGGCG
ACGGTAGCATCTGCAGGACTAAATGACATTAACAATAACAAGAATCCGCTTGGAGAAAACCCTAGTCAAACCCTAGACCCGACGATTTCTTCCTCGGCCCAGACTCCTGC
CTCCGTTCCGCCTTCGTCAACTGATGGCGGGTCATCCAAGGAAAGGGATGAATCCAAGGGCTCCGGTAAGGCGTCGGTTGAAGATGATGGTGCACCGGTTTCTGATGTCC
AGAGGAAAATGCGCCGCGCCGAGCGTTTTGGGATTTCAGTGCAATTGTCCGAAGAAGAAAAGCGTAATTCTCGAGCAGAACGGTTTGGTATGGGTACTACAACTAATGGA
TTGGGAGCATCTAATAAGACGGAGGAGGTGAAGAGAAAGGCTAGAGCTGAGAGATTTGGACTTTCTGCATCTGTGACTATTGATGATGAGGCAAAGAAGAAAGCTCGTCT
TGCTAGGTTTTCATCAACATCAAAGCCAGACCCCCAGGAGGAAGAAAAGAGGAAAGCAAGGGCAATCAGGTTTTCTAATCGAACATCAAACTCTCTCACACAAGTGGAAG
GGAAAGGAAATGTTGAGACGATTGCAGATGTTGCAGGCAAGGCTGGAGGTGGTGCCTGAAAAGAACATTCCCTTTTGATGGCAAGATAGCTTTTGCTTGTAAATTTGTTT
TGTGGTCTAGTATCCTGGTTAGGATTCCCTGTAATGGCTTCCATACCCATGTGCATTCCTTTATAATATTATCTAGTTCAAAAGTCTTCCATTCTGAGGGCAGATTCCAT
ACTGCGAGGTGCTGTACTACGTGTTGCTGTGAGACAAGGTTCCCTAGTTTCAGTTGTATCAGGATATTAGTTCTCTGGTTAGGTGTTATATAGTAGCAAATACCTCCCTA
TCTTACAAGTTTATTAAAGAATATCAGAAATTTCTGAAGTTTTTAGCCCTCCAATGTTTTGGTTATCATCTTCAGTTTATATCTCCTTTGTGATGACAAGTTGGATGTTC
ATGATATCATTTGAACGGCAGTGTTAAGTTTCCATGTGCATTGGTACCATCTGACACTGACTGGTTGAAATTTTCTAATTTGCCAAATTAACCTTCAGGATTAGAAAATT
AACGCATCTTGCTTTTCTTTGAGTTAATATCATTAGGTTTTTTCTTATCCAAAAAATATCATTAGATTTTTAGTTTTCTAACTACAATTTAGTTCGAACTTTTAGGTTCC
TTTTTAATGATAGGTTTTTATTTTGCTGACTGTAATAGTGTAAGACTAATGGCCAACTATCAATGTTAGGAAACTTGCATGTATATCTTTTGCCTTCTTTTTCAGAGACA
GATCAATAGTCTGAGCACAGCTAGTTATTATCCAATCACTTACGTTTGAGGTTTGGTCTCTTTGTATGATGCCGGCACGTCTCATCTCATGGCATCACTTAATTACTCAT
CATGGGAATGACTATTACACTCTTTAGATGCCTAGTTTTTTACTATAACCTGCAGAGCACTTCGATTAGAATGGGTCATGTGTTGAATGTCATTTAGAACTGCCGTTCAA
AATGTCATCATGAGATAATAGCTCCAACTTGTGTGAGAATAAACTGAAGATGAGTCTTGGGAGGGGAAGAAATTATCAGTTCCATTGGTAAGTGGTCGCATGGGAGAAGC
TGAATTCTCTCTTCTAACTTGGGAGTTTCTGATATTCTTGTTTGGTTATATAGGATAGGCCGCGTCTTTGGAGGTATTTGTTGGCTCTCTTAGAAGTCTGCATGTATGGA
GTTCTGTAGCTGAGAATGGATCGTACTCTACAAATAGGTTTCTCGTCAAGTTTTTTTCTCTTTCTTTTCCTTTTATCTTTAATTGCTTAGCTACTTAGCAGCTACATGTA
CTTAGTTTCTTTTGGATTTTTGGTTGGATGTAATGGTATGGAAGGACCACAACATTTATCAAGGGAATCAATATTTTGATCAAAATTTCTTTGTGATA
Protein sequence
MATVASAGLNDINNNKNPLGENPSQTLDPTISSSAQTPASVPPSSTDGGSSKERDESKGSGKASVEDDGAPVSDVQRKMRRAERFGISVQLSEEEKRNSRAERFGMGTTT
NGLGASNKTEEVKRKARAERFGLSASVTIDDEAKKKARLARFSSTSKPDPQEEEKRKARAIRFSNRTSNSLTQVEGKGNVETIADVAGKAGGGA
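
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the first 30 nucleotides and the last 12 nucleotides of the CDS can be translated as follows; `translate` is a hypothetical helper, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 12 nt of the CDS sequence listed above.
cds_start = "ATGGCGACGGTAGCATCTGCAGGACTAAAT"
cds_end = "GGTGGTGCCTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Joining the six CDS lines into one string and passing it to `translate` allows the result to be compared against the full protein sequence.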