; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G18640 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G18640
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionThioredoxin-like protein
Genome locationClcChr08:28550190..28559052
RNA-Seq ExpressionClc08G18640
SyntenyClc08G18640
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598213.1 Type 2 DNA topoisomerase 6 subunit B-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.0e-7196.35Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYP VKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIY
        GK SD NITKYSVKVLPFNYDPSAYGFREFFKRHGIY
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIY

XP_008447507.1 PREDICTED: uncharacterized protein LOC103489940 [Cucumis melo]6.0e-7288.44Show/hide
Query:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
        +R++ S+ +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFT+R NY+KHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
Subjt:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH

Query:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        SP+QAS+QGK +DSN+TKYSVKV+PFNYD SAYGFREFFKRHGIYGR
Subjt:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

XP_022962537.1 uncharacterized protein LOC111462937 [Cucurbita moschata]2.1e-7295.68Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYP VKFMRVECPKYPGFCISRQRKEYPFIEMFHSP+QAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK SD NITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

XP_022996547.1 uncharacterized protein LOC111491762 [Cucurbita maxima]7.1e-7396.4Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYP VKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK SD NITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

XP_038885436.1 uncharacterized protein LOC120075825 isoform X1 [Benincasa hispida]8.4e-7496.4Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPN+KFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK SDSN+TKYSVKVLPFNYDPSAYG REFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

TrEMBL top hitse value%identityAlignment
A0A0A0LAW0 Uncharacterized protein9.3e-7192.09Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFT+R NY+KHLD VLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSP+QAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK +DSN+TKYSVKVLPFNYD SAYGFREFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

A0A1S3BIH6 uncharacterized protein LOC1034899402.9e-7288.44Show/hide
Query:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
        +R++ S+ +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFT+R NY+KHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
Subjt:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH

Query:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        SP+QAS+QGK +DSN+TKYSVKV+PFNYD SAYGFREFFKRHGIYGR
Subjt:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

A0A6J1BSI0 uncharacterized protein LOC1110051431.6e-7088.97Show/hide
Query:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
        + ++ ++ +YYTG+PKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH
Subjt:  VRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFH

Query:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIY
        SPQQASNQGK +D +ITKYSVKVLPFNYD SAYGFREFFKRHGI+
Subjt:  SPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIY

A0A6J1HCY2 uncharacterized protein LOC1114629371.0e-7295.68Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYP VKFMRVECPKYPGFCISRQRKEYPFIEMFHSP+QAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK SD NITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

A0A6J1K286 uncharacterized protein LOC1114917623.4e-7396.4Show/hide
Query:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ
        +YYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYP VKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQAS+Q
Subjt:  QYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQ

Query:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
        GK SD NITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR
Subjt:  GKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.4e-1243.02Show/hide
Query:  WKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRIQVTYV
        W+ A+N E +A   N TW++       NIV  +WVF +K N  G+  R+KARLVA+GF Q   +D+ ETF+PVA+ S+ R  ++ V
Subjt:  WKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRIQVTYV

P04146 Copia protein1.8e-0730Show/hide
Query:  SVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDLGPSRVI
        S  A+ + L +A+ E  W+  LL  I   L +   ++ DN G  ++A NP  H R KHI+I  H+ R+QV      + Y+ +  Q    F K L  +R +
Subjt:  SVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDLGPSRVI

Query:  HFTSEREFVQ
            +   +Q
Subjt:  HFTSEREFVQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-1447.83Show/hide
Query:  LKDALATP---QWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR
        LK+ L+ P   Q  +AM EE  +L KN T+ LV        + CKWVF++KK+ D  + R+KARLV KGF Q  G+DF E FSPV K ++IR
Subjt:  LKDALATP---QWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-0430.86Show/hide
Query:  ADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSE
        A+ I   +   E+ W+   L E+     +  +++CD+  A  L+ N ++HARTKHI++  H+ R+ V  E   V  +S++E
Subjt:  ADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSE

P92520 Uncharacterized mitochondrial protein AtMg008202.9e-2150Show/hide
Query:  LKTSLSENLPKLKDALATPQWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR
        + T++ +    +  AL  P W +AM EE  AL +N TW LV P +  NI+GCKWVF+ K + DG++ R KARLVAKGFHQ  G+ F ET+SPV +++TIR
Subjt:  LKTSLSENLPKLKDALATPQWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-1931.93Show/hide
Query:  FRYNPKLNKPKQTQKKAESAYTAATLQRKGFSQVSQNQYTLMANSVIAN------LENVADPSWQADSGASKHVTTNPGYLTASTDYRGLIKNIKFGFHN
        F  +P+   P+Q   +  +  T    Q       SQN  T  + S +A         + + PS    + +S    T P  L         I N     +N
Subjt:  FRYNPKLNKPKQTQKKAESAYTAATLQRKGFSQVSQNQYTLMANSVIAN------LENVADPSWQADSGASKHVTTNPGYLTASTDYRGLIKNIKFGFHN

Query:  EQCVFLGPTSIHKGAHCLMATGKIIIFRHCPPLKTSLS---ENLPKLK-DALATPQWKRAMNEEFSALCKNWTWSLVLPSLQY-NIVGCKWVFRIKKNVD
         Q             H +    K  I +  P    ++S   E+ P+    AL   +W+ AM  E +A   N TW LV P   +  IVGC+W+F  K N D
Subjt:  EQCVFLGPTSIHKGAHCLMATGKIIIFRHCPPLKTSLS---ENLPKLK-DALATPQWKRAMNEEFSALCKNWTWSLVLPSLQY-NIVGCKWVFRIKKNVD

Query:  GSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI
        GS++R+KARLVAKG++Q  G+D+ ETFSPV KS++IRI
Subjt:  GSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1338.32Show/hide
Query:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDLGPSR
        R S  A+   +A   +E+ WI +LL E+   L+  P+++CDN+GA  L  NPVFH+R KHI ID H+ R+QV      V +VS+ +Q      K L  + 
Subjt:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDLGPSR

Query:  VIHFTSE
          +F S+
Subjt:  VIHFTSE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.2e-1852.27Show/hide
Query:  ALATPQWKRAMNEEFSALCKNWTWSLV-LPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI
        A+   +W++AM  E +A   N TW LV  P     IVGC+W+F  K N DGS++R+KARLVAKG++Q  G+D+ ETFSPV KS++IRI
Subjt:  ALATPQWKRAMNEEFSALCKNWTWSLV-LPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-1340.62Show/hide
Query:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDL
        R S  A+   +A   +E+ WI +LL E+   LS  P+++CDN+GA  L  NPVFH+R KHI +D H+ R+QV      V +VS+ +Q      K L
Subjt:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-1645.68Show/hide
Query:  WKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI
        W  AM++E  A+    TW +         +GCKWV++IK N DG++ R+KARLVAKG+ Q  G+DF ETFSPV K +++++
Subjt:  WKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRI

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.1e-1038.38Show/hide
Query:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQ-VLQEHFAVRYVSSSEQYYTGFPKDLGP
        + S  A+   L+ A  E+ W++    E+  PLS   +L+CDN  A  +ATN VFH RTKHIE D H  R++ V Q   +  + +  EQ   GF + L P
Subjt:  RKSVAADCIFLAQAIAEISWISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQ-VLQEHFAVRYVSSSEQYYTGFPKDLGP

AT5G57230.1 Thioredoxin superfamily protein4.4e-6577.7Show/hide
Query:  SSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQA
        S+ +YY+G+PKDLGPSRV+HFTSEREFVQLLH+GYPVVVAFT+R NYT+HLD++LEEAA EFYPN+KFMRVECPKYPGFCI+RQ+ EYPFIE+FHSPQ A
Subjt:  SSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNYTKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQA

Query:  SNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGI
         N+GK  D NIT+YSVKV+P+NYD S YGFREFFKR G+
Subjt:  SNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.1e-2250Show/hide
Query:  LKTSLSENLPKLKDALATPQWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR
        + T++ +    +  AL  P W +AM EE  AL +N TW LV P +  NI+GCKWVF+ K + DG++ R KARLVAKGFHQ  G+ F ET+SPV +++TIR
Subjt:  LKTSLSENLPKLKDALATPQWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKKNVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGTCAATCAATTTTTTGCTTCCGCTTCATCTTCTGTAGCTTCGCCTGTGGAAGCTATTCAAGAGCTCTTTGGTGTTCAATCCAGAGTTGAAATTGATCATCT
CAAGAGGCTATTCGATGAAAGTGGGCGAGTATCTTGCTACAATGCCTCTGATGGTCTAGCTCTGGCCGACAGCCCTGTTCTAGTTAGTGATTTGGTCTCACAGGTGTTAT
CAGGCCTTGACGAGGAGTATAATCCTGTAGTAATGTTGATTCAAGGCAACTGCAGTTCGAACCAAAATTTCAATAATCAGACACAAGGGAGTTCATATCACGGTTCAAAA
GGGAAAAATAATCGGGGAAAAGAGAGGTGGAACAATGCCTCCTCGAACTGGCTAGTGTGCCAAGTTTGTGACCAATTGGATCACACAGCTGACATCTGCTTTTTTCGATA
TAACCCAAAACTTAACAAGCCCAAACAAACACAAAAGAAGGCTGAATCTGCCTATACGGCTGCGACTCTTCAGCGTAAGGGATTTTCTCAAGTTTCTCAAAATCAATATA
CTCTTATGGCTAACTCAGTCATAGCCAATCTTGAGAATGTAGCTGACCCTAGTTGGCAGGCCGACAGTGGAGCATCAAAACATGTTACCACAAATCCTGGATATCTCACT
GCCTCTACTGACTACAGAGGCCTTATCAAAAACATTAAGTTTGGGTTCCACAATGAACAATGTGTTTTTCTTGGGCCAACTTCAATCCACAAAGGAGCTCATTGTCTAAT
GGCCACGGGAAAGATTATCATCTTCAGGCATTGTCCACCATTGAAGACTAGTCTCTCCGAGAACCTACCAAAGTTAAAAGATGCATTAGCAACTCCTCAGTGGAAACGTG
CAATGAATGAGGAATTTAGTGCTCTCTGTAAAAATTGGACTTGGTCACTTGTTCTACCATCCTTGCAGTACAACATTGTAGGATGTAAATGGGTTTTTCGGATAAAGAAA
AACGTTGACGGCTCGGTGCATAGACACAAGGCTAGACTTGTAGCTAAAGGCTTTCATCAAAGCCTAGGTGTCGATTTCTTTGAAACTTTCAGCCCAGTGGCAAAGTCCTC
AACAATTCGAATTCAAGTTACCTATGTTCCAAATGGGATTCGTCTTAATCAATCGAAGTACATATCTGATCACCTAGTGACGTTAAATCTGCAGAACCTAAACCCCTGTC
CCTCTTCAGCTGTCTTGGGAAAATCACTCTCTTTAGGATGTATAGATGACCGGAAGTCAGTTGCTGCTGACTGCATATTTCTTGCTCAAGCTATTGCTGAAATTTCTTGG
ATAAGTAATCTACTTAATGAGATTGTCTCCCCTCTGTCTGATACACCCATTCTTTGGTGTGATAATATTGGTGCTGGTGCTCTTGCCACAAATCCAGTTTTTCATGCTAG
AACAAAACACATTGAGATAGATGTGCATTACGATCGTGATCAAGTCCTTCAAGAACATTTTGCTGTTCGCTATGTTTCGTCTAGTGAGCAGTATTATACCGGTTTTCCAA
AAGATCTTGGGCCTTCCAGGGTCATACATTTTACGTCTGAACGTGAGTTTGTCCAGCTCCTTCATGAAGGCTATCCGGTCGTTGTTGCTTTTACGGTTAGAGGTAACTAC
ACAAAGCATCTTGACAAAGTATTGGAGGAAGCTGCTGTTGAGTTTTATCCGAATGTTAAATTTATGCGAGTTGAGTGCCCAAAGTATCCTGGTTTCTGCATTTCACGGCA
GAGGAAGGAATACCCATTTATCGAGATGTTTCATAGCCCACAACAAGCATCTAACCAGGGAAAGTTTTCTGATTCTAACATTACAAAATACTCGGTGAAGGTTCTACCTT
TCAACTATGACCCCAGTGCCTATGGATTCAGAGAGTTTTTTAAGCGTCATGGGATATATGGTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGTCAATCAATTTTTTGCTTCCGCTTCATCTTCTGTAGCTTCGCCTGTGGAAGCTATTCAAGAGCTCTTTGGTGTTCAATCCAGAGTTGAAATTGATCATCT
CAAGAGGCTATTCGATGAAAGTGGGCGAGTATCTTGCTACAATGCCTCTGATGGTCTAGCTCTGGCCGACAGCCCTGTTCTAGTTAGTGATTTGGTCTCACAGGTGTTAT
CAGGCCTTGACGAGGAGTATAATCCTGTAGTAATGTTGATTCAAGGCAACTGCAGTTCGAACCAAAATTTCAATAATCAGACACAAGGGAGTTCATATCACGGTTCAAAA
GGGAAAAATAATCGGGGAAAAGAGAGGTGGAACAATGCCTCCTCGAACTGGCTAGTGTGCCAAGTTTGTGACCAATTGGATCACACAGCTGACATCTGCTTTTTTCGATA
TAACCCAAAACTTAACAAGCCCAAACAAACACAAAAGAAGGCTGAATCTGCCTATACGGCTGCGACTCTTCAGCGTAAGGGATTTTCTCAAGTTTCTCAAAATCAATATA
CTCTTATGGCTAACTCAGTCATAGCCAATCTTGAGAATGTAGCTGACCCTAGTTGGCAGGCCGACAGTGGAGCATCAAAACATGTTACCACAAATCCTGGATATCTCACT
GCCTCTACTGACTACAGAGGCCTTATCAAAAACATTAAGTTTGGGTTCCACAATGAACAATGTGTTTTTCTTGGGCCAACTTCAATCCACAAAGGAGCTCATTGTCTAAT
GGCCACGGGAAAGATTATCATCTTCAGGCATTGTCCACCATTGAAGACTAGTCTCTCCGAGAACCTACCAAAGTTAAAAGATGCATTAGCAACTCCTCAGTGGAAACGTG
CAATGAATGAGGAATTTAGTGCTCTCTGTAAAAATTGGACTTGGTCACTTGTTCTACCATCCTTGCAGTACAACATTGTAGGATGTAAATGGGTTTTTCGGATAAAGAAA
AACGTTGACGGCTCGGTGCATAGACACAAGGCTAGACTTGTAGCTAAAGGCTTTCATCAAAGCCTAGGTGTCGATTTCTTTGAAACTTTCAGCCCAGTGGCAAAGTCCTC
AACAATTCGAATTCAAGTTACCTATGTTCCAAATGGGATTCGTCTTAATCAATCGAAGTACATATCTGATCACCTAGTGACGTTAAATCTGCAGAACCTAAACCCCTGTC
CCTCTTCAGCTGTCTTGGGAAAATCACTCTCTTTAGGATGTATAGATGACCGGAAGTCAGTTGCTGCTGACTGCATATTTCTTGCTCAAGCTATTGCTGAAATTTCTTGG
ATAAGTAATCTACTTAATGAGATTGTCTCCCCTCTGTCTGATACACCCATTCTTTGGTGTGATAATATTGGTGCTGGTGCTCTTGCCACAAATCCAGTTTTTCATGCTAG
AACAAAACACATTGAGATAGATGTGCATTACGATCGTGATCAAGTCCTTCAAGAACATTTTGCTGTTCGCTATGTTTCGTCTAGTGAGCAGTATTATACCGGTTTTCCAA
AAGATCTTGGGCCTTCCAGGGTCATACATTTTACGTCTGAACGTGAGTTTGTCCAGCTCCTTCATGAAGGCTATCCGGTCGTTGTTGCTTTTACGGTTAGAGGTAACTAC
ACAAAGCATCTTGACAAAGTATTGGAGGAAGCTGCTGTTGAGTTTTATCCGAATGTTAAATTTATGCGAGTTGAGTGCCCAAAGTATCCTGGTTTCTGCATTTCACGGCA
GAGGAAGGAATACCCATTTATCGAGATGTTTCATAGCCCACAACAAGCATCTAACCAGGGAAAGTTTTCTGATTCTAACATTACAAAATACTCGGTGAAGGTTCTACCTT
TCAACTATGACCCCAGTGCCTATGGATTCAGAGAGTTTTTTAAGCGTCATGGGATATATGGTCGTTGA
Protein sequenceShow/hide protein sequence
MANVNQFFASASSSVASPVEAIQELFGVQSRVEIDHLKRLFDESGRVSCYNASDGLALADSPVLVSDLVSQVLSGLDEEYNPVVMLIQGNCSSNQNFNNQTQGSSYHGSK
GKNNRGKERWNNASSNWLVCQVCDQLDHTADICFFRYNPKLNKPKQTQKKAESAYTAATLQRKGFSQVSQNQYTLMANSVIANLENVADPSWQADSGASKHVTTNPGYLT
ASTDYRGLIKNIKFGFHNEQCVFLGPTSIHKGAHCLMATGKIIIFRHCPPLKTSLSENLPKLKDALATPQWKRAMNEEFSALCKNWTWSLVLPSLQYNIVGCKWVFRIKK
NVDGSVHRHKARLVAKGFHQSLGVDFFETFSPVAKSSTIRIQVTYVPNGIRLNQSKYISDHLVTLNLQNLNPCPSSAVLGKSLSLGCIDDRKSVAADCIFLAQAIAEISW
ISNLLNEIVSPLSDTPILWCDNIGAGALATNPVFHARTKHIEIDVHYDRDQVLQEHFAVRYVSSSEQYYTGFPKDLGPSRVIHFTSEREFVQLLHEGYPVVVAFTVRGNY
TKHLDKVLEEAAVEFYPNVKFMRVECPKYPGFCISRQRKEYPFIEMFHSPQQASNQGKFSDSNITKYSVKVLPFNYDPSAYGFREFFKRHGIYGR