; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G13875 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G13875
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr05:12861020..12865950
RNA-Seq ExpressionClc05G13875
SyntenyClc05G13875
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW35813.1 putative ribonuclease H protein [Vitis vinifera]5.1e-3231.54Show/hide
Query:  PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTS
        P+ YLG+PLGGNPK+  F  P+VE+I ++LD WK +Y+S  G +TL+Q+ L ++P+Y+LSLF+  VSI  +IEK+ R+FLW G  +    HL++ +VV+ 
Subjt:  PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTS

Query:  PKSMGGL----DALDSEA----HCWRLFPRQPL-LYRELEA--WNSLTSGW---------------RQPTIPTNPTDIEKIFYNLSSDGYFSVKAMKILP
        P+ MGGL     ++ + A      WR FPR+   L+ ++ A  + +  +GW                  ++  +P+  +   ++LSS G FSVK+     
Subjt:  PKSMGGL----DALDSEA----HCWRLFPRQPL-LYRELEA--WNSLTSGW---------------RQPTIPTNPTDIEKIFYNLSSDGYFSVKAMKILP

Query:  LEKKSESLEHIFISCIFTRNI--------WDRLLLGATAQPIYSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWD
        L K S  L  +    +++  +        W   L+G    P  SI ++    +  L  + + +IL       ++W +W E+N R F+   R    +WD
Subjt:  LEKKSESLEHIFISCIFTRNI--------WDRLLLGATAQPIYSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWD

RVW87853.1 putative mitochondrial protein [Vitis vinifera]2.2e-3027.06Show/hide
Query:  RAKQKDLISSCRF-ENELNHLLFADDILLFSST--ENIWSYTPTTL-------------PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKR
        +A++++++   +   N++ HL FADD + FSS+  E++ +     L             PI YLG+PLGGNPK+  F  P++E+I ++LD W+ +Y+S  
Subjt:  RAKQKDLISSCRF-ENELNHLLFADDILLFSST--ENIWSYTPTTL-------------PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKR

Query:  GHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGLDALDSEAHCW------RLFPRQP-----LLYR--
        G +TL+Q+ L ++P Y+LSLF+   ++  +IE++ R+FLW G  +    HLV  DVV        L    + ++ W      R   R P     L+Y+  
Subjt:  GHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGLDALDSEAHCW------RLFPRQP-----LLYR--

Query:  ----------ELEAWNSLTSGWRQPTIPTNPTDI---------------------EKIFYNLSSDGYFSVKAMKILPL-EKKSESLEHIFISCIFTRNIW
                  E+E  + +T G       +N  D                      +K  + LS    +   +  I  L  K  E+++H+F+ C  T  +W
Subjt:  ----------ELEAWNSLTSGWRQPTIPTNPTDI---------------------EKIFYNLSSDGYFSVKAMKILPL-EKKSESLEHIFISCIFTRNIW

Query:  DRLLLGATAQPI--YSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVA
         RL   A    +   SI+++    +     + +  +L  N    ++W +W E+N R F+   RN  +LWD I  + +
Subjt:  DRLLLGATAQPI--YSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVA

RVX23238.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]9.7e-3130.8Show/hide
Query:  IFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSP
        I YLG+PLGGNPK+  F  P++E+I  +LD W+ +Y+S RG +T +Q+ L +LP Y+LSLF+   S+  +IE+L R+FLW G  +    HLV+  VV S 
Subjt:  IFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSP

Query:  KSMGGLDAL---------------DSEAHCWRLFPRQPLLYRELEAWNSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKA-MKILPLEKKSESLEHI
          +  L  L                +    W L  R+ L   E+E    L     +  +  +P+D    F   +S   F VK+ ++++  +K  ES +H+
Subjt:  KSMGGLDAL---------------DSEAHCWRLFPRQPLLYRELEAWNSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKA-MKILPLEKKSESLEHI

Query:  FISCIFTRNIWDRLLLGA-----TAQPIYSINEL-WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVAL
        F+ C  T  +W RL   A       + IY +  + +     + +  +L       ++  +W E+N R F+   RN   LWD I+ + +L
Subjt:  FISCIFTRNIWDRLLLGA-----TAQPIYSINEL-WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVAL

TYK05764.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-2936.78Show/hide
Query:  IVNKSLNATYIALILKKSHCIRVSDFRPISFTT------------RAKQK---------------------DLISSCRFENELN--HLLFADDILLF---
        I+N+++N T IALI KK  C   +D+RPIS TT            R K+                      D I   R  N LN  HLLFADDILLF   
Subjt:  IVNKSLNATYIALILKKSHCIRVSDFRPISFTT------------RAKQK---------------------DLISSCRFENELN--HLLFADDILLF---

Query:  --SSTENI--------------------------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTL
           S  N+                                      W  +    P  YLG+PL G P S  F   I EK+ KKL +WK+S +SK   +TL
Subjt:  --SSTENI--------------------------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTL

Query:  LQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGL
        + + L +LPTY LS+F+A  SIYK IEK  RNFLW  S +    HLV+  +VTSPK  GGL
Subjt:  LQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGL

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-2925.32Show/hide
Query:  ECSWLIGNLNWRPISEAQAFTLIQSFLQPE----------------------------------------------IVNKSLNATYIALILKKSHCIRVS
        E  WLI NL+W PIS +QA  L  SF + E                                              I+NK++N T IALI KK  C   +
Subjt:  ECSWLIGNLNWRPISEAQAFTLIQSFLQPE----------------------------------------------IVNKSLNATYIALILKKSHCIRVS

Query:  DFRPISFTT-------------------------------------------------RAKQ--------------KDLISSCRF------------ENE
        D+RPIS TT                                                 R K+              K  ISS ++            E+ 
Subjt:  DFRPISFTT-------------------------------------------------RAKQ--------------KDLISSCRF------------ENE

Query:  LNHLLFADDILLFSSTENI-----------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTI
        L +L    ++   +S  NI                       W  T    PI YLG+PLGG P + AF   I EKI KKL SWK+S +SK G +TL+++ 
Subjt:  LNHLLFADDILLFSSTENI-----------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTI

Query:  LGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDV--------------VTSPKSM--------------GGLDALDSEAHCWRLFPR
        L +LPTY LS+F+A VS YK IEK  RNF W+   +    HLV   +                SP S+                 D  ++    W L PR
Subjt:  LGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDV--------------VTSPKSM--------------GGLDALDSEAHCWRLFPR

Query:  QPLLYRELEAW----NSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKAMK---------ILPLE---------------------------------
        + L   E   W    NSL + + +     +PT I      L+SDG++SV ++K         IL L+                                 
Subjt:  QPLLYRELEAW----NSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKAMK---------ILPLE---------------------------------

Query:  --------------------KKSESLEHIFISCIFTRNIWDRLLLGATAQPIYSINELWPQ--------LKTTTKDRILKSNIMTVILWCIWLEQNKRTF
                            +  E   H+FI C   ++IW+ +    ++    ++N L P+         K  TK  I+  N     LW IWLE+N R F
Subjt:  --------------------KKSESLEHIFISCIFTRNIWDRLLLGATAQPIYSINELWPQ--------LKTTTKDRILKSNIMTVILWCIWLEQNKRTF

Query:  QGIDRNHSHLWDDIILIVAL
         G ++  + LW+DI  +  L
Subjt:  QGIDRNHSHLWDDIILIVAL

TrEMBL top hitse value%identityAlignment
A0A438DK26 Putative ribonuclease H protein2.5e-3231.54Show/hide
Query:  PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTS
        P+ YLG+PLGGNPK+  F  P+VE+I ++LD WK +Y+S  G +TL+Q+ L ++P+Y+LSLF+  VSI  +IEK+ R+FLW G  +    HL++ +VV+ 
Subjt:  PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTS

Query:  PKSMGGL----DALDSEA----HCWRLFPRQPL-LYRELEA--WNSLTSGW---------------RQPTIPTNPTDIEKIFYNLSSDGYFSVKAMKILP
        P+ MGGL     ++ + A      WR FPR+   L+ ++ A  + +  +GW                  ++  +P+  +   ++LSS G FSVK+     
Subjt:  PKSMGGL----DALDSEA----HCWRLFPRQPL-LYRELEA--WNSLTSGW---------------RQPTIPTNPTDIEKIFYNLSSDGYFSVKAMKILP

Query:  LEKKSESLEHIFISCIFTRNI--------WDRLLLGATAQPIYSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWD
        L K S  L  +    +++  +        W   L+G    P  SI ++    +  L  + + +IL       ++W +W E+N R F+   R    +WD
Subjt:  LEKKSESLEHIFISCIFTRNI--------WDRLLLGATAQPIYSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWD

A0A438HTQ7 Putative mitochondrial protein1.0e-3027.06Show/hide
Query:  RAKQKDLISSCRF-ENELNHLLFADDILLFSST--ENIWSYTPTTL-------------PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKR
        +A++++++   +   N++ HL FADD + FSS+  E++ +     L             PI YLG+PLGGNPK+  F  P++E+I ++LD W+ +Y+S  
Subjt:  RAKQKDLISSCRF-ENELNHLLFADDILLFSST--ENIWSYTPTTL-------------PIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKR

Query:  GHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGLDALDSEAHCW------RLFPRQP-----LLYR--
        G +TL+Q+ L ++P Y+LSLF+   ++  +IE++ R+FLW G  +    HLV  DVV        L    + ++ W      R   R P     L+Y+  
Subjt:  GHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGLDALDSEAHCW------RLFPRQP-----LLYR--

Query:  ----------ELEAWNSLTSGWRQPTIPTNPTDI---------------------EKIFYNLSSDGYFSVKAMKILPL-EKKSESLEHIFISCIFTRNIW
                  E+E  + +T G       +N  D                      +K  + LS    +   +  I  L  K  E+++H+F+ C  T  +W
Subjt:  ----------ELEAWNSLTSGWRQPTIPTNPTDI---------------------EKIFYNLSSDGYFSVKAMKILPL-EKKSESLEHIFISCIFTRNIW

Query:  DRLLLGATAQPI--YSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVA
         RL   A    +   SI+++    +     + +  +L  N    ++W +W E+N R F+   RN  +LWD I  + +
Subjt:  DRLLLGATAQPI--YSINEL----WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVA

A0A438KPW8 LINE-1 retrotransposable element ORF2 protein4.7e-3130.8Show/hide
Query:  IFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSP
        I YLG+PLGGNPK+  F  P++E+I  +LD W+ +Y+S RG +T +Q+ L +LP Y+LSLF+   S+  +IE+L R+FLW G  +    HLV+  VV S 
Subjt:  IFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSP

Query:  KSMGGLDAL---------------DSEAHCWRLFPRQPLLYRELEAWNSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKA-MKILPLEKKSESLEHI
          +  L  L                +    W L  R+ L   E+E    L     +  +  +P+D    F   +S   F VK+ ++++  +K  ES +H+
Subjt:  KSMGGLDAL---------------DSEAHCWRLFPRQPLLYRELEAWNSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKA-MKILPLEKKSESLEHI

Query:  FISCIFTRNIWDRLLLGA-----TAQPIYSINEL-WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVAL
        F+ C  T  +W RL   A       + IY +  + +     + +  +L       ++  +W E+N R F+   RN   LWD I+ + +L
Subjt:  FISCIFTRNIWDRLLLGA-----TAQPIYSINEL-WPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVAL

A0A5D3C384 LINE-1 retrotransposable element ORF2 protein5.2e-3036.78Show/hide
Query:  IVNKSLNATYIALILKKSHCIRVSDFRPISFTT------------RAKQK---------------------DLISSCRFENELN--HLLFADDILLF---
        I+N+++N T IALI KK  C   +D+RPIS TT            R K+                      D I   R  N LN  HLLFADDILLF   
Subjt:  IVNKSLNATYIALILKKSHCIRVSDFRPISFTT------------RAKQK---------------------DLISSCRFENELN--HLLFADDILLF---

Query:  --SSTENI--------------------------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTL
           S  N+                                      W  +    P  YLG+PL G P S  F   I EK+ KKL +WK+S +SK   +TL
Subjt:  --SSTENI--------------------------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTL

Query:  LQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGL
        + + L +LPTY LS+F+A  SIYK IEK  RNFLW  S +    HLV+  +VTSPK  GGL
Subjt:  LQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDVVTSPKSMGGL

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein5.2e-3025.32Show/hide
Query:  ECSWLIGNLNWRPISEAQAFTLIQSFLQPE----------------------------------------------IVNKSLNATYIALILKKSHCIRVS
        E  WLI NL+W PIS +QA  L  SF + E                                              I+NK++N T IALI KK  C   +
Subjt:  ECSWLIGNLNWRPISEAQAFTLIQSFLQPE----------------------------------------------IVNKSLNATYIALILKKSHCIRVS

Query:  DFRPISFTT-------------------------------------------------RAKQ--------------KDLISSCRF------------ENE
        D+RPIS TT                                                 R K+              K  ISS ++            E+ 
Subjt:  DFRPISFTT-------------------------------------------------RAKQ--------------KDLISSCRF------------ENE

Query:  LNHLLFADDILLFSSTENI-----------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTI
        L +L    ++   +S  NI                       W  T    PI YLG+PLGG P + AF   I EKI KKL SWK+S +SK G +TL+++ 
Subjt:  LNHLLFADDILLFSSTENI-----------------------WSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTI

Query:  LGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDV--------------VTSPKSM--------------GGLDALDSEAHCWRLFPR
        L +LPTY LS+F+A VS YK IEK  RNF W+   +    HLV   +                SP S+                 D  ++    W L PR
Subjt:  LGNLPTYYLSLFQAHVSIYKEIEKLMRNFLWEGSEKNDASHLVQRDV--------------VTSPKSM--------------GGLDALDSEAHCWRLFPR

Query:  QPLLYRELEAW----NSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKAMK---------ILPLE---------------------------------
        + L   E   W    NSL + + +     +PT I      L+SDG++SV ++K         IL L+                                 
Subjt:  QPLLYRELEAW----NSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKAMK---------ILPLE---------------------------------

Query:  --------------------KKSESLEHIFISCIFTRNIWDRLLLGATAQPIYSINELWPQ--------LKTTTKDRILKSNIMTVILWCIWLEQNKRTF
                            +  E   H+FI C   ++IW+ +    ++    ++N L P+         K  TK  I+  N     LW IWLE+N R F
Subjt:  --------------------KKSESLEHIFISCIFTRNIWDRLLLGATAQPIYSINELWPQ--------LKTTTKDRILKSNIMTVILWCIWLEQNKRTF

Query:  QGIDRNHSHLWDDIILIVAL
         G ++  + LW+DI  +  L
Subjt:  QGIDRNHSHLWDDIILIVAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCCAACAGAAATAAAAATTCTGCCTACTTTCATAAAGTGTGTATGGCTCGTAGGCGATTCAACTTCATATGAAAGATACTGGATACCCAAGTCTGAATGTTCTTG
GCTAATTGGAAACTTAAATTGGAGGCCTATATCTGAAGCGCAAGCTTTTACTCTCATACAGTCCTTCTTACAGCCTGAAATTGTGAATAAATCTCTAAATGCCACATACA
TTGCCTTAATTCTAAAGAAGTCTCATTGCATTAGAGTATCTGACTTCCGCCCCATTAGTTTTACAACCAGAGCTAAACAAAAAGATCTCATCTCTAGTTGTCGCTTTGAA
AATGAACTCAACCATCTTCTATTTGCTGATGACATATTGCTTTTCTCTAGTACAGAGAACATCTGGAGCTACACACCAACCACACTTCCTATTTTTTACTTGGGTATGCC
ATTAGGGGGAAATCCCAAATCACATGCTTTCCGGCTCCCTATTGTTGAAAAGATTGATAAGAAATTAGACTCTTGGAAATTTTCATACATCTCTAAAAGAGGACATTTAA
CTCTGCTACAAACCATTCTTGGTAACCTTCCTACTTACTATCTATCGTTATTTCAAGCTCATGTCTCCATTTACAAAGAAATTGAGAAACTGATGAGGAATTTTCTTTGG
GAAGGTTCTGAGAAAAATGATGCATCACACCTTGTCCAACGGGATGTTGTTACCTCTCCTAAATCTATGGGTGGCTTGGATGCTTTGGACTCTGAAGCCCATTGTTGGAG
ATTATTCCCTCGTCAGCCTTTATTATATAGGGAATTAGAGGCATGGAACTCCCTTACTAGTGGTTGGAGACAGCCAACCATTCCAACCAATCCTACTGATATAGAAAAGA
TTTTCTACAATTTATCATCGGATGGTTATTTCTCAGTTAAAGCCATGAAAATTCTTCCTTTGGAGAAAAAATCAGAATCTTTAGAGCACATCTTCATCTCATGTATCTTT
ACACGAAATATTTGGGATCGCTTATTACTAGGGGCCACTGCACAGCCAATATACTCCATAAATGAGCTTTGGCCCCAACTCAAAACGACTACAAAAGATCGTATCTTAAA
GAGTAACATCATGACTGTCATTTTATGGTGTATTTGGCTCGAGCAGAATAAAAGAACTTTCCAAGGCATTGATAGAAATCACAGCCATCTCTGGGACGATATCATCTTGA
TTGTTGCCTTGAGGAAGCTCAGTGGGACTCGTGAGTTGAGAGCTAGGAGAGAGCTCGAGCTTGAGAACTCACGGGAGAGCCCAAGAGAGCTTAGAGAGAGCTCAACGGGA
CCCATGAGAGCTTATAGGAGCCTAGGAGAGAGAGCTCTAGAGGACCTGCTAGAGAGCCTAACCCGTGAGGTCATTCTGAGAAAGGAGGTCGACCTGACGAGTCGAGCTAA
GAAACGAGGTCAACCTAAGAAAGGAGGTCAACCTGATAACCAGATGTTAAAGGTTTACCCAAGAGTCAAACACATTTTATGGTTGAGGGGTGTAACGACTTTAACAATAA
TGACTAGATGGGTAACGGTTGAGGGAAGTCTGATAGTGGCTTGGAAGCTGAAGGGTCATGTCAGCTTTAGTTATGAAGAGCTCGACCTATATAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCCCAACAGAAATAAAAATTCTGCCTACTTTCATAAAGTGTGTATGGCTCGTAGGCGATTCAACTTCATATGAAAGATACTGGATACCCAAGTCTGAATGTTCTTG
GCTAATTGGAAACTTAAATTGGAGGCCTATATCTGAAGCGCAAGCTTTTACTCTCATACAGTCCTTCTTACAGCCTGAAATTGTGAATAAATCTCTAAATGCCACATACA
TTGCCTTAATTCTAAAGAAGTCTCATTGCATTAGAGTATCTGACTTCCGCCCCATTAGTTTTACAACCAGAGCTAAACAAAAAGATCTCATCTCTAGTTGTCGCTTTGAA
AATGAACTCAACCATCTTCTATTTGCTGATGACATATTGCTTTTCTCTAGTACAGAGAACATCTGGAGCTACACACCAACCACACTTCCTATTTTTTACTTGGGTATGCC
ATTAGGGGGAAATCCCAAATCACATGCTTTCCGGCTCCCTATTGTTGAAAAGATTGATAAGAAATTAGACTCTTGGAAATTTTCATACATCTCTAAAAGAGGACATTTAA
CTCTGCTACAAACCATTCTTGGTAACCTTCCTACTTACTATCTATCGTTATTTCAAGCTCATGTCTCCATTTACAAAGAAATTGAGAAACTGATGAGGAATTTTCTTTGG
GAAGGTTCTGAGAAAAATGATGCATCACACCTTGTCCAACGGGATGTTGTTACCTCTCCTAAATCTATGGGTGGCTTGGATGCTTTGGACTCTGAAGCCCATTGTTGGAG
ATTATTCCCTCGTCAGCCTTTATTATATAGGGAATTAGAGGCATGGAACTCCCTTACTAGTGGTTGGAGACAGCCAACCATTCCAACCAATCCTACTGATATAGAAAAGA
TTTTCTACAATTTATCATCGGATGGTTATTTCTCAGTTAAAGCCATGAAAATTCTTCCTTTGGAGAAAAAATCAGAATCTTTAGAGCACATCTTCATCTCATGTATCTTT
ACACGAAATATTTGGGATCGCTTATTACTAGGGGCCACTGCACAGCCAATATACTCCATAAATGAGCTTTGGCCCCAACTCAAAACGACTACAAAAGATCGTATCTTAAA
GAGTAACATCATGACTGTCATTTTATGGTGTATTTGGCTCGAGCAGAATAAAAGAACTTTCCAAGGCATTGATAGAAATCACAGCCATCTCTGGGACGATATCATCTTGA
TTGTTGCCTTGAGGAAGCTCAGTGGGACTCGTGAGTTGAGAGCTAGGAGAGAGCTCGAGCTTGAGAACTCACGGGAGAGCCCAAGAGAGCTTAGAGAGAGCTCAACGGGA
CCCATGAGAGCTTATAGGAGCCTAGGAGAGAGAGCTCTAGAGGACCTGCTAGAGAGCCTAACCCGTGAGGTCATTCTGAGAAAGGAGGTCGACCTGACGAGTCGAGCTAA
GAAACGAGGTCAACCTAAGAAAGGAGGTCAACCTGATAACCAGATGTTAAAGGTTTACCCAAGAGTCAAACACATTTTATGGTTGAGGGGTGTAACGACTTTAACAATAA
TGACTAGATGGGTAACGGTTGAGGGAAGTCTGATAGTGGCTTGGAAGCTGAAGGGTCATGTCAGCTTTAGTTATGAAGAGCTCGACCTATATAATCTTTAA
Protein sequenceShow/hide protein sequence
MTPTEIKILPTFIKCVWLVGDSTSYERYWIPKSECSWLIGNLNWRPISEAQAFTLIQSFLQPEIVNKSLNATYIALILKKSHCIRVSDFRPISFTTRAKQKDLISSCRFE
NELNHLLFADDILLFSSTENIWSYTPTTLPIFYLGMPLGGNPKSHAFRLPIVEKIDKKLDSWKFSYISKRGHLTLLQTILGNLPTYYLSLFQAHVSIYKEIEKLMRNFLW
EGSEKNDASHLVQRDVVTSPKSMGGLDALDSEAHCWRLFPRQPLLYRELEAWNSLTSGWRQPTIPTNPTDIEKIFYNLSSDGYFSVKAMKILPLEKKSESLEHIFISCIF
TRNIWDRLLLGATAQPIYSINELWPQLKTTTKDRILKSNIMTVILWCIWLEQNKRTFQGIDRNHSHLWDDIILIVALRKLSGTRELRARRELELENSRESPRELRESSTG
PMRAYRSLGERALEDLLESLTREVILRKEVDLTSRAKKRGQPKKGGQPDNQMLKVYPRVKHILWLRGVTTLTIMTRWVTVEGSLIVAWKLKGHVSFSYEELDLYNL