; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G17285 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G17285
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionRNA-directed DNA polymerase
Genome locationctg27:2001413..2004232
RNA-Seq ExpressionCucsat.G17285
SyntenyCucsat.G17285
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]2.34e-7346.62Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VALKLK GAS WWDQI  NR+  GK PIRSW +M K+MK+RF+P ++EQ LY QYQ CRQ  RK AEY +EFHRL  RT   E E + I  FV GL+ +
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF
        + E++ LQP   L   I+     E  +E R K+  T+K  WE +    ++    N+K    +     ++E+ S   + P  +    K G   Y RP  G 
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF

Query:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY
        CYR  Q GH SNQCPQRKT+A  ++     + +    +EE   +E DEG+ LSC++QR+L++PK E   +RHSLF+TRCTI GK+CNVIIDSGSSE+ V 
Subjt:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY

Query:  SKLVQALKPQT
         KLV AL  +T
Subjt:  SKLVQALKPQT

XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]3.11e-7145.6Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA K++SGAS WWDQ++ N R +GK PIRSWPRML++M+ERFLP +FEQ+LYQ YQ+CRQ  + +A+Y + FHRL A+T   E+E+Y+I RFVDGL+E+
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL
        I +Q+D+QPI  L   I M  K E   +K++  +  ++  W+K  I  +    D  K  Q   TS +  + P    +S   +T + S+K G+  Y RP L
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL

Query:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLC-----NVIIDSG
        G C+R  Q  HLSN+CPQR+ +A V++    E +    +E++   +EPDEG+ LSCV+Q++L TPK E  P+R+SLFRT  TINGKL      +V  D  
Subjt:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLC-----NVIIDSG

Query:  SSESIVY
        +S SI Y
Subjt:  SSESIVY

XP_022138328.1 uncharacterized protein LOC111009540 isoform X2 [Momordica charantia]2.15e-7146.5Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA K++SGAS WWDQ++ N R +GK PIRSWPRML++M+ERFLP +FEQ+LYQ YQ+CRQ  + +A+Y + FHRL A+T   E+E+Y+I RFVDGL+E+
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL
        I +Q+D+QPI  L   I M  K E   +K++  +  ++  W+K  I  +    D  K  Q   TS +  + P    +S   +T + S+K G+  Y RP L
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL

Query:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGK
        G C+R  Q  HLSN+CPQR+ +A V++    E +    +E++   +EPDEG+ LSCV+Q++L TPK E  P+R+SLFRT  TINGK
Subjt:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGK

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.73e-7848.21Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VALKL++GAS WWDQ++ NR+  GK P+RSW +M K++K RFLP ++EQ LY QYQ CRQ  R VAEY +EFHRLSART  +E+E +Q+ RFV GL+ +
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF
        I E++ LQP   L   IS     E  +  R KN + +++ WE      +  +  +T     S  +K KE    +     + ++    +G  +Y RP+LG 
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF

Query:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY
        C+R  Q GHLS+ CPQRKT+A  EEGG Q  E    +EEE   +E D+GE++SCVIQR+L+TPK E + +RH LF+TRCTING++C+VIIDSGSSE+ V 
Subjt:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY

Query:  SKLVQAL
         KLV  L
Subjt:  SKLVQAL

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]2.61e-8348.86Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VALKL++GAS WWDQ++ NR+  GK P+RSW +M K++K RFLP ++EQ LY QYQ CRQ  R VAEY +EFHRLSART  +E+E +Q+ RFV GL+ +
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF
        I E++ LQP   L   IS     E  +  R KN + +++ WE      +  +  +T     S  +K KE    +     + ++    +G  NY RP+LG 
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF

Query:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY
        C+R  Q GHLSN CPQRKT+A  EEGG Q  E    +EEE   +E D+GE++SCVIQR+L+TPK E + +RH LF+TRCTING++C+VIIDSGSSE+ V 
Subjt:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY

Query:  SKLVQAL
         KLV  L
Subjt:  SKLVQAL

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein2.67e-6945.02Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA KLK GAS WWDQ+Q NRR  GK P+R+W +M ++++ERFLP D+EQ+LYQQYQ CRQ  R V+EY++EF+ LS+R    E+EN Q+ R+V GL+  
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNG---------ST
        I +QL+L+ I  L    S+     LK+E +Q     +     +++    R   +  KQ +G         QP K   +PR Q  S+KN          ST
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNG---------ST

Query:  N-YPRPNLGFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINEL----------EPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCT
        N Y RP  G C+R  Q GH SN+CP R+ V  V   G  ED +     EE  E           E DEGE +SCV+QR+LL PK E  P+RH++FRTRCT
Subjt:  N-YPRPNLGFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINEL----------EPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCT

Query:  INGKLCNVIIDSGSSESIVYSKLVQALKPQT
        IN K+C+VIIDSGSSE+IV   LV+AL+ +T
Subjt:  INGKLCNVIIDSGSSESIVYSKLVQALKPQT

A0A5D3DGR0 Reverse transcriptase1.13e-7346.62Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VALKLK GAS WWDQI  NR+  GK PIRSW +M K+MK+RF+P ++EQ LY QYQ CRQ  RK AEY +EFHRL  RT   E E + I  FV GL+ +
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF
        + E++ LQP   L   I+     E  +E R K+  T+K  WE +    ++    N+K    +     ++E+ S   + P  +    K G   Y RP  G 
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGF

Query:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY
        CYR  Q GH SNQCPQRKT+A  ++     + +    +EE   +E DEG+ LSC++QR+L++PK E   +RHSLF+TRCTI GK+CNVIIDSGSSE+ V 
Subjt:  CYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVY

Query:  SKLVQALKPQT
         KLV AL  +T
Subjt:  SKLVQALKPQT

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X11.51e-7145.6Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA K++SGAS WWDQ++ N R +GK PIRSWPRML++M+ERFLP +FEQ+LYQ YQ+CRQ  + +A+Y + FHRL A+T   E+E+Y+I RFVDGL+E+
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL
        I +Q+D+QPI  L   I M  K E   +K++  +  ++  W+K  I  +    D  K  Q   TS +  + P    +S   +T + S+K G+  Y RP L
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL

Query:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLC-----NVIIDSG
        G C+R  Q  HLSN+CPQR+ +A V++    E +    +E++   +EPDEG+ LSCV+Q++L TPK E  P+R+SLFRT  TINGKL      +V  D  
Subjt:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLC-----NVIIDSG

Query:  SSESIVY
        +S SI Y
Subjt:  SSESIVY

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X21.04e-7146.5Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA K++SGAS WWDQ++ N R +GK PIRSWPRML++M+ERFLP +FEQ+LYQ YQ+CRQ  + +A+Y + FHRL A+T   E+E+Y+I RFVDGL+E+
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL
        I +Q+D+QPI  L   I M  K E   +K++  +  ++  W+K  I  +    D  K  Q   TS +  + P    +S   +T + S+K G+  Y RP L
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSP--RTQELSTKNGSTNYPRPNL

Query:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGK
        G C+R  Q  HLSN+CPQR+ +A V++    E +    +E++   +EPDEG+ LSCV+Q++L TPK E  P+R+SLFRT  TINGK
Subjt:  GFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGK

A0A6P9EIQ8 uncharacterized protein LOC1089912421.56e-6941.59Show/hide
Query:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN
        +VA KL+ GAS WW+Q Q NRR  GK P+R W +M ++M+ RFLP D+EQ+LYQQYQ CRQ  R + EY +EF+RL++R   +E+E  Q+ R++ GL+  
Subjt:  MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKEN

Query:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKN----GSTNYPRP
        I +++ L  + TL   +++  K EL+L +    + +        F P  +     T+ +     S + + +     Q+P+    +T N    G+  Y RP
Subjt:  IHEQLDLQPIATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKN----GSTNYPRP

Query:  NLGFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSE
          G C+R NQ GH S +CP R++V  V+ G     E +  SEEE   +E DEG+ ++C+IQR+LLTPK E H +RH +F+TRCTIN K+CN+IIDSGS E
Subjt:  NLGFCYRRNQNGHLSNQCPQRKTVAYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSE

Query:  SIVYSKLVQALKPQT
        +IV   LV  LK  T
Subjt:  SIVYSKLVQALKPQT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGCGTTGAAGCTCAAATCGGGTGCATCGACTTGGTGGGATCAAATCCAAGCCAATCGACGTTTAATTGGCAAAACACCCATAAGGAGCTGGCCAAGAATGCTCAA
GATGATGAAAGAACGCTTCCTACCCACGGATTTCGAACAGATTCTTTATCAACAATACCAACAATGCCGCCAAGATAATAGGAAAGTGGCAGAATATGCTAAGGAGTTCC
ACCGTCTAAGTGCAAGAACCCAAACGAACGAGAGTGAAAATTATCAAATAGGCAGATTTGTTGATGGTCTTAAGGAGAACATACACGAACAGCTAGATTTACAGCCCATA
GCCACGTTACCGGCCACAATTTCGATGGTTTTTAAAGCTGAATTGAAGCTGGAAAAAAGGCAGAAAAACAGCGACACCAAGAAGAACCAGTGGGAGAAGGCATTCATCCC
GTACCAACGAAAGAATTATGACAATACTAAACAAGCTCAAGGCTCTGGTACATCAAAGGCGAAAGAAGAACAACCTTCCAAAACAAACCAAAGCCCAAGAACTCAAGAAC
TCTCTACCAAGAACGGTTCAACCAACTACCCAAGACCGAATTTGGGATTTTGCTACCGACGCAACCAAAACGGACACTTATCCAATCAGTGCCCACAACGGAAAACGGTA
GCATATGTTGAGGAAGGAGGGAGCCAAGAAGACGAAGCGAAACCTAATTCCGAAGAGGAAATAAACGAGTTGGAACCGGATGAAGGGGAACAACTATCTTGTGTGATACA
ACGAATTCTCCTAACACCAAAAACGGAAACTCACCCTCGACGACATTCATTGTTCCGTACACGCTGCACAATTAACGGTAAACTTTGCAATGTCATAATTGATAGCGGAA
GTAGCGAGAGCATTGTCTACTCAAAACTTGTTCAAGCGCTCAAACCTCAAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGCGTTGAAGCTCAAATCGGGTGCATCGACTTGGTGGGATCAAATCCAAGCCAATCGACGTTTAATTGGCAAAACACCCATAAGGAGCTGGCCAAGAATGCTCAA
GATGATGAAAGAACGCTTCCTACCCACGGATTTCGAACAGATTCTTTATCAACAATACCAACAATGCCGCCAAGATAATAGGAAAGTGGCAGAATATGCTAAGGAGTTCC
ACCGTCTAAGTGCAAGAACCCAAACGAACGAGAGTGAAAATTATCAAATAGGCAGATTTGTTGATGGTCTTAAGGAGAACATACACGAACAGCTAGATTTACAGCCCATA
GCCACGTTACCGGCCACAATTTCGATGGTTTTTAAAGCTGAATTGAAGCTGGAAAAAAGGCAGAAAAACAGCGACACCAAGAAGAACCAGTGGGAGAAGGCATTCATCCC
GTACCAACGAAAGAATTATGACAATACTAAACAAGCTCAAGGCTCTGGTACATCAAAGGCGAAAGAAGAACAACCTTCCAAAACAAACCAAAGCCCAAGAACTCAAGAAC
TCTCTACCAAGAACGGTTCAACCAACTACCCAAGACCGAATTTGGGATTTTGCTACCGACGCAACCAAAACGGACACTTATCCAATCAGTGCCCACAACGGAAAACGGTA
GCATATGTTGAGGAAGGAGGGAGCCAAGAAGACGAAGCGAAACCTAATTCCGAAGAGGAAATAAACGAGTTGGAACCGGATGAAGGGGAACAACTATCTTGTGTGATACA
ACGAATTCTCCTAACACCAAAAACGGAAACTCACCCTCGACGACATTCATTGTTCCGTACACGCTGCACAATTAACGGTAAACTTTGCAATGTCATAATTGATAGCGGAA
GTAGCGAGAGCATTGTCTACTCAAAACTTGTTCAAGCGCTCAAACCTCAAACTTGA
Protein sequenceShow/hide protein sequence
MVALKLKSGASTWWDQIQANRRLIGKTPIRSWPRMLKMMKERFLPTDFEQILYQQYQQCRQDNRKVAEYAKEFHRLSARTQTNESENYQIGRFVDGLKENIHEQLDLQPI
ATLPATISMVFKAELKLEKRQKNSDTKKNQWEKAFIPYQRKNYDNTKQAQGSGTSKAKEEQPSKTNQSPRTQELSTKNGSTNYPRPNLGFCYRRNQNGHLSNQCPQRKTV
AYVEEGGSQEDEAKPNSEEEINELEPDEGEQLSCVIQRILLTPKTETHPRRHSLFRTRCTINGKLCNVIIDSGSSESIVYSKLVQALKPQT