; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032974 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032974
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold5:6729864..6731368
RNA-Seq ExpressionSpg032974
SyntenySpg032974
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS29200.1 hypothetical protein Acr_00g0005780 [Actinidia rufa]2.0e-3134.32Show/hide
Query:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL
        G  DP+   +   + M   G ++   C+AF+ TL G AR WF K+   +I+SF EL+R F   F+  R +QK   +L TV Q   ESLK+++ RF+  +L
Subjt:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL

Query:  Q---------GTKIINAEELLKSKKETMTPESRKYSLYDQDRD--KDHNRKKRRTHD-----NDRGREDPMGKFKEYTPTSIPQEQILMEITNTDLPRHP
        +           K I AEEL ++K+     +  K    D  R+  ++  R KR   D     N R R  P        P + P  Q+L EI + +  + P
Subjt:  Q---------GTKIINAEELLKSKKETMTPESRKYSLYDQDRD--KDHNRKKRRTHD-----NDRGREDPMGKFKEYTPTSIPQEQILMEITNTDLPRHP

Query:  GKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP
        GK+KT  + R+++++C FHR+HGH TE+C QLK++I  LI+ G+L++++ D    RP      R  GD+ P
Subjt:  GKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP

MQL96670.1 hypothetical protein [Colocasia esculenta]2.0e-3136.61Show/hide
Query:  GEMDPLFTDNIMGAE--MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNE
        G  DP   D+I G E  M FHGA++A +CRAF  TL   AR WF  +P  SI SF++L ++F   FLG R Q +   +LL V+Q   E+L ++I RF  E
Subjt:  GEMDPLFTDNIMGAE--MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNE

Query:  VL---------------QGTK-------------IINAEELLKSKKETMTPESRKYSLYDQ----------DRDKDHNRKKRRTHDNDRGREDPMGKFKE
         L               QGT+                AE L  +++     ES   S  +Q          D   D +RKK R + +   R  P   F  
Subjt:  VL---------------QGTK-------------IINAEELLKSKKETMTPESRKYSLYDQ----------DRDKDHNRKKRRTHDNDRGREDPMGKFKE

Query:  YTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP
        YTP ++  EQIL EI N    R P +M++    RD++++C FHR+HGH T  C QLKDEIE LI+ G+L  FV  ++++RPR+    R R  ++P
Subjt:  YTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP

MQM14562.1 hypothetical protein [Colocasia esculenta]1.2e-3136.88Show/hide
Query:  GEMDPLFTDNIMGAE--MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNE
        G  DP   D+I G E  M FHGA++A +CRAF  TL   AR WF  +P  SI SF++L ++F   FLG R Q +   +LL V+Q   E+L +++ RF  E
Subjt:  GEMDPLFTDNIMGAE--MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNE

Query:  VL---------------QGTK-------------IINAEELLKSKKETMTPESRKYSLYDQ----------DRDKDHNRKKRRTHDNDRGREDPMGKFKE
         L               QGT+                AE L  +++     ES   S  +Q          D   D +RKK R + +   R  P   F  
Subjt:  VL---------------QGTK-------------IINAEELLKSKKETMTPESRKYSLYDQ----------DRDKDHNRKKRRTHDNDRGREDPMGKFKE

Query:  YTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPR
        YTP ++  EQIL EI N    R P +M++    RD++++C FHR+HGH T  C QLKDEIE LI+ G+L+ FV  ++++RPR
Subjt:  YTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPR

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.4e-4533.72Show/hide
Query:  SKDDQLCKDPKKGKGMADEEVGDSA-SVTSRM-------------PTLKDDWTHKEPEPSHKKVRRSSPSRPKQGTHVKINGREKFEAREKSEAEHSRGG
        S +  L +DPKKGKG  + +  +S  SV S++                K    HK P P+  K    S    +    + ++  +  +  E SE  HS   
Subjt:  SKDDQLCKDPKKGKGMADEEVGDSA-SVTSRM-------------PTLKDDWTHKEPEPSHKKVRRSSPSRPKQGTHVKINGREKFEAREKSEAEHSRGG

Query:  REQELYTWLKEEDSPYSSYKRTDN-------EDIEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFAT
        +  +L   L + DSP++     +          +++     DP+   +     MD +G +EA RCR F+ TL G AR WF ++ R SI SFK LARAF T
Subjt:  REQELYTWLKEEDSPYSSYKRTDN-------EDIEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFAT

Query:  QFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQGTKIINAEELL--------------------KSKKETMTPESRKYS---LYDQDRDKDHNR
        QF+G R + +P   LLT+KQ   ESL++Y+ RF+ E LQ   + +A  LL                     +  E ++   R  S    +   R+ D  R
Subjt:  QFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQGTKIINAEELL--------------------KSKKETMTPESRKYS---LYDQDRDKDHNR

Query:  ---KKRRTHDNDRG----------REDPMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRH
           K+ R+ D  +G          ++DP  KF++YTPT++P EQ+LMEI +  L + P +MK S   R K ++CLFHR+HGH T++C  LK+E+E LIR 
Subjt:  ---KKRRTHDNDRG----------REDPMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRH

Query:  GFLKEFVEDKSQKRPRQTRGGRGRGDDEPPLEIRTI
        G+LKE+VE+     P+ T+   G  D  P  EIRTI
Subjt:  GFLKEFVEDKSQKRPRQTRGGRGRGDDEPPLEIRTI

XP_030955724.1 uncharacterized protein LOC115977839 [Quercus lobata]4.8e-3335.19Show/hide
Query:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL
        G  DP          M   G  +   CRAF  TL G AR WF KIP  S+ SF+EL++ F   F+G +  ++   +LLT++QG  ESL+++ITRF+ E L
Subjt:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL

Query:  QGTKI------------INAE----ELLKSKKETMTPESRKYSLYDQDRDKDHNR-----KKRRTHD---NDRGREDPMGKFKEYTPTSIPQEQILMEIT
           ++            IN++    +L + + +TM    RK +   +     H+      KK RT D    D  +  P+G+ + YTP + P  Q+LM+I 
Subjt:  QGTKI------------INAE----ELLKSKKETMTPESRKYSLYDQDRDKDHNR-----KKRRTHD---NDRGREDPMGKFKEYTPTSIPQEQILMEIT

Query:  NTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEPPL-EIRTI
        +    + P KMK     R+K+++C FHR+HGH T+ C  LK +IE LIR G LK FV    + R  +   G+      PPL EIR I
Subjt:  NTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEPPL-EIRTI

TrEMBL top hitse value%identityAlignment
A0A2N9EL41 Reverse transcriptase7.5e-3234.8Show/hide
Query:  IEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRF
        +E   G  DP          M      E   CRAF L L G AR WF K+  +SI SF +L+RAF   F+G + + +P I+LL+VKQ   ESL+ ++ RF
Subjt:  IEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRF

Query:  SNEVLQGTKIINAEELLKSKKETM--TPESRKYSLYDQDRDKDHNRKKRRTHDNDRGRED-PMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLE
        + E +   KI   +E +    E M   P  R+ +  D+  +    +  + T   +R R   P  KF  +TP + P +++L++I +    R PGK+++   
Subjt:  SNEVLQGTKIINAEELLKSKKETM--TPESRKYSLYDQDRDKDHNRKKRRTHDNDRGRED-PMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLE

Query:  GRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDE---PPLEIRTI
         R K+ +C FHR+HGH TE+C+ LK+++ETLIR G L+++V   +  RP +    R + +     P  EIRTI
Subjt:  GRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDE---PPLEIRTI

A0A2N9F3B6 Ribonuclease H3.1e-3032.53Show/hide
Query:  MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQ--------------
        M      E   CRAF ++L G AR WF K+  +SI SF +L+RAF   F+G +++ +P  +LL+VKQ   +SL+ Y+ RF+ E +Q              
Subjt:  MDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQ--------------

Query:  --------------------------GTKIINAEELLKSKKETMTPESRKYSLYDQDRDKDHNRKKRRTHDNDRGR-EDPMGKFKEYTPTSIPQEQILME
                                    K +NAE+ L++  +   P+ RK  + D+ ++    +  + +   +R R   P+GKF  +TP + P E++LM+
Subjt:  --------------------------GTKIINAEELLKSKKETMTPESRKYSLYDQDRDKDHNRKKRRTHDNDRGR-EDPMGKFKEYTPTSIPQEQILME

Query:  ITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEPPL----EIRTI
        I +    R PGK+ +    R K+ +C FHR+HGH TE+C+ LK+++ETLIR G L+++V      RP + +G + R    P L    EIRTI
Subjt:  ITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEPPL----EIRTI

A0A2N9HAM0 Ribonuclease H4.9e-3134.07Show/hide
Query:  IEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRF
        +E   G  DP          M      E   CRAF L L G AR WF K+  +SI SF +L+RAF   F+G + + +P  +LL+VKQ   ESL+ ++ RF
Subjt:  IEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRF

Query:  SNEVLQGTKIINAEELLKSKKETM--TPESRKYSLYDQDRDKDHNRKKRRTHDNDRGRED-PMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLE
        + E ++  +    E++ +   E M   P  R+    D+  +    +  + T   +R R   P  KF  +TP + P +++L++I +    R PGK+++   
Subjt:  SNEVLQGTKIINAEELLKSKKETM--TPESRKYSLYDQDRDKDHNRKKRRTHDNDRGRED-PMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLE

Query:  GRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDE---PPLEIRTI
         R K+ +C FHR+HGH TE C+ LK++IETLIR G L+++V   +  RP +    R + +     P  EIRTI
Subjt:  GRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDE---PPLEIRTI

A0A6J1DWY0 uncharacterized protein LOC1110252937.0e-4633.72Show/hide
Query:  SKDDQLCKDPKKGKGMADEEVGDSA-SVTSRM-------------PTLKDDWTHKEPEPSHKKVRRSSPSRPKQGTHVKINGREKFEAREKSEAEHSRGG
        S +  L +DPKKGKG  + +  +S  SV S++                K    HK P P+  K    S    +    + ++  +  +  E SE  HS   
Subjt:  SKDDQLCKDPKKGKGMADEEVGDSA-SVTSRM-------------PTLKDDWTHKEPEPSHKKVRRSSPSRPKQGTHVKINGREKFEAREKSEAEHSRGG

Query:  REQELYTWLKEEDSPYSSYKRTDN-------EDIEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFAT
        +  +L   L + DSP++     +          +++     DP+   +     MD +G +EA RCR F+ TL G AR WF ++ R SI SFK LARAF T
Subjt:  REQELYTWLKEEDSPYSSYKRTDN-------EDIEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFAT

Query:  QFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQGTKIINAEELL--------------------KSKKETMTPESRKYS---LYDQDRDKDHNR
        QF+G R + +P   LLT+KQ   ESL++Y+ RF+ E LQ   + +A  LL                     +  E ++   R  S    +   R+ D  R
Subjt:  QFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQGTKIINAEELL--------------------KSKKETMTPESRKYS---LYDQDRDKDHNR

Query:  ---KKRRTHDNDRG----------REDPMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRH
           K+ R+ D  +G          ++DP  KF++YTPT++P EQ+LMEI +  L + P +MK S   R K ++CLFHR+HGH T++C  LK+E+E LIR 
Subjt:  ---KKRRTHDNDRG----------REDPMGKFKEYTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRH

Query:  GFLKEFVEDKSQKRPRQTRGGRGRGDDEPPLEIRTI
        G+LKE+VE+     P+ T+   G  D  P  EIRTI
Subjt:  GFLKEFVEDKSQKRPRQTRGGRGRGDDEPPLEIRTI

A0A7J0D7V6 Retrotrans_gag domain-containing protein9.8e-3234.32Show/hide
Query:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL
        G  DP+   +   + M   G ++   C+AF+ TL G AR WF K+   +I+SF EL+R F   F+  R +QK   +L TV Q   ESLK+++ RF+  +L
Subjt:  GEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSIESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVL

Query:  Q---------GTKIINAEELLKSKKETMTPESRKYSLYDQDRD--KDHNRKKRRTHD-----NDRGREDPMGKFKEYTPTSIPQEQILMEITNTDLPRHP
        +           K I AEEL ++K+     +  K    D  R+  ++  R KR   D     N R R  P        P + P  Q+L EI + +  + P
Subjt:  Q---------GTKIINAEELLKSKKETMTPESRKYSLYDQDRD--KDHNRKKRRTHD-----NDRGREDPMGKFKEYTPTSIPQEQILMEITNTDLPRHP

Query:  GKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP
        GK+KT  + R+++++C FHR+HGH TE+C QLK++I  LI+ G+L++++ D    RP      R  GD+ P
Subjt:  GKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCGCAACAGTGAAGCAGCCGTAGGAGAGGTATACCACCAGGCTCGACTACATGCCCAAGAAGCCGAAATAGCATCACTTAAAGGAAGGATGGATGATATGGGACA
GAATCTTATGGAGATTCTGAGTTTATTGAAGAAACCCGAGCACTTGGGGAGCAAGGATGACCAGCTGTGCAAGGATCCTAAAAAAGGAAAAGGAATGGCCGACGAGGAGG
TGGGAGATTCAGCAAGTGTCACGAGCAGGATGCCGACCTTAAAAGATGACTGGACTCATAAAGAACCCGAGCCTAGTCACAAGAAAGTTCGCAGAAGCTCGCCATCAAGG
CCAAAGCAAGGTACGCATGTTAAAATCAATGGCAGGGAAAAATTTGAGGCTCGGGAAAAGTCCGAGGCCGAGCATAGTCGAGGAGGGCGCGAACAAGAGCTGTACACATG
GTTGAAGGAGGAGGATAGTCCTTACAGCTCATACAAGAGGACAGATAATGAGGATATAGAAGAACTGCTAGGAGAGATGGATCCGCTCTTCACAGACAACATTATGGGGG
CTGAAATGGACTTCCACGGGGCCAATGAGGCAACCAGATGCCGAGCCTTCGCACTTACCCTAACAGGCTTGGCTAGACAATGGTTTGGCAAAATACCCAGGAAGTCCATA
GAGTCATTCAAAGAGTTGGCACGAGCCTTCGCCACACAGTTTTTGGGGGTTCGATATCAACAAAAGCCACAGATCAACCTGCTGACGGTTAAACAAGGACCGAGGGAGAG
CCTGAAGAACTATATCACCAGATTCAGTAACGAAGTCCTGCAGGGCACAAAAATCATAAATGCTGAAGAGTTGCTCAAGTCAAAAAAGGAAACAATGACACCAGAATCCA
GAAAATATTCGTTATATGATCAAGACAGAGACAAAGACCACAACCGCAAAAAACGGAGAACACATGACAATGATCGAGGGCGAGAAGACCCCATGGGTAAATTCAAAGAA
TACACCCCCACTTCCATTCCGCAGGAACAAATATTGATGGAGATTACAAATACAGATCTTCCGAGACATCCTGGAAAAATGAAAACAAGTCTAGAAGGAAGAGATAAAAG
TCAGTTTTGCCTTTTCCATAGGAACCACGGACACCCTACCGAGAATTGCATCCAGCTTAAAGATGAGATTGAAACGCTGATTCGTCATGGATTCCTCAAAGAGTTTGTTG
AAGACAAGAGCCAGAAAAGGCCGAGGCAGACCAGGGGTGGCCGAGGCCGAGGAGATGATGAACCTCCTTTAGAGATTAGAACCATCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCGCAACAGTGAAGCAGCCGTAGGAGAGGTATACCACCAGGCTCGACTACATGCCCAAGAAGCCGAAATAGCATCACTTAAAGGAAGGATGGATGATATGGGACA
GAATCTTATGGAGATTCTGAGTTTATTGAAGAAACCCGAGCACTTGGGGAGCAAGGATGACCAGCTGTGCAAGGATCCTAAAAAAGGAAAAGGAATGGCCGACGAGGAGG
TGGGAGATTCAGCAAGTGTCACGAGCAGGATGCCGACCTTAAAAGATGACTGGACTCATAAAGAACCCGAGCCTAGTCACAAGAAAGTTCGCAGAAGCTCGCCATCAAGG
CCAAAGCAAGGTACGCATGTTAAAATCAATGGCAGGGAAAAATTTGAGGCTCGGGAAAAGTCCGAGGCCGAGCATAGTCGAGGAGGGCGCGAACAAGAGCTGTACACATG
GTTGAAGGAGGAGGATAGTCCTTACAGCTCATACAAGAGGACAGATAATGAGGATATAGAAGAACTGCTAGGAGAGATGGATCCGCTCTTCACAGACAACATTATGGGGG
CTGAAATGGACTTCCACGGGGCCAATGAGGCAACCAGATGCCGAGCCTTCGCACTTACCCTAACAGGCTTGGCTAGACAATGGTTTGGCAAAATACCCAGGAAGTCCATA
GAGTCATTCAAAGAGTTGGCACGAGCCTTCGCCACACAGTTTTTGGGGGTTCGATATCAACAAAAGCCACAGATCAACCTGCTGACGGTTAAACAAGGACCGAGGGAGAG
CCTGAAGAACTATATCACCAGATTCAGTAACGAAGTCCTGCAGGGCACAAAAATCATAAATGCTGAAGAGTTGCTCAAGTCAAAAAAGGAAACAATGACACCAGAATCCA
GAAAATATTCGTTATATGATCAAGACAGAGACAAAGACCACAACCGCAAAAAACGGAGAACACATGACAATGATCGAGGGCGAGAAGACCCCATGGGTAAATTCAAAGAA
TACACCCCCACTTCCATTCCGCAGGAACAAATATTGATGGAGATTACAAATACAGATCTTCCGAGACATCCTGGAAAAATGAAAACAAGTCTAGAAGGAAGAGATAAAAG
TCAGTTTTGCCTTTTCCATAGGAACCACGGACACCCTACCGAGAATTGCATCCAGCTTAAAGATGAGATTGAAACGCTGATTCGTCATGGATTCCTCAAAGAGTTTGTTG
AAGACAAGAGCCAGAAAAGGCCGAGGCAGACCAGGGGTGGCCGAGGCCGAGGAGATGATGAACCTCCTTTAGAGATTAGAACCATCTTTTGA
Protein sequenceShow/hide protein sequence
MERNSEAAVGEVYHQARLHAQEAEIASLKGRMDDMGQNLMEILSLLKKPEHLGSKDDQLCKDPKKGKGMADEEVGDSASVTSRMPTLKDDWTHKEPEPSHKKVRRSSPSR
PKQGTHVKINGREKFEAREKSEAEHSRGGREQELYTWLKEEDSPYSSYKRTDNEDIEELLGEMDPLFTDNIMGAEMDFHGANEATRCRAFALTLTGLARQWFGKIPRKSI
ESFKELARAFATQFLGVRYQQKPQINLLTVKQGPRESLKNYITRFSNEVLQGTKIINAEELLKSKKETMTPESRKYSLYDQDRDKDHNRKKRRTHDNDRGREDPMGKFKE
YTPTSIPQEQILMEITNTDLPRHPGKMKTSLEGRDKSQFCLFHRNHGHPTENCIQLKDEIETLIRHGFLKEFVEDKSQKRPRQTRGGRGRGDDEPPLEIRTIF