; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022190 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022190
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr7:20621183..20634987
RNA-Seq ExpressionLag0022190
SyntenyLag0022190
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015384470.1 uncharacterized protein LOC107176464 [Citrus sinensis]5.6e-14637.51Show/hide
Query:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------
        + +E   LL  GFIRE  YP W+SNVVLVKK+NGKWRMC+DFTDLNK+CPKDS+PLP +DQLVDATAGHEMLSFMDA+SGYNQI MY PDQ+KT      
Subjt:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------

Query:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK
                                  F    G  + + I D                       K    +  E+  F     + LG +V QRGIEAN DK
Subjt:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK

Query:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE
        I+A+L M SPS +K++Q L                             + +WTAECE+AF++LK YL  +PLL KP+ G     +    + A SSVL++E
Subjt:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE

Query:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA
         E  +QRP+YYTS+ MV AE RYP  E                              RQ+LQK + SG L++W++ELSE+DI +KPR+++K QA ADF+A
Subjt:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA

Query:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV
        E                 T A  E +     +   S S G  +  I+  P   +  YAL+F F+ASNNEAEYEA++AGL++++ +G   + ++SDSQL+V
Subjt:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV

Query:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG
         Q+                                       E D    +     T++  P+             E ++ G       WM P++ YL  G
Subjt:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG

Query:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP
        DLP +K EA++L+ + + Y L E  L++R                               G +SL  K +RQG++WP M +D K+  K+CD+CQRFA VP
Subjt:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP

Query:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------
          PPE LT + SP PFA WGIDLIGPLP G+GQ KY +V VDYFTKW EAE LA+ITERK TDF+WR+               G + D+           
Subjt:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------

Query:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS
                       Q E         LK +L+  KG W +ELP VL AY+TT   STRETPFSL+FG   V+  EIG+PS R   F E E +
Subjt:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS

XP_024035690.1 uncharacterized protein LOC112096473 [Citrus clementina]1.1e-14637.63Show/hide
Query:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------
        + +E   LL  GFIRE  YP+W+SNVVLVKK+NGKWRMC+DFTDLNK+CPKDS+PLP +DQLVDATAGHEMLSFMDA+SGYNQI MY PDQ+KT      
Subjt:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------

Query:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK
                                  F    G  + + I D                       K    +  E+  F     + LG +V QRGIEAN DK
Subjt:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK

Query:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE
        I+A+L M SPS +K++Q L                             + +WTAECE+AF++LK YL  +PLL KP+ G     +    + A SSVL++E
Subjt:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE

Query:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA
         E  +QRP+YYTS+ MV AE RYP  E                              RQ+LQK + SG L++W++ELSE+DI +KPR+++K QA ADF+A
Subjt:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA

Query:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV
        E                 T A  E +     +   S S G  +  I+  P   +  YAL+F F+ASNNEAEYEA++AGL++++ +G   + ++SDSQL+V
Subjt:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV

Query:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG
         Q+                                       E D    +     T++  P+             E ++ G       WM P++ YL  G
Subjt:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG

Query:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP
        DLP +K EA++L+ +A+ Y L E  L++R                               G +SL  K +RQG++WP M +D K+  K+CD+CQRFA VP
Subjt:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP

Query:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------
          PPE LT + SP PFA WGIDLIGPLP G+GQ KY +V VDYFTKW EAE LA+ITERK TDF+WR+               G + D+           
Subjt:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------

Query:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS
                       Q E         LK +L+  KG W +ELP VL AY+TT   STRETPFSL+FG   V+  EIG+PS R   F E E +
Subjt:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS

XP_024039511.1 uncharacterized protein LOC112098123 [Citrus clementina]2.1e-14538.66Show/hide
Query:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQE--------
        +++E + LL+ GFIREV YP+W+SNVVLVKKANGKWRMC+DFTDLNKACPKDS+PLP IDQLVD+TAGH +LSFMDA+SGYNQI MY  D+E        
Subjt:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQE--------

Query:  ------------------------KTFLSPTGGCIAISIPD---TAKVPNE------------------IEFEQVCFQSNFREILGVLVHQRGIEANLDK
                                K F    G  + + + D    +K+P E                  +  E+  F     + LG +V  RGIEAN +K
Subjt:  ------------------------KTFLSPTGGCIAISIPD---TAKVPNE------------------IEFEQVCFQSNFREILGVLVHQRGIEANLDK

Query:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKE
        I+AI++M+SP NLK+ Q L                             + EWT ECE+AF+ LK YL  APLL+ P+ G     + +    A SSVL++E
Subjt:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKE

Query:  EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAE
        E  +Q P+YYTSK ++ AETRYP +E                              RQ L KP+TSG L+KWA+ELSE+DI YKPR ++K QA ADFVAE
Subjt:  EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAE

Query:  LTPAKVEARPTRAELSLGSISWGMWSRNIVGI--------------PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQL-----
         T  + E    + +  +G+    +W  ++ G               P G    YA++  F  +NN+AEYEAL+AGL+LAR +    + +++DSQL     
Subjt:  LTPAKVEARPTRAELSLGSISWGMWSRNIVGI--------------PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQL-----

Query:  ----------------IVKQVVEVDEQVSV-------GDRARTEAKTPVAEADQ--------------------------EGGSWMDPLVKYLEKGDLPI
                        IV+Q++   E V V         RA   A+   A ADQ                          +  SW DP+V YL  G LP 
Subjt:  ----------------IVKQVVEVDEQVSV-------GDRARTEAKTPVAEADQ--------------------------EGGSWMDPLVKYLEKGDLPI

Query:  DKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPP
        DK  A++++ +AS Y + +G LY+R                               G RSL HK++RQGYFWP M QD +  T+ C  CQ FA    QPP
Subjt:  DKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPP

Query:  EPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRSRGSKQDHQTELKTKLKGLKGLWAEELPSVLCAYQTT
        E LT++ SP PFAQWGIDLIGPLP+G+G   + +V +DYFTKW E EAL+ ITE++ TDF+WR+ G+                  W +EL  VL AY+TT
Subjt:  EPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRSRGSKQDHQTELKTKLKGLKGLWAEELPSVLCAYQTT

Query:  ARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENE
         +T+T ETPF+L+FG + VV  EIG  + R + F+E E
Subjt:  ARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENE

XP_024045974.1 uncharacterized protein LOC112100756 [Citrus clementina]1.5e-14637.63Show/hide
Query:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------
        + +E   LL  GFIRE  YP W+SNVVLVKK+NGKWRMC+DFTDLNK+CPKDS+PLP +DQLVDATAGHEMLSFMDA+SGYNQI MY PDQ+KT      
Subjt:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT------

Query:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK
                                  F    G  + + I D                       K    +  E+  F     + LG +V QRGIEAN DK
Subjt:  --------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK

Query:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE
        I+A+L M SPS +K++Q L                             + +WTAECE+AF++LK YL  +PLL KP+ G     +    + A SSVL++E
Subjt:  IRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKE

Query:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA
         E  +QRP+YYTS+ MV AE RYP  E                              RQ+LQK + SG L++W++ELSE+DI +KPR+++K QA ADF+A
Subjt:  -EGNLQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVA

Query:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV
        E                 T A  E +     +   S S G  +  I+  P   +  YAL+F F+ASNNEAEYEA++AGL++++ +G   + ++SDSQL+V
Subjt:  EL----------------TPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIV

Query:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG
         Q+                                       E D    +     T++  P+             E ++ G       WM P++ YL  G
Subjt:  KQV--------------------------------------VEVDEQVSVGDRARTEAKTPVA------------EADQEGG-----SWMDPLVKYLEKG

Query:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP
        DLP +K EA++L+ +A+ Y L E  L++R                               G +SL  K +RQG++WP M +D K+  K+CD+CQRFA VP
Subjt:  DLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVP

Query:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------
          PPE LT + SP PFA WGIDLIGPLP G+GQ KY +V VDYFTKW EAE LA+ITERK TDF+WR+               G + D+           
Subjt:  RQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS--------------RGSKQDH-----------

Query:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS
                       Q E         LK +L+  KG W +ELP VL AY+TT   STRETPFSL+FG   V+  EIG+PS R   F E E +
Subjt:  ---------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFHENEGS

XP_024046767.1 uncharacterized protein LOC112101078 [Citrus clementina]4.6e-14835.98Show/hide
Query:  MQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELESVSL--TTEERRVNIGTKL----------------------------------
        M+   G +   P  ++E+     +  +DP + +DE  +RG P E+L SVS+  T   + V +G+ L                                  
Subjt:  MQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELESVSL--TTEERRVNIGTKL----------------------------------

Query:  ------------------GLSEERKD--------LLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLS
                            ++ER D        LL   FIRE  YP W+SNVVLVKK+NGKWRMC+DFTDLNK+CPKDS+PLP +DQLVDATAGHEMLS
Subjt:  ------------------GLSEERKD--------LLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLS

Query:  FMDAYSGYNQIKMYGPDQEKT--------------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFE
        FMDA+SGYNQI MY PDQEKT                                F    G  + + I D                       K    +  E
Subjt:  FMDAYSGYNQIKMYGPDQEKT--------------------------------FLSPTGGCIAISIPD---------------------TAKVPNEIEFE

Query:  QVCFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLL
        +  F     + LG +V QRGIEAN DKIRA+L+M SPS +K++Q L                             + +WT +CE+AF+ELK YL  APLL
Subjt:  QVCFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQRL-----------------------------RFEWTAECEKAFKELKAYLGFAPLL

Query:  TKPQPGTNCCSF-GSFKTAVSSVLIKEEGN-LQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKW
         KP+PG     +    + A SSVL++E+ N +QRP+YYTSK MV AE RYP  E                              RQ+LQK + SG L++W
Subjt:  TKPQPGTNCCSF-GSFKTAVSSVLIKEEGN-LQRPVYYTSKTMVGAETRYPQVE------------------------------RQVLQKPETSGCLMKW

Query:  AIELSEYDIHYKPRTSMKGQATADFVAELT----------------PAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYE
        ++ELSE+DI +K R+++K QA ADF+AE                  P   E +     +   S S G  +  I+  P   +  YAL+F F+ASNNEAEYE
Subjt:  AIELSEYDIHYKPRTSMKGQATADFVAELT----------------PAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYE

Query:  ALLAGLKLAREIGISSLLVQSDSQLIVKQV---------------------------VEVDE----QVSVGDRARTEAKTPVAEA---------------
        A++AGL++++ +G   + V+SDSQL+V Q+                           V+V+     + S  D     A   VA++               
Subjt:  ALLAGLKLAREIGISSLLVQSDSQLIVKQV---------------------------VEVDE----QVSVGDRARTEAKTPVAEA---------------

Query:  ---DQEGGS------WMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQG
             E GS      WM+P++++L+ GDLP DK+EA+RL+ +A+ Y L +  LY+R                               G +SL  K +RQG
Subjt:  ---DQEGGS------WMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYKR-------------------------------GERSLCHKIVRQG

Query:  YFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS-----
        Y+WP M +D K+  + CD+CQRFA VP  PPE LT + SP PFA WG+DLIGPLP GKGQ K+ +V VDYFTKWAEAE L +ITERK T+FIW++     
Subjt:  YFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRS-----

Query:  ---------RGSKQDH--------------------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVV
                  G + D+                          Q E         LK KL+  KG W +ELP VL AY+TT  TSTRETPFSL+FG   V+
Subjt:  ---------RGSKQDH--------------------------QTE---------LKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVV

Query:  LVEIGLPSLRVEQFHENEGS
          EIG+PS RVE F E E +
Subjt:  LVEIGLPSLRVEQFHENEGS

TrEMBL top hitse value%identityAlignment
A0A2N9EKM0 Ribonuclease H5.8e-15738.24Show/hide
Query:  QPAEELESVSLT--TEERRVNIGTKL---------------------------GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLN
        +P E+LE ++LT   E+R+  IGT +                            +  E   LL  GFIREV+YP+WL+NVV+VKK NGKWRMC+DFTDLN
Subjt:  QPAEELESVSLT--TEERRVNIGTKL---------------------------GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLN

Query:  KACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT--------------------------------FLSPTGGCIAISIPDTAKVP
        KACPKDSYPLP IDQLVD+TAGH++LSFMDA+SGYNQI+M   DQEKT                                F    G  + + + D     
Subjt:  KACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT--------------------------------FLSPTGGCIAISIPDTAKVP

Query:  NEIEF---------------------EQVCFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RL
         E E                      E+  F  +  + LG +V QRGIEAN DKI+AILEMS P+ +K++Q                           R 
Subjt:  NEIEF---------------------EQVCFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RL

Query:  RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE----------------------
         F+WT EC++AF+ELK YL   PLL+  + G     +     +AVS  LI+EE  +Q+PVYYTS+ + GAE RY  +E                      
Subjt:  RFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE----------------------

Query:  --------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWSRNIVGI-PRGRRFEYAL
                R+ + KP+ +G L++W+IE+SE+DI Y+PRT++K QA ADF+AE T P + E  P + E       GS +  M    I+ + P   +FEYA+
Subjt:  --------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWSRNIVGI-PRGRRFEYAL

Query:  RFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQVV--------EVDEQVSVGDR------------------ARTEAKTPVAEADQEGGS
        +  FRA+NNEAEYEALLAGLKL++++G+ +L V+SDSQL+V Q+          + + + + DR                   R   +     A ++  S
Subjt:  RFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQVV--------EVDEQVSVGDR------------------ARTEAKTPVAEADQEGGS

Query:  WMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQGYFWPMMLQDTKDFTK
        WM P+V+YL++G LP DK EA++L+ RASH+ L +G LYK                                G R+L HK+ R GY+WP +L D   + K
Subjt:  WMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQGYFWPMMLQDTKDFTK

Query:  ACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR-------------SRGSKQ--
         CD+CQRFA +PR PPE +T + SP PFAQWG+D++GP P G  Q K+ VV +DYFTKW EAE LATITE+ V +F+W+             S   KQ  
Subjt:  ACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR-------------SRGSKQ--

Query:  --------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFH
                      +H +                    ++KT+L+G KG+W EELPS+L AY+TT R  T ETPF L+FG + V+ VEIGL +LR   FH
Subjt:  --------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFH

Query:  ---ENEGSCR
           ENEG  R
Subjt:  ---ENEGSCR

A0A2N9EMY0 Ribonuclease H2.0e-15738.01Show/hide
Query:  SQKEVDPSLGRDEGDLRGQPAEELESVSLT--TEERRVNIGTKL---------------------------GLSEERKDLLKVGFIREVHYPQWLSNVVL
        S  +V  +L  +E     +P E+LE ++LT   E+R+  IGT +                            +  E   LL  GFIREV+YP+WL+NVV+
Subjt:  SQKEVDPSLGRDEGDLRGQPAEELESVSLT--TEERRVNIGTKL---------------------------GLSEERKDLLKVGFIREVHYPQWLSNVVL

Query:  VKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTGGCIAISI---------PDTAKVPNEIEF
        VKK NGKWRMC+DFTDLNKACPKDSYPLP IDQLVD+TAGH++LSFMDA+SGYNQI+M   DQEKT    + G     +             ++ N++  
Subjt:  VKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTGGCIAISI---------PDTAKVPNEIEF

Query:  EQV-------------------CFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RLRFEWTAE
        +Q+                    F  +  + LG +V QRGIEAN DKI+AILEMS P+ +K++Q                           R  F+WT E
Subjt:  EQV-------------------CFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RLRFEWTAE

Query:  CEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE-----------------------------
        C++AF+ELK YL   PLL+  + G     +     +AVSS LI+EE  +Q+PVYYTS+ + GAE RY  +E                             
Subjt:  CEKAFKELKAYLGFAPLLTKPQPGTNCCSF-GSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE-----------------------------

Query:  -RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWSRNIVGI-PRGRRFEYALRFNFRAS
         R+ + KP+ +G L++W+IE+SE+DI Y+PRT++K QA ADF+AE T P + E  P + E       GS +  M    I+ + P   +FEYA++  FRA+
Subjt:  -RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWSRNIVGI-PRGRRFEYALRFNFRAS

Query:  NNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV------------------------------VEVDEQVSV-GDR------------------AR
        NNEAEYEALLAGLKL++++G+ +L V+SDSQL+V Q+                              V++  + +V  DR                   R
Subjt:  NNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV------------------------------VEVDEQVSV-GDR------------------AR

Query:  TEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQ
          ++     A ++  SWM P+V+YL++G LP DK EA++L+ RASH+ L +G LYK                                G R+L HK+ R 
Subjt:  TEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQ

Query:  GYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR-----
        GY+WP +L D   + K CD+CQRFA +PR PPE +T + SP PFAQWG+D++GP P G  Q K+ VV +DYFTKW EAE LATITE+ V +F+W+     
Subjt:  GYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR-----

Query:  --------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVV
                S   KQ                +H +                    ++KT+L+G KG+W EELPS+L AY+TT R  T ETPF L+FG + V
Subjt:  --------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVV

Query:  VLVEIGLPSLRVEQFH---ENEGSCR
        + VEIGL +LR   FH   ENEG  R
Subjt:  VLVEIGLPSLRVEQFH---ENEGSCR

A0A2N9G8B9 Ribonuclease H3.2e-15541.33Show/hide
Query:  EERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTGGC
        EE   LLK GFIREV+YP+WL+NVV+VKK+ GKWRMC+DFTDLNKACPKDSYPLP IDQLVD+TAGH++LSFMDA+SGYNQI+M   DQEKT    + G 
Subjt:  EERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTGGC

Query:  IAISI---------PDTAKVPNEIEFEQV---------------------------CFQSNFR-----------------EILGVLVHQRGIEANLDKIR
            +             ++ N++  +Q+                            FQ+  R                 + LG +V QRGIEAN DKI+
Subjt:  IAISI---------PDTAKVPNEIEFEQV---------------------------CFQSNFR-----------------EILGVLVHQRGIEANLDKIR

Query:  AILEMSSPSNLKQLQRLRFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE-----
        AIL+MS P  +K+     F+WT EC+KAF+ELKAYL   PLL+  Q G     + +   +AVSS LI+EE  +Q+PVYYTS+ + GAE RY  +E     
Subjt:  AILEMSSPSNLKQLQRLRFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE-----

Query:  -------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWS
                                 R+ + KP+ +G L++W+IE+SE+ I Y+PRT++K QA ADF+AE T P K E      +       GS +  M  
Subjt:  -------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSL----GSISWGMWS

Query:  RNIVGI-PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQL----------------------IVKQVVEVDEQVSVGDRARTEA
          +V + P   +FEYAL+  FRA+NNEAEYEALLAGLKL++ +GI +L V+SDSQL                      +    VE+D  V +  R  TE 
Subjt:  RNIVGI-PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQL----------------------IVKQVVEVDEQVSVGDRARTEA

Query:  KT--PVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQG
        +T  P++       +WM P+++YL++G LP D+AEA +L+ RAS + L  G LYK                                G RSL HK+ R G
Subjt:  KT--PVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERSLCHKIVRQG

Query:  YFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR------
        Y+WP +L D   F KACD+CQRFA VPR PPE  T + SP PFAQWG+D++GP P G  Q K+ VV +DYFTKW EAE LA I+E+ V  F+W+      
Subjt:  YFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWR------

Query:  -------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVV
               S   KQ                +H +                    ++KT+L+G KG+W EELPSVL AY+TT RT T+ETPF L++G + V+
Subjt:  -------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFGAKVVV

Query:  LVEIGLPSLRVEQFH---ENEGSCR
         VEIGL +LR   FH   ENEG  R
Subjt:  LVEIGLPSLRVEQFH---ENEGSCR

A0A2N9H694 Ribonuclease H2.2e-15640.65Show/hide
Query:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTG
        ++EE   LL+ GFIREV+YP+WL+NVV+VKKA GKWRMC+DFTDLNKACPKDSYPLP IDQLVD+TAGH++LSFMDA+SGYNQI+M   DQEKT    + 
Subjt:  LSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTG

Query:  GCIAISIPDTAKVPNEIEFEQVCFQSNFREILGVLVH-----QRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RLRF
        G     +     +P  ++     +QS   E L V        +RGIEAN DKI+AILEMS P  +K++Q                           R  F
Subjt:  GCIAISIPDTAKVPNEIEFEQVCFQSNFREILGVLVH-----QRGIEANLDKIRAILEMSSPSNLKQLQ---------------------------RLRF

Query:  EWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE------------------------
        +WT EC++AF+ELKAYL   PLL+  Q G     + +   +AVSS LI+EE  +Q+ VYYTS+ + GAE RY  +E                        
Subjt:  EWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE------------------------

Query:  ------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSLGSISWGMWSRNIVGI------PRGRRFEYALR
              R+ + KP+ +G L++W+IE+ E+DI Y+PRT++K QA ADF+AE T P K + +P   E    SI  G  ++ + G       P G +FEYAL+
Subjt:  ------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSLGSISWGMWSRNIVGI------PRGRRFEYALR

Query:  FNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV---------------------------------------------------VEVDEQ
          FRA+NNEAEYEALLAGL+L++ +GI +L ++SDSQLIV QV                                                   +E+D  
Subjt:  FNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV---------------------------------------------------VEVDEQ

Query:  VSVGDRARTEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERS
        + +  R  TE +   + AD    +WM P+ +YL +G LP D+ EA +L+ RASH+ L  G LYK                                G RS
Subjt:  VSVGDRARTEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK-------------------------------RGERS

Query:  LCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDF
        L HK+ R GY+WP +L D   + KACD+CQRFA +PR PPE  T + SP PFAQWG+D++GP P G  Q K+ VV +DYFTKW EAE LA I+E+ V  F
Subjt:  LCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDF

Query:  IWR-------------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFS
        +W+             S   KQ                +H +                    ++KT+L+G KG+W EELPS+L AY+TT RT TRETPF 
Subjt:  IWR-------------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFS

Query:  LSFGAKVVVLVEIGLPSLRVEQFH---ENEGSCR
        L+FG + V+ VEIGL + R   FH   ENEG  R
Subjt:  LSFGAKVVVLVEIGLPSLRVEQFH---ENEGSCR

A0A2N9IJ69 Ribonuclease H1.4e-15539.84Show/hide
Query:  EERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT--------
        EE   LL+ GFIREV+YP+WL+NVV+VKK+ GKWRMC+DFTDLNKACPKDSYPLP IDQLVD+TAGH++LSFMDA+SGYNQI+M   DQEKT        
Subjt:  EERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT--------

Query:  FLSPTGGCIAISIPD----TAKVPNEIEFEQVCFQSNFR-----------------EILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------
        F    G  + + + D    + K  + +   +  FQ+  R                 + LG +V QRGIEAN DKI+AILEM+ P  +K++Q         
Subjt:  FLSPTGGCIAISIPD----TAKVPNEIEFEQVCFQSNFR-----------------EILGVLVHQRGIEANLDKIRAILEMSSPSNLKQLQ---------

Query:  ------------------RLRFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE--
                          R  F+WT EC++AF+ELKAYL   PLL+  Q G     + +   +AVSS LI+EE  +Q+PVYYTS+ + GAE RY  +E  
Subjt:  ------------------RLRFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFK-TAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVE--

Query:  ----------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSLGSISWGMWSR
                                    R+ + KP+ +G L++W+IE+ E+DI Y+PRT++K QA ADF+AE T P K + +P   E    SI  G  ++
Subjt:  ----------------------------RQVLQKPETSGCLMKWAIELSEYDIHYKPRTSMKGQATADFVAELT-PAKVEARPTRAELSLGSISWGMWSR

Query:  NIVGI------PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV-----------------------------------
         + G       P G +FEYAL+  FRA+NNEAEYEALLAGL+L++ +GI +L ++SDSQLIV QV                                   
Subjt:  NIVGI------PRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQV-----------------------------------

Query:  ----------------VEVDEQVSVGDRARTEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK--------------
                        +E+D  + +  R  TE +  +        +WM P+  YL +G LP D+ EA +L+ RASH+ L  G LYK              
Subjt:  ----------------VEVDEQVSVGDRARTEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYK--------------

Query:  -----------------RGERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVD
                          G RSL HK+ R GY+WP +L D   + KACD+CQRFA +PR PPE +T + SP PFAQWG+D++GP P G  Q K+ VV +D
Subjt:  -----------------RGERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTNVISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVD

Query:  YFTKWAEAEALATITERKVTDFIWR-------------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEE
        YFTKW EAE LA I+E+ V  F+W+             S   KQ                +H +                    ++KT+L+G KG+W EE
Subjt:  YFTKWAEAEALATITERKVTDFIWR-------------SRGSKQ----------------DHQT--------------------ELKTKLKGLKGLWAEE

Query:  LPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFH---ENEGSCR
        LPSVL AY+TT RT TRETPF L+FG + V+ VEIGL + R   FH   ENEG  R
Subjt:  LPSVLCAYQTTARTSTRETPFSLSFGAKVVVLVEIGLPSLRVEQFH---ENEGSCR

SwissProt top hitse value%identityAlignment
P0CT42 Transposon Tf2-7 polyprotein4.3e-0824.27Show/hide
Query:  LVRAELPPFWSVLKSFGSPSSTWFEPNRLPTPENPNMLTQASERVWQAPHRCAVSAGFAGHVFP----ASTKSLLVSREGQVSPLSGFWHQQLAPSVGKR
        +++AELP F   +            P  + T    N++T+ + R  + P R    +   G V+P      T  L +S  G +S  + F        V K+
Subjt:  LVRAELPPFWSVLKSFGSPSSTWFEPNRLPTPENPNMLTQASERVWQAPHRCAVSAGFAGHVFP----ASTKSLLVSREGQVSPLSGFWHQQLAPSVGKR

Query:  LASENCISIVVLFLCEV-MQESVGVLGRMQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELE-SVSLTTEERRVNI-------GTKL
         +    IS   L+   + +  S   L +M +       + +KE E  D+   KE          +   +P + LE  V LT E  R+ I       G   
Subjt:  LASENCISIVVLFLCEV-MQESVGVLGRMQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELE-SVSLTTEERRVNI-------GTKL

Query:  GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEK-TFLSP
         +++E    LK G IRE         V+ V K  G  RM +D+  LNK    + YPLP+I+QL+    G  + + +D  S Y+ I++   D+ K  F  P
Subjt:  GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEK-TFLSP

Query:  TGGCIAISIPDTAKVPNEIEFEQVCFQSNFREILG-------------VLVHQRGIEANLDKIRAILEMSSPSNL
         G      + +   +P  I      FQ     ILG             +L+H +    ++  ++ +L+    +NL
Subjt:  TGGCIAISIPDTAKVPNEIEFEQVCFQSNFREILG-------------VLVHQRGIEANLDKIRAILEMSSPSNL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.5e-1143.59Show/hide
Query:  SNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT-FLSPTG
        S VVLV K +G +R+C+D+  LNKA   D +PLP ID L+      ++ + +D +SGY+QI M   D+ KT F++P+G
Subjt:  SNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT-FLSPTG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.9e-0822.19Show/hide
Query:  DLLKVGFIREVHYPQ----WLSNVVLVKKANG--KWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTG
        +LL+ G IR  + P     W+  V    K NG  ++RM +DF  LN     D+YP+P I+  + +    +  + +D  SG++QI M   D  KT  S   
Subjt:  DLLKVGFIREVHYPQ----WLSNVVLVKKANG--KWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTG

Query:  G---------------------------------C--------------------IAISIPDTAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK
        G                                 C                    + + +   +K   ++  E+  F     E LG +V   GI+A+  K
Subjt:  G---------------------------------C--------------------IAISIPDTAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDK

Query:  IRAILEMSSPSNLKQLQR------------------------------------------LRFEWTAECEKAFKELKAYLGFA-----PLLTKPQPGTNC
        +RAI EM  P+++K+L+R                                          +  + TA   ++F +LK+ L  +     P  TKP   T  
Subjt:  IRAILEMSSPSNLKQLQR------------------------------------------LRFEWTAECEKAFKELKAYLGFA-----PLLTKPQPGTNC

Query:  CSFGSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVERQVL
         S      A+ +VL +++    RP+ Y S+++   E  Y  +E+++L
Subjt:  CSFGSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVERQVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.5e-1143.59Show/hide
Query:  SNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT-FLSPTG
        S VVLV K +G +R+C+D+  LNKA   D +PLP ID L+      ++ + +D +SGY+QI M   D+ KT F++P+G
Subjt:  SNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKT-FLSPTG

Q9UR07 Transposon Tf2-11 polyprotein4.3e-0824.27Show/hide
Query:  LVRAELPPFWSVLKSFGSPSSTWFEPNRLPTPENPNMLTQASERVWQAPHRCAVSAGFAGHVFP----ASTKSLLVSREGQVSPLSGFWHQQLAPSVGKR
        +++AELP F   +            P  + T    N++T+ + R  + P R    +   G V+P      T  L +S  G +S  + F        V K+
Subjt:  LVRAELPPFWSVLKSFGSPSSTWFEPNRLPTPENPNMLTQASERVWQAPHRCAVSAGFAGHVFP----ASTKSLLVSREGQVSPLSGFWHQQLAPSVGKR

Query:  LASENCISIVVLFLCEV-MQESVGVLGRMQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELE-SVSLTTEERRVNI-------GTKL
         +    IS   L+   + +  S   L +M +       + +KE E  D+   KE          +   +P + LE  V LT E  R+ I       G   
Subjt:  LASENCISIVVLFLCEV-MQESVGVLGRMQEKKGQTEVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELE-SVSLTTEERRVNI-------GTKL

Query:  GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEK-TFLSP
         +++E    LK G IRE         V+ V K  G  RM +D+  LNK    + YPLP+I+QL+    G  + + +D  S Y+ I++   D+ K  F  P
Subjt:  GLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDSYPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEK-TFLSP

Query:  TGGCIAISIPDTAKVPNEIEFEQVCFQSNFREILG-------------VLVHQRGIEANLDKIRAILEMSSPSNL
         G      + +   +P  I      FQ     ILG             +L+H +    ++  ++ +L+    +NL
Subjt:  TGGCIAISIPDTAKVPNEIEFEQVCFQSNFREILG-------------VLVHQRGIEANLDKIRAILEMSSPSNL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAGAAAAGGAAAGAAGGGGAAAAGAAAAAGGAAAAAAAAAGTTGTCGCCGGCCACTGGCCGACGAACGGCGGTCGCCAACGACTGGTGGTTGCCGGCGGCA
ACATTGGAATGATGGTGGTGATGGTGTTAAGAGAGGATTCGAGCCCCCTGGGCCTGAGGTGGATCCACCGCTACTCATGGGCATCTCTAATTTTGATGAGGCCGAAGGAA
TTGGGCCTTGGCCCAACCCCGCTCGCCCTCGGCCGACCCTCGGCCCGCTTGTGCGGGCCGAGCTTCCTCCCTTTTGGTCGGTCCTAAAGTCTTTTGGCTCCCCCAGTTCA
ACTTGGTTCGAACCGAATCGTCTTCCAACGCCCGAAAACCCTAATATGCTAACCCAGGCATCAGAGCGTGTGTGGCAAGCACCACACCGGTGTGCAGTTTCTGCTGGTTT
TGCAGGTCACGTCTTCCCAGCTTCTACAAAATCGCTGTTGGTGTCACGTGAAGGTCAGGTGAGTCCTCTGTCCGGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGA
AAAGACTGGCTAGTGAAAACTGTATATCGATCGTGGTTTTGTTTTTGTGTGAGGTGATGCAGGAATCTGTAGGGGTTCTTGGACGCATGCAGGAAAAGAAGGGACAGACC
GAGGTTGACCCTATGAAGGAAGAGGAGGACCACGACCTGAGCAGTCAGAAAGAGGTCGACCCTAGCCTCGGTCGCGACGAAGGTGATCTGAGAGGACAACCTGCAGAGGA
ATTAGAGTCTGTGTCACTAACAACTGAAGAAAGAAGAGTTAATATTGGCACCAAACTGGGGCTGAGCGAAGAAAGAAAGGACCTACTCAAGGTTGGGTTCATTAGAGAGG
TCCATTACCCCCAATGGTTGTCTAATGTGGTGTTGGTGAAGAAAGCAAATGGTAAATGGCGGATGTGCATTGATTTTACTGATTTGAATAAGGCCTGCCCCAAGGACAGT
TATCCCTTGCCGATGATCGATCAGTTGGTGGATGCAACAGCTGGGCATGAGATGTTGAGCTTCATGGACGCTTACTCGGGTTACAACCAGATCAAGATGTATGGGCCTGA
CCAGGAGAAAACCTTTTTATCACCAACAGGGGGTTGTATTGCTATAAGCATTCCAGATACTGCGAAAGTACCAAATGAAATTGAATTCGAGCAAGTGTGCTTTCAGAGTA
ACTTCAGGGAAATTCTTGGGGTTTTGGTGCATCAGAGGGGGATAGAAGCTAATCTAGACAAGATAAGGGCGATCTTAGAAATGTCCTCTCCATCCAATCTCAAGCAGCTT
CAAAGGTTGCGATTTGAGTGGACTGCAGAATGTGAGAAGGCCTTCAAGGAATTGAAAGCGTACTTGGGTTTTGCCCCGCTTTTAACTAAACCTCAACCGGGGACAAACTG
TTGCTCATTTGGCAGCTTCAAAACAGCTGTGAGTTCAGTGTTGATAAAGGAAGAGGGTAACCTTCAGCGACCGGTGTACTATACGAGTAAGACTATGGTTGGAGCAGAGA
CGAGATATCCCCAAGTGGAAAGGCAAGTATTGCAGAAACCCGAAACGTCAGGGTGTCTCATGAAGTGGGCAATTGAGTTGAGTGAATATGACATCCACTACAAACCGAGG
ACGTCAATGAAAGGGCAAGCGACTGCAGACTTTGTTGCAGAATTGACACCTGCCAAGGTCGAGGCGAGGCCGACCAGAGCAGAATTGAGTCTAGGATCAATTTCTTGGGG
GATGTGGAGCAGGAATATTGTTGGAATCCCCAGAGGGAGGAGATTTGAGTATGCGTTGAGATTCAATTTTAGAGCCTCAAACAATGAAGCAGAATATGAGGCTCTCCTTG
CAGGACTGAAGCTAGCCAGGGAAATTGGGATTTCAAGTCTTTTGGTTCAGAGTGATTCACAGTTAATTGTGAAGCAGGTAGTAGAGGTGGATGAGCAAGTTTCCGTGGGA
GATCGAGCCAGGACAGAAGCCAAAACCCCTGTGGCCGAGGCTGACCAAGAGGGGGGCTCGTGGATGGATCCATTAGTGAAATATCTGGAGAAAGGGGATCTACCTATAGA
CAAAGCTGAAGCCAAGAGGTTACAGAGGCGAGCATCACATTATGTGTTGAGAGAGGGTAGGTTGTATAAACGAGGGGAAAGATCTTTATGTCATAAGATCGTTAGACAAG
GCTACTTCTGGCCAATGATGTTACAAGATACCAAAGATTTTACGAAAGCTTGTGACCGATGTCAGAGATTTGCACCAGTCCCAAGGCAACCACCAGAGCCTTTGACCAAT
GTCATCAGTCCGTGCCCCTTTGCACAGTGGGGAATAGATCTTATTGGGCCTTTACCCGAGGGAAAAGGGCAGACCAAGTATACAGTGGTGGTAGTAGACTATTTTACAAA
ATGGGCAGAAGCTGAAGCGCTGGCGACTATCACAGAGAGAAAGGTCACTGATTTCATCTGGCGAAGTAGAGGCAGTAAACAAGATCATCAAACAGAATTGAAGACGAAAC
TCAAAGGTTTAAAAGGGTTGTGGGCCGAAGAACTTCCTAGTGTCTTGTGTGCATATCAGACTACAGCTCGGACCTCAACGAGAGAAACACCTTTTTCTCTCTCGTTTGGG
GCAAAGGTAGTGGTTCTGGTGGAGATTGGTTTACCCTCCCTTAGAGTGGAACAGTTCCACGAGAATGAGGGGTCATGCAGAACACCAAAGATCCCAAAACGGAGGTGTTG
GGACCAGCATGGGAAGGGCCCTATGAGGTTTACTACCAGTAGCTTCGACAGAAGTGGAGCACCATTGCCCCATCACAGGAAGGCCGAGGCCGACCAGGCCAAGGCCTACC
AAGCCGAGGTCGTGCAGGCCGAGGCTCTCCTATACTCCCTGAGGCAATCGACTTGTGGAAAACTGTCCTCGGTGGAGTGGTCCAGTCCTATACTCCCTGAGGTAATCGAC
TTGTGGAAAAATGTCCTTGGGAATGGAGCCAAGACTCCAGTCAGAGCCGCACAGATAGAAGAAGGGGCGGAGACATTAAGGCTTCGAGGACTTCAAACCCCAAAGGAAGT
TAGGGAGGTGGGATCCACTTCCCTAGGTTGTCTAGTGAGTAATGTGGTCGATGATTCATTGGCACCCATTCAAGAGCTTAGTTCTTTTGATCTTGCTAAGGATGAGCATT
TAGGGGTGAGGTACATAGACAGTGGAGAAGAATTGGAGAGTTGTAGCACCTTTCAAGAACATGTTTGTGAGGAAGAAAAAGAAAATGAGCTTACAGTGACAGAGGAAGTT
CAGGAGGTCTTGGTTCCAGTTTCTCCTTTGCTTGCAAGCTTCCAGTCAAGAAGTCGTCATCATTCCAGGACAGTAATTTTGGACCATCCCGGAATCCAAGGAGCAGTCGA
GGACAATGCGATTCGAGACCAAAGACACAACATGAAAACGAACCCTAAAAGGGAATTGGGCCTTGGCCCAACCCCGCTCGGCCTCGGCCCGAGGCCGAGCTTCCTCCCTT
CCGGTCGGTCCCTGAAGTCTTTTGGCTCCCCCGGTTCAACTTGGTTCGAACCGAATCGTCTTCCAGTGCCTGAAAACCCTAATATGCTAACCCAGGCATCAAAGCGTGTG
TGGCAAGCACCATACCAGTGTGCAGTTTCTGCTGGTTTTGCAGGTCACATCTTCCCAGCTTCTACAAATTCACTGTTTGTGTCACGTGAAGGTCAGTGCCACATCATCTG
CCACATCAGCAAATTTGACCGTTGGATATGTGACATCATCAATTTGACCGTTGACCGTCCTCGTCAGCCGCCACATCATCAAAGTGCCACGTGTTATGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGAAGAAAAGGAAAGAAGGGGAAAAGAAAAAGGAAAAAAAAAGTTGTCGCCGGCCACTGGCCGACGAACGGCGGTCGCCAACGACTGGTGGTTGCCGGCGGCA
ACATTGGAATGATGGTGGTGATGGTGTTAAGAGAGGATTCGAGCCCCCTGGGCCTGAGGTGGATCCACCGCTACTCATGGGCATCTCTAATTTTGATGAGGCCGAAGGAA
TTGGGCCTTGGCCCAACCCCGCTCGCCCTCGGCCGACCCTCGGCCCGCTTGTGCGGGCCGAGCTTCCTCCCTTTTGGTCGGTCCTAAAGTCTTTTGGCTCCCCCAGTTCA
ACTTGGTTCGAACCGAATCGTCTTCCAACGCCCGAAAACCCTAATATGCTAACCCAGGCATCAGAGCGTGTGTGGCAAGCACCACACCGGTGTGCAGTTTCTGCTGGTTT
TGCAGGTCACGTCTTCCCAGCTTCTACAAAATCGCTGTTGGTGTCACGTGAAGGTCAGGTGAGTCCTCTGTCCGGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGA
AAAGACTGGCTAGTGAAAACTGTATATCGATCGTGGTTTTGTTTTTGTGTGAGGTGATGCAGGAATCTGTAGGGGTTCTTGGACGCATGCAGGAAAAGAAGGGACAGACC
GAGGTTGACCCTATGAAGGAAGAGGAGGACCACGACCTGAGCAGTCAGAAAGAGGTCGACCCTAGCCTCGGTCGCGACGAAGGTGATCTGAGAGGACAACCTGCAGAGGA
ATTAGAGTCTGTGTCACTAACAACTGAAGAAAGAAGAGTTAATATTGGCACCAAACTGGGGCTGAGCGAAGAAAGAAAGGACCTACTCAAGGTTGGGTTCATTAGAGAGG
TCCATTACCCCCAATGGTTGTCTAATGTGGTGTTGGTGAAGAAAGCAAATGGTAAATGGCGGATGTGCATTGATTTTACTGATTTGAATAAGGCCTGCCCCAAGGACAGT
TATCCCTTGCCGATGATCGATCAGTTGGTGGATGCAACAGCTGGGCATGAGATGTTGAGCTTCATGGACGCTTACTCGGGTTACAACCAGATCAAGATGTATGGGCCTGA
CCAGGAGAAAACCTTTTTATCACCAACAGGGGGTTGTATTGCTATAAGCATTCCAGATACTGCGAAAGTACCAAATGAAATTGAATTCGAGCAAGTGTGCTTTCAGAGTA
ACTTCAGGGAAATTCTTGGGGTTTTGGTGCATCAGAGGGGGATAGAAGCTAATCTAGACAAGATAAGGGCGATCTTAGAAATGTCCTCTCCATCCAATCTCAAGCAGCTT
CAAAGGTTGCGATTTGAGTGGACTGCAGAATGTGAGAAGGCCTTCAAGGAATTGAAAGCGTACTTGGGTTTTGCCCCGCTTTTAACTAAACCTCAACCGGGGACAAACTG
TTGCTCATTTGGCAGCTTCAAAACAGCTGTGAGTTCAGTGTTGATAAAGGAAGAGGGTAACCTTCAGCGACCGGTGTACTATACGAGTAAGACTATGGTTGGAGCAGAGA
CGAGATATCCCCAAGTGGAAAGGCAAGTATTGCAGAAACCCGAAACGTCAGGGTGTCTCATGAAGTGGGCAATTGAGTTGAGTGAATATGACATCCACTACAAACCGAGG
ACGTCAATGAAAGGGCAAGCGACTGCAGACTTTGTTGCAGAATTGACACCTGCCAAGGTCGAGGCGAGGCCGACCAGAGCAGAATTGAGTCTAGGATCAATTTCTTGGGG
GATGTGGAGCAGGAATATTGTTGGAATCCCCAGAGGGAGGAGATTTGAGTATGCGTTGAGATTCAATTTTAGAGCCTCAAACAATGAAGCAGAATATGAGGCTCTCCTTG
CAGGACTGAAGCTAGCCAGGGAAATTGGGATTTCAAGTCTTTTGGTTCAGAGTGATTCACAGTTAATTGTGAAGCAGGTAGTAGAGGTGGATGAGCAAGTTTCCGTGGGA
GATCGAGCCAGGACAGAAGCCAAAACCCCTGTGGCCGAGGCTGACCAAGAGGGGGGCTCGTGGATGGATCCATTAGTGAAATATCTGGAGAAAGGGGATCTACCTATAGA
CAAAGCTGAAGCCAAGAGGTTACAGAGGCGAGCATCACATTATGTGTTGAGAGAGGGTAGGTTGTATAAACGAGGGGAAAGATCTTTATGTCATAAGATCGTTAGACAAG
GCTACTTCTGGCCAATGATGTTACAAGATACCAAAGATTTTACGAAAGCTTGTGACCGATGTCAGAGATTTGCACCAGTCCCAAGGCAACCACCAGAGCCTTTGACCAAT
GTCATCAGTCCGTGCCCCTTTGCACAGTGGGGAATAGATCTTATTGGGCCTTTACCCGAGGGAAAAGGGCAGACCAAGTATACAGTGGTGGTAGTAGACTATTTTACAAA
ATGGGCAGAAGCTGAAGCGCTGGCGACTATCACAGAGAGAAAGGTCACTGATTTCATCTGGCGAAGTAGAGGCAGTAAACAAGATCATCAAACAGAATTGAAGACGAAAC
TCAAAGGTTTAAAAGGGTTGTGGGCCGAAGAACTTCCTAGTGTCTTGTGTGCATATCAGACTACAGCTCGGACCTCAACGAGAGAAACACCTTTTTCTCTCTCGTTTGGG
GCAAAGGTAGTGGTTCTGGTGGAGATTGGTTTACCCTCCCTTAGAGTGGAACAGTTCCACGAGAATGAGGGGTCATGCAGAACACCAAAGATCCCAAAACGGAGGTGTTG
GGACCAGCATGGGAAGGGCCCTATGAGGTTTACTACCAGTAGCTTCGACAGAAGTGGAGCACCATTGCCCCATCACAGGAAGGCCGAGGCCGACCAGGCCAAGGCCTACC
AAGCCGAGGTCGTGCAGGCCGAGGCTCTCCTATACTCCCTGAGGCAATCGACTTGTGGAAAACTGTCCTCGGTGGAGTGGTCCAGTCCTATACTCCCTGAGGTAATCGAC
TTGTGGAAAAATGTCCTTGGGAATGGAGCCAAGACTCCAGTCAGAGCCGCACAGATAGAAGAAGGGGCGGAGACATTAAGGCTTCGAGGACTTCAAACCCCAAAGGAAGT
TAGGGAGGTGGGATCCACTTCCCTAGGTTGTCTAGTGAGTAATGTGGTCGATGATTCATTGGCACCCATTCAAGAGCTTAGTTCTTTTGATCTTGCTAAGGATGAGCATT
TAGGGGTGAGGTACATAGACAGTGGAGAAGAATTGGAGAGTTGTAGCACCTTTCAAGAACATGTTTGTGAGGAAGAAAAAGAAAATGAGCTTACAGTGACAGAGGAAGTT
CAGGAGGTCTTGGTTCCAGTTTCTCCTTTGCTTGCAAGCTTCCAGTCAAGAAGTCGTCATCATTCCAGGACAGTAATTTTGGACCATCCCGGAATCCAAGGAGCAGTCGA
GGACAATGCGATTCGAGACCAAAGACACAACATGAAAACGAACCCTAAAAGGGAATTGGGCCTTGGCCCAACCCCGCTCGGCCTCGGCCCGAGGCCGAGCTTCCTCCCTT
CCGGTCGGTCCCTGAAGTCTTTTGGCTCCCCCGGTTCAACTTGGTTCGAACCGAATCGTCTTCCAGTGCCTGAAAACCCTAATATGCTAACCCAGGCATCAAAGCGTGTG
TGGCAAGCACCATACCAGTGTGCAGTTTCTGCTGGTTTTGCAGGTCACATCTTCCCAGCTTCTACAAATTCACTGTTTGTGTCACGTGAAGGTCAGTGCCACATCATCTG
CCACATCAGCAAATTTGACCGTTGGATATGTGACATCATCAATTTGACCGTTGACCGTCCTCGTCAGCCGCCACATCATCAAAGTGCCACGTGTTATGCCTAG
Protein sequenceShow/hide protein sequence
MEKKKRKEGEKKKEKKSCRRPLADERRSPTTGGCRRQHWNDGGDGVKRGFEPPGPEVDPPLLMGISNFDEAEGIGPWPNPARPRPTLGPLVRAELPPFWSVLKSFGSPSS
TWFEPNRLPTPENPNMLTQASERVWQAPHRCAVSAGFAGHVFPASTKSLLVSREGQVSPLSGFWHQQLAPSVGKRLASENCISIVVLFLCEVMQESVGVLGRMQEKKGQT
EVDPMKEEEDHDLSSQKEVDPSLGRDEGDLRGQPAEELESVSLTTEERRVNIGTKLGLSEERKDLLKVGFIREVHYPQWLSNVVLVKKANGKWRMCIDFTDLNKACPKDS
YPLPMIDQLVDATAGHEMLSFMDAYSGYNQIKMYGPDQEKTFLSPTGGCIAISIPDTAKVPNEIEFEQVCFQSNFREILGVLVHQRGIEANLDKIRAILEMSSPSNLKQL
QRLRFEWTAECEKAFKELKAYLGFAPLLTKPQPGTNCCSFGSFKTAVSSVLIKEEGNLQRPVYYTSKTMVGAETRYPQVERQVLQKPETSGCLMKWAIELSEYDIHYKPR
TSMKGQATADFVAELTPAKVEARPTRAELSLGSISWGMWSRNIVGIPRGRRFEYALRFNFRASNNEAEYEALLAGLKLAREIGISSLLVQSDSQLIVKQVVEVDEQVSVG
DRARTEAKTPVAEADQEGGSWMDPLVKYLEKGDLPIDKAEAKRLQRRASHYVLREGRLYKRGERSLCHKIVRQGYFWPMMLQDTKDFTKACDRCQRFAPVPRQPPEPLTN
VISPCPFAQWGIDLIGPLPEGKGQTKYTVVVVDYFTKWAEAEALATITERKVTDFIWRSRGSKQDHQTELKTKLKGLKGLWAEELPSVLCAYQTTARTSTRETPFSLSFG
AKVVVLVEIGLPSLRVEQFHENEGSCRTPKIPKRRCWDQHGKGPMRFTTSSFDRSGAPLPHHRKAEADQAKAYQAEVVQAEALLYSLRQSTCGKLSSVEWSSPILPEVID
LWKNVLGNGAKTPVRAAQIEEGAETLRLRGLQTPKEVREVGSTSLGCLVSNVVDDSLAPIQELSSFDLAKDEHLGVRYIDSGEELESCSTFQEHVCEEEKENELTVTEEV
QEVLVPVSPLLASFQSRSRHHSRTVILDHPGIQGAVEDNAIRDQRHNMKTNPKRELGLGPTPLGLGPRPSFLPSGRSLKSFGSPGSTWFEPNRLPVPENPNMLTQASKRV
WQAPYQCAVSAGFAGHIFPASTNSLFVSREGQCHIICHISKFDRWICDIINLTVDRPRQPPHHQSATCYA