; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020230 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020230
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153450:275966..277608
RNA-Seq ExpressionSgr020230
SyntenySgr020230
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKU66732.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]3.4e-4739.93Show/hide
Query:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLF
        ++S  Y G D + + +G +I IAN GSGIL TPS   +L+ + H P I  NLLS+    +DN+    F+ + F  +D TT +I+ +G  ++GLY I    
Subjt:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLF

Query:  SISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWG
        S S++ QA  S +                    +WHNRLGHP+   + + +A  + +++IS +   C SC   K  K+ F  S +    PLALLHSDVWG
Subjt:  SISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWG

Query:  PSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
        PSP+    G RYYV+FIDDFS+YTWIFP+ +KSDV H+   F  F +N    K+K+ R+DGG E+ N +++ F
Subjt:  PSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

PKU68311.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]2.1e-4939.86Show/hide
Query:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS
        TS  Y G +KV + +G+++ IA++G G+L TPS    LS + HTP+I  NLLSV    +DNN   IFD   F  +D TT +IL +G   +GLY I     
Subjt:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS

Query:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP
             Q  H+  S             A      WHNRLGHPSL IL + ++  +  + IS  + +C+ C + K  K++FP+S++   APL LLHSDVWGP
Subjt:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP

Query:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQRVSYTKNLV-----PIHPSKMGL
        SP+    G RYYV F+DD+S++ W+FP+  KSDV ++   FV F +   + K+K  R+DGGGE+VN     FQ+ +  K ++     P  P + GL
Subjt:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQRVSYTKNLV-----PIHPSKMGL

PKU73682.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]5.1e-5141.61Show/hide
Query:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS
        TS  Y G DKV + +G+++ IA++G+G+L TPS    LS + HTP+I  NLLSV    +DNN   IFD   F  +D TT +IL +G   +GLY I     
Subjt:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS

Query:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP
         +  +  + ++TSS                   WHNRLGHPSL IL + ++  +  + IS  + +C+ C + K  K++FP+S++L  APL LLHSDVWGP
Subjt:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP

Query:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQR
        SP+    G RYYV F+DD+S++ W+FP+  KSDV ++   FV F +   + K+K  R+DGGGE+VN     F R
Subjt:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQR

PKU80502.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]5.3e-4841.42Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS
        YNG D V++ +G++I IA+ G GIL TP     LS I H P+I  NLLS+    +DN     FD + F  +D+T  Q+L  G   NGLY I +    +S+
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS

Query:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY
           + +  SS                  +WH RLGHP+  +L K++AS++ K+ I   T  C SCL  K  K+ F +S++++ APL L+HSDVWGPSP+ 
Subjt:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY

Query:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
           G RYY+  +DD+S++ W+FPL  KSDV +  K+FV F +     KLKV R+DGGGEFVN +   F
Subjt:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

XP_022158189.1 uncharacterized protein LOC111024722 [Momordica charantia]8.3e-7052.63Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS
        YNGE+ V + NGQ++ I++ GSGIL   SH F +S + H P +  NLLSVHK C DN+CIF++D D F  QDK T   L+KGKS NGLY IPS  ++SS+
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS

Query:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY
            H K  +            A     LWH+RLGH S  IL  AL++  + +  SF TC C+SCLKAKMSK+ FP+S S S APL  +HSDVWG SP+ 
Subjt:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY

Query:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSIS
           G RYYVS +D+FSK+TW+FP+ +KSDV  +L KFVPFA+N+L+SKLK   S+GGGEFVN+S+S
Subjt:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSIS

TrEMBL top hitse value%identityAlignment
A0A2I0VY30 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-4939.86Show/hide
Query:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS
        TS  Y G +KV + +G+++ IA++G G+L TPS    LS + HTP+I  NLLSV    +DNN   IFD   F  +D TT +IL +G   +GLY I     
Subjt:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS

Query:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP
             Q  H+  S             A      WHNRLGHPSL IL + ++  +  + IS  + +C+ C + K  K++FP+S++   APL LLHSDVWGP
Subjt:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP

Query:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQRVSYTKNLV-----PIHPSKMGL
        SP+    G RYYV F+DD+S++ W+FP+  KSDV ++   FV F +   + K+K  R+DGGGE+VN     FQ+ +  K ++     P  P + GL
Subjt:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQRVSYTKNLV-----PIHPSKMGL

A0A2I0WDH3 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-5141.61Show/hide
Query:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS
        TS  Y G DKV + +G+++ IA++G+G+L TPS    LS + HTP+I  NLLSV    +DNN   IFD   F  +D TT +IL +G   +GLY I     
Subjt:  TSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS

Query:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP
         +  +  + ++TSS                   WHNRLGHPSL IL + ++  +  + IS  + +C+ C + K  K++FP+S++L  APL LLHSDVWGP
Subjt:  ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGP

Query:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQR
        SP+    G RYYV F+DD+S++ W+FP+  KSDV ++   FV F +   + K+K  R+DGGGE+VN     F R
Subjt:  SPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQR

A0A2N9G2I0 Uncharacterized protein8.8e-4941.11Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS
        Y G D+VA+ NGQ+I I N G+G L T  + F+L ++ H+  I +NLLSVHK C+DNNC   FD + F  QD  +G++L+KG SENGLY I +      +
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS

Query:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTD--VKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSP
        + A+ S  +          F  +  +  LWH+RLGHPS  +L  AL S    + V        C  CL  KM K+ F  SK  S+ PL L+HSDVWGP+P
Subjt:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTD--VKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSP

Query:  IYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
        I  F G+RYY+ F+DD+++++W++ L +KSDV    K F    +N L+ ++K  R+D GGE+ +   + F
Subjt:  IYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

A0A2N9G2N5 Uncharacterized protein2.0e-4841.11Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS
        Y G D+VA+ NGQ+I I N G+G L T  + F+L ++ H+  I +NLLSVHK C+DNNC   FD + F  QD  +G++L+KG SENGLY I +      +
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS

Query:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTD--VKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSP
        + A+ S ++          F  +  +  LWH+RLGHPS  +L  AL S    + V        C  CL  KM K+ F  SK  S+ PL L+ SDVWGP+P
Subjt:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTD--VKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSP

Query:  IYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
        I  F G+RYY+ F+DD+++++W++ L +KSDV    K F    +N L+ K+K  R+D GGE+ +   + F
Subjt:  IYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

A0A6J1DYN6 uncharacterized protein LOC1110247224.0e-7052.63Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS
        YNGE+ V + NGQ++ I++ GSGIL   SH F +S + H P +  NLLSVHK C DN+CIF++D D F  QDK T   L+KGKS NGLY IPS  ++SS+
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFSISSS

Query:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY
            H K  +            A     LWH+RLGH S  IL  AL++  + +  SF TC C+SCLKAKMSK+ FP+S S S APL  +HSDVWG SP+ 
Subjt:  SQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIY

Query:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSIS
           G RYYVS +D+FSK+TW+FP+ +KSDV  +L KFVPFA+N+L+SKLK   S+GGGEFVN+S+S
Subjt:  FFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSIS

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-1438.41Show/hide
Query:  LWHNRLGHPSLLILNKALASTDVKVDISFATCT----CSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPL
        LWH R+GH S     K L     K  IS+A  T    C  CL  K  +V F  S       L L++SDV GP  I    G++Y+V+FIDD S+  W++ L
Subjt:  LWHNRLGHPSLLILNKALASTDVKVDISFATCT----CSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPL

Query:  FHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVN
          K  V  V +KF    +     KLK  RSD GGE+ +
Subjt:  FHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVN

Q07163 Transposon TyH3 Gag-Pol polyprotein1.5e-0522.22Show/hide
Query:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQ---DKTTGQIL--------FKGKS
        + S   N +  V  +  +NI I   G        +T     + HTP I  +LLS+       N +   D+ + F +   +++ G +L        F   S
Subjt:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQ---DKTTGQIL--------FKGKS

Query:  ENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDV----KVDISFATC---TCSSCLKAKMSKVRFPL
        +   YL+PS  S+ + +    S+++  + + F              H  L H +   +  +L +  +    + D+ +++     C  CL  K +K R   
Subjt:  ENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDV----KVDISFATC---TCSSCLKAKMSKVRFPL

Query:  SKSL----SSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSD--VHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
           L    S  P   LH+D++GP          Y++SF D+ +K+ W++PL  + +  +  V    + F KN   + + V + D G E+ N ++  F
Subjt:  SKSL----SSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSD--VHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein1.5e-0522.22Show/hide
Query:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQ---DKTTGQIL--------FKGKS
        + S   N +  V  +  +NI I   G        +T     + HTP I  +LLS+       N +   D+ + F +   +++ G +L        F   S
Subjt:  VTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQ---DKTTGQIL--------FKGKS

Query:  ENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDV----KVDISFATC---TCSSCLKAKMSKVRFPL
        +   YL+PS  S+ + +    S+++  + + F              H  L H +   +  +L +  +    + D+ +++     C  CL  K +K R   
Subjt:  ENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDV----KVDISFATC---TCSSCLKAKMSKVRFPL

Query:  SKSL----SSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSD--VHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF
           L    S  P   LH+D++GP          Y++SF D+ +K+ W++PL  + +  +  V    + F KN   + + V + D G E+ N ++  F
Subjt:  SKSL----SSAPLALLHSDVWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSD--VHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-3236.43Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIP-------S
        Y G D V +++G  I I++ GS  L T S    L  I + P I  NL+SV++ C  N     F   SF  +D  TG  L +GK+++ LY  P       S
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIP-------S

Query:  LFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKV-DISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSD
        LF+ S SS+A HS                       WH RLGHP+  ILN  +++  + V + S    +CS CL  K +KV F  S   S+ PL  ++SD
Subjt:  LFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKV-DISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSD

Query:  VWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFV
        VW  SPI     +RYYV F+D F++YTW++PL  KS V      F    +N   +++    SD GGEFV
Subjt:  VWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-3336.06Show/hide
Query:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIP-------S
        Y G D V +++G  I I + GS  L T S +  L+ + + P I  NL+SV++ C  N     F   SF  +D  TG  L +GK+++ LY  P       S
Subjt:  YNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIP-------S

Query:  LFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKV-DISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSD
        +F+ S  S+A HS                       WH+RLGHPSL ILN  +++  + V + S    +CS C   K  KV F  S   SS PL  ++SD
Subjt:  LFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKV-DISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSD

Query:  VWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFV
        VW  SPI     +RYYV F+D F++YTW++PL  KS V      F    +N   +++    SD GGEFV
Subjt:  VWGPSPIYFFYGHRYYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFV

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein6.5e-0424.79Show/hide
Query:  LFKGKSENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSL----LILNKALASTDVKVDISFATCTCSSCLKAKMSKVR
        + KG   + LY++        S+ A  +K                  E  LWH+RL H S     L++ K    +     + F    C  C+  K  +V 
Subjt:  LFKGKSENGLYLIPSLFSISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSL----LILNKALASTDVKVDISFATCTCSSCLKAKMSKVR

Query:  FPLSKSLSSAPLALLHSDVWG
        F   +  +  PL  +HSD+WG
Subjt:  FPLSKSLSSAPLALLHSDVWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGGGAATTGTGGACTTTAATCAAGAAGCGGTATTCATCTCTATCTCAGACCTGCATTCTTGAGTTATGCTCCGAGTTGTACTCGGCGAAGAAGAAGGCAAATGA
GTCAATCGATGCTTATATCCTGCGTATAAAAGAAATTATTGATAAACTTGTTGTGGTTTTGATTATAATAGAAGATGAAGAGGCGGTACCAATCGTGGTCAAGGAAACAA
CTTTAACAAAGGGCGTGGAAATTCTGACAAAGGTATATTTCCCCCTTCTGTTCATAGTTGCAGTTCTGATTCGACTAGCAATCCTTCACAGTCTCAGTTACAAGGCAATT
ATAGTGGTCGTATTTGCTGTCAAATTTGTCACAAGTCGGAAATACAATGGCGAAGACAAGGTTGCTATGAGTAATGGTCAAAATATACTTATTGCTAATGCAGGTAGTGG
TATTCTTCTAACTCCCTCTCATACTTTCCAACTTTCGACTATCTTTCACACTCCCACCATTTTGGCCAATCTTTTGTCTGTTCACAAATGTTGTCAGGATAACAATTGTA
TCTTCATATTTGATGTTGATTCATTCTTCAACCAGGACAAAACCACAGGGCAGATCTTGTTCAAGGGCAAGAGTGAAAATGGACTCTATTTGATTCCAAGTTTGTTCTCT
ATTTCTTCTAGCTCGCAGGCAGCACACTCCAAAACTTCTAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCAAATAAGGAGATTGTTCTTTGGCATAATCG
GTTAGGCCACCCATCTCTTCTTATTCTAAATAAAGCCTTGGCTAGTACTGATGTTAAAGTGGACATATCATTCGCCACTTGTACTTGTAGTAGTTGTCTTAAGGCAAAAA
TGTCTAAAGTACGTTTTCCTTTATCTAAGTCTTTATCTTCGGCTCCTTTGGCCTTATTGCACAGTGACGTATGGGGACCATCTCCCATTTATTTTTTTTATGGACATCGC
TACTATGTTAGCTTTATTGACGATTTTAGTAAATATACATGGATCTTTCCTTTGTTTCACAAATCTGATGTACACCATGTGCTTAAGAAATTTGTTCCATTTGCTAAAAA
TATGTTAACTTCTAAACTTAAGGTCTGTCGGTCTGATGGTGGTGGCGAGTTTGTAAACACCTCGATTTCTTTGTTTCAAAGGGTGTCTTACACCAAAAATCTTGTCCCTA
TACACCCGAGCAAAATGGGGTTGCTGAACGTAAACATAGGCATATTATTGAAACTACCTTGGCTTTTATGTATCACGCTTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGGGAATTGTGGACTTTAATCAAGAAGCGGTATTCATCTCTATCTCAGACCTGCATTCTTGAGTTATGCTCCGAGTTGTACTCGGCGAAGAAGAAGGCAAATGA
GTCAATCGATGCTTATATCCTGCGTATAAAAGAAATTATTGATAAACTTGTTGTGGTTTTGATTATAATAGAAGATGAAGAGGCGGTACCAATCGTGGTCAAGGAAACAA
CTTTAACAAAGGGCGTGGAAATTCTGACAAAGGTATATTTCCCCCTTCTGTTCATAGTTGCAGTTCTGATTCGACTAGCAATCCTTCACAGTCTCAGTTACAAGGCAATT
ATAGTGGTCGTATTTGCTGTCAAATTTGTCACAAGTCGGAAATACAATGGCGAAGACAAGGTTGCTATGAGTAATGGTCAAAATATACTTATTGCTAATGCAGGTAGTGG
TATTCTTCTAACTCCCTCTCATACTTTCCAACTTTCGACTATCTTTCACACTCCCACCATTTTGGCCAATCTTTTGTCTGTTCACAAATGTTGTCAGGATAACAATTGTA
TCTTCATATTTGATGTTGATTCATTCTTCAACCAGGACAAAACCACAGGGCAGATCTTGTTCAAGGGCAAGAGTGAAAATGGACTCTATTTGATTCCAAGTTTGTTCTCT
ATTTCTTCTAGCTCGCAGGCAGCACACTCCAAAACTTCTAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCAAATAAGGAGATTGTTCTTTGGCATAATCG
GTTAGGCCACCCATCTCTTCTTATTCTAAATAAAGCCTTGGCTAGTACTGATGTTAAAGTGGACATATCATTCGCCACTTGTACTTGTAGTAGTTGTCTTAAGGCAAAAA
TGTCTAAAGTACGTTTTCCTTTATCTAAGTCTTTATCTTCGGCTCCTTTGGCCTTATTGCACAGTGACGTATGGGGACCATCTCCCATTTATTTTTTTTATGGACATCGC
TACTATGTTAGCTTTATTGACGATTTTAGTAAATATACATGGATCTTTCCTTTGTTTCACAAATCTGATGTACACCATGTGCTTAAGAAATTTGTTCCATTTGCTAAAAA
TATGTTAACTTCTAAACTTAAGGTCTGTCGGTCTGATGGTGGTGGCGAGTTTGTAAACACCTCGATTTCTTTGTTTCAAAGGGTGTCTTACACCAAAAATCTTGTCCCTA
TACACCCGAGCAAAATGGGGTTGCTGAACGTAAACATAGGCATATTATTGAAACTACCTTGGCTTTTATGTATCACGCTTCCTTGA
Protein sequenceShow/hide protein sequence
MLRELWTLIKKRYSSLSQTCILELCSELYSAKKKANESIDAYILRIKEIIDKLVVVLIIIEDEEAVPIVVKETTLTKGVEILTKVYFPLLFIVAVLIRLAILHSLSYKAI
IVVVFAVKFVTSRKYNGEDKVAMSNGQNILIANAGSGILLTPSHTFQLSTIFHTPTILANLLSVHKCCQDNNCIFIFDVDSFFNQDKTTGQILFKGKSENGLYLIPSLFS
ISSSSQAAHSKTSSFFFFFFFFFFFFANKEIVLWHNRLGHPSLLILNKALASTDVKVDISFATCTCSSCLKAKMSKVRFPLSKSLSSAPLALLHSDVWGPSPIYFFYGHR
YYVSFIDDFSKYTWIFPLFHKSDVHHVLKKFVPFAKNMLTSKLKVCRSDGGGEFVNTSISLFQRVSYTKNLVPIHPSKMGLLNVNIGILLKLPWLLCITLP