; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011341 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011341
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr1:22178165..22179766
RNA-Seq ExpressionLag0011341
SyntenyLag0011341
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3453657.1 reverse transcriptase [Gossypium australe]7.6e-5934.66Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSK--W
        MKIL WNVRG+G PRT+R L+++ RR  P I+F+ ETK    ++E +RR   F     V + G  GGL++ W+E   L ++S+S  HID  + D+ +   
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSK--W

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGC-----------LTWEKKVRGSRVVKERLDRF
        WRFTG Y  P  N+RK+ W LL  L R ++  W++  D NEILF                      E C            TWE+       ++ERLDR 
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGC-----------LTWEKKVRGSRVVKERLDRF

Query:  LATNEIKNIFKSIDIRHLSKHNSDHKAIV----AVLEKRPQE-------------NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEI
        +A  E   +F    + HL    S+H  IV     + E R +E             N   +++       ++ I   ++E G ++   D + +L     E 
Subjt:  LATNEIKNIFKSIDIRHLSKHNSDHKAIV----AVLEKRPQE-------------NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEI

Query:  ELENLMEEEEIYWKNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLI
          E    +E      +++     I+ +    L A F +EE+  A+K + P KA  +DG  A+FYQ YW IVG  VT  CL+VLN   +I  +N T IVLI
Subjt:  ELENLMEEEEIYWKNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLI

Query:  PKKKNPTRFEKFRPINLCNLIYKIIAK
        PK+K+P    KFRPI+LCN+IYKII+K
Subjt:  PKKKNPTRFEKFRPINLCNLIYKIIAK

KAA3457419.1 reverse transcriptase [Gossypium australe]8.7e-5530.1Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTIT-DDSK-W
        MK++CWNVRG+G+PR ++ LRH  ++ NP+++F+ ETK    ++E+VRR+  F     V ++G  GGL++ W+E + + +RS+S  HID  +  DDS+  
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTIT-DDSK-W

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGCL-----------TWEKKVRGSRVVKERLDRF
        WRFTG YG+P    +  +W +L +LSR++N  W+V  D  EI++                      E C+           TWE+       ++ERLDR 
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGCL-----------TWEKKVRGSRVVKERLDRF

Query:  LATNEIKNIFKSIDIRHLSKHNSDHKAI------VAVLEKRPQENKKSW------------------------NKDMLKGSIQ---------SAIRRKES
        +A ++  ++F  + I+HL    SDH  +      V +L +  + + ++W                           +L+G ++           I++K +
Subjt:  LATNEIKNIFKSIDIRHLSKHNSDHKAI------VAVLEKRPQENKKSW------------------------NKDMLKGSIQ---------SAIRRKES

Query:  EIGE--IIAGSDVLKDLRLTKAEIELENLMEEEEIYWKN----------------------------------ILDGIQL----------KISEDQCRFL
        E  E  ++A  D     ++   +I L   +++EE+YW+                                   + DGI++          K   +    L
Subjt:  EIGE--IIAGSDVLKDLRLTKAEIELENLMEEEEIYWKN----------------------------------ILDGIQL----------KISEDQCRFL

Query:  DAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK
         +PF +EE+++ALKGM   KAP  DG   +F+Q YW IVG  V   CL VLNE  +I   N T IVLIPK   PT    FRPI+LC +IYK++ K
Subjt:  DAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK

KAA3460530.1 reverse transcriptase [Gossypium australe]1.0e-5533.41Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKW--
        MK L WNVRG+G+PR +R LR+  ++QNP ++F+ ETK    ++EKVR+         V +EG  GGL + W++D  +  RS+S  H+D  + +D     
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKW--

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLS------RDSNLHWIVDEDLNEILFEGC------LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKH
        WRFTGLYG+P    +  +WNLL +LS      RD     I  + L E    G        TWE+       ++ERLDR +A  + +N+F    I H    
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLS------RDSNLHWIVDEDLNEILFEGC------LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKH

Query:  NSDHKAIVA------------------------VLEKRPQENKKSWNKDML--------------------KGSIQSAIRRKESEIGEIIAGSDVLKDLR
         SDH  ++                           E   +E  +S ++ ++                    KG +Q  + ++   + +     D L+   
Subjt:  NSDHKAIVA------------------------VLEKRPQENKKSWNKDML--------------------KGSIQSAIRRKESEIGEIIAGSDVLKDLR

Query:  LTK------AEIELENLMEEE-EIYWKN------------ILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARV
        ++K       EI  E+ +EEE ++Y++N            IL+GI+  IS +    L APF +EE++ ALKGM P+K P  DG  A+F+Q YW IVG  V
Subjt:  LTK------AEIELENLMEEE-EIYWKN------------ILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARV

Query:  THVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK
           CL VLNE  +    NST IVLI K   PT    FRPI+LC ++YKI+AK
Subjt:  THVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK

XP_015895368.1 uncharacterized protein LOC107429231 [Ziziphus jujuba]1.3e-5834.98Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKWWR
        MK+L WN RG+ NPRT RA     + +NP+I+F+ ET     ++E++R  +     F V      GGLA+FW+ D  +Q++SYSVGHID+ I    K W 
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKWWR

Query:  F-TGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------FEGC-LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKHNS
        + TG YGNP  N+R   W LL +L   S+  W V  D NEIL          F G   TW     G   V+E LDR   + E + +F ++++RHL    S
Subjt:  F-TGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------FEGC-LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKHNS

Query:  DHKAIVAVLEKRPQEN---------KKSWNKDM----LKGSI--QSAIRRKESEIGEIIAGSDVLKDLRLTKAEIEL---ENLMEE-EEIYWKNILDGIQ
        DH  I+  L+   + N         +  W K++    + G++   ++  ++ + I  +  G  +  D  ++  ++ L   +NL      +  +++L G+ 
Subjt:  DHKAIVAVLEKRPQEN---------KKSWNKDM----LKGSI--QSAIRRKESEIGEIIAGSDVLKDLRLTKAEIEL---ENLMEE-EEIYWKNILDGIQ

Query:  LKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLI
         ++S++    L  PF  +E+K +L  M+P+KAP  DG  A+F+Q YW IVG ++T  CL+VLN + D+  +N T I LIPK K P +   +RPI+LC +I
Subjt:  LKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLI

Query:  YKIIAK
        YK+I+K
Subjt:  YKIIAK

XP_042958050.1 uncharacterized protein LOC122293561 [Carya illinoinensis]3.9e-5532.87Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDD--SKW
        M +L WN RG+GNPR++R L    + + P ++F+ ETKC   ++E VRR LK D  F V S G SGG+A+ W+E+  +Q+ SY+  H+   + ++   + 
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDD--SKW

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------------------FEGC-----------LTWEKKVRGSRVVKERLDRF
        W FTG YG+P T KRK  W LL+ L   S++ W+   D NE+L                       E C            TW    RG    KER+DR 
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------------------FEGC-----------LTWEKKVRGSRVVKERLDRF

Query:  LATNEIKNIFKSIDIR-HLSKHNSDHKAIVAVLEKRPQE-NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEIELENLMEEEEIYWK-
        +A  E + +F+       L  HN      +  + K P    KK   +D  K   ++   ++ S +     GS + +  +L K    +E+ +  E++ W+ 
Subjt:  LATNEIKNIFKSIDIR-HLSKHNSDHKAIVAVLEKRPQE-NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEIELENLMEEEEIYWK-

Query:  ------------NILDGIQLKISEDQCR----FLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIV
                     +L    L   ++Q R    +L  PF +EE++ A+  MNP  +P  DG  A FYQ +W++VG  V    LEVLN    +  +N T I 
Subjt:  ------------NILDGIQLKISEDQCR----FLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIV

Query:  LIPKKKNPTRFEKFRPINLCNLIYKIIAK
        LIP+ KNP R  +FRPI+LCN++YKI++K
Subjt:  LIPKKKNPTRFEKFRPINLCNLIYKIIAK

TrEMBL top hitse value%identityAlignment
A0A2N9HDH5 Uncharacterized protein8.5e-5629.17Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTI-TDDSKWW
        M I+ WN RG+GN R + AL +  + Q P I+F+ ETK  + K+E +R  L+F F F+VPS G+SGGLA+ W +D  L I+++S+ HID  +    +  W
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTI-TDDSKWW

Query:  RFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL--------------------------------FEGC-LTWEKKVRGSRVVKERLDRFL
        RFTG YGNP  ++R+  W LLDKL    +L W++  D NEIL                                + G   TWE        V++RLDR L
Subjt:  RFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL--------------------------------FEGC-LTWEKKVRGSRVVKERLDRFL

Query:  ATNEIKNIFKSIDIRHLSKHNSDHKAIV---------AVLEKRPQE--------------NKKSWNKDMLKGS--------------------------I
        A+N   ++F    I HL    SDH  I+         A  ++RP++               +K W++  ++GS                           
Subjt:  ATNEIKNIFKSIDIRHLSKHNSDHKAIV---------AVLEKRPQE--------------NKKSWNKDMLKGS--------------------------I

Query:  QSAIRRKESEIGEIIAGSDV-LKDLRLTKAEIELENLMEEEEIYW----------------------------KNILDG---------------------
        Q  IR K   +  ++AG+ +   ++ +   + E+ +L+  EE++W                            KN + G                     
Subjt:  QSAIRRKESEIGEIIAGSDV-LKDLRLTKAEIELENLMEEEEIYW----------------------------KNILDG---------------------

Query:  --------------------IQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL
                            ++  +SE+  + L  P+  EE++ AL  M+PSKAP  DG  + F+Q YW IVG  +T+  L +LN    +  +N T + L
Subjt:  --------------------IQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL

Query:  IPKKKNPTRFEKFRPINLCNLIYKIIAK
        IPKKKNP +   +RPI+LCN++YKII+K
Subjt:  IPKKKNPTRFEKFRPINLCNLIYKIIAK

A0A2N9HWG1 Reverse transcriptase domain-containing protein1.1e-5830.51Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTI-TDDSKWW
        M I+ WN RG+GN R + AL +  + Q P I+F+ ETK  + K+E +R  L+F F F+VPS G+SGGLA+ W +D  L I+++S+ HID  +    +  W
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTI-TDDSKWW

Query:  RFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL--------------------------------FEGC-LTWEKKVRGSRVVKERLDRFL
        RFTG YGNP  ++R++ W LLDKL    +L W++  D NEIL                                + G   TWE        +++RLDR L
Subjt:  RFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL--------------------------------FEGC-LTWEKKVRGSRVVKERLDRFL

Query:  ATNEIKNIFKSIDIRHLSKHNSDHKAIV---------AVLEKRPQE--------------NKKSWNKDMLKGS--------------------------I
        A+N   ++F    I HL    SDH  I+         A  ++RP++               +K W+++   GS                           
Subjt:  ATNEIKNIFKSIDIRHLSKHNSDHKAIV---------AVLEKRPQE--------------NKKSWNKDMLKGS--------------------------I

Query:  QSAIRRKESEIGEIIAGSDV-LKDLRLTKAEIELENLMEEEEIYWK------------------------------------NILDGIQLKISEDQCRFL
        Q  IR K   +  ++AG+ +   ++ +   + E+ +L+  EE++W+                                      L  ++  +SE+  + L
Subjt:  QSAIRRKESEIGEIIAGSDV-LKDLRLTKAEIELENLMEEEEIYWK------------------------------------NILDGIQLKISEDQCRFL

Query:  DAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK
          P+  EE++ AL  M+PSKAP  DG  + F+Q YW IVG  ++   L VLN    +  +N T + LIPKKKNP +   +RPI+LCN++YKII+K
Subjt:  DAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK

A0A5B6U9Z8 Reverse transcriptase3.7e-5934.66Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSK--W
        MKIL WNVRG+G PRT+R L+++ RR  P I+F+ ETK    ++E +RR   F     V + G  GGL++ W+E   L ++S+S  HID  + D+ +   
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSK--W

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGC-----------LTWEKKVRGSRVVKERLDRF
        WRFTG Y  P  N+RK+ W LL  L R ++  W++  D NEILF                      E C            TWE+       ++ERLDR 
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEILF----------------------EGC-----------LTWEKKVRGSRVVKERLDRF

Query:  LATNEIKNIFKSIDIRHLSKHNSDHKAIV----AVLEKRPQE-------------NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEI
        +A  E   +F    + HL    S+H  IV     + E R +E             N   +++       ++ I   ++E G ++   D + +L     E 
Subjt:  LATNEIKNIFKSIDIRHLSKHNSDHKAIV----AVLEKRPQE-------------NKKSWNKDMLKGSIQSAIRRKESEIGEIIAGSDVLKDLRLTKAEI

Query:  ELENLMEEEEIYWKNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLI
          E    +E      +++     I+ +    L A F +EE+  A+K + P KA  +DG  A+FYQ YW IVG  VT  CL+VLN   +I  +N T IVLI
Subjt:  ELENLMEEEEIYWKNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLI

Query:  PKKKNPTRFEKFRPINLCNLIYKIIAK
        PK+K+P    KFRPI+LCN+IYKII+K
Subjt:  PKKKNPTRFEKFRPINLCNLIYKIIAK

A0A5B6UR53 Reverse transcriptase5.0e-5633.41Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKW--
        MK L WNVRG+G+PR +R LR+  ++QNP ++F+ ETK    ++EKVR+         V +EG  GGL + W++D  +  RS+S  H+D  + +D     
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKW--

Query:  WRFTGLYGNPSTNKRKDLWNLLDKLS------RDSNLHWIVDEDLNEILFEGC------LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKH
        WRFTGLYG+P    +  +WNLL +LS      RD     I  + L E    G        TWE+       ++ERLDR +A  + +N+F    I H    
Subjt:  WRFTGLYGNPSTNKRKDLWNLLDKLS------RDSNLHWIVDEDLNEILFEGC------LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKH

Query:  NSDHKAIVA------------------------VLEKRPQENKKSWNKDML--------------------KGSIQSAIRRKESEIGEIIAGSDVLKDLR
         SDH  ++                           E   +E  +S ++ ++                    KG +Q  + ++   + +     D L+   
Subjt:  NSDHKAIVA------------------------VLEKRPQENKKSWNKDML--------------------KGSIQSAIRRKESEIGEIIAGSDVLKDLR

Query:  LTK------AEIELENLMEEE-EIYWKN------------ILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARV
        ++K       EI  E+ +EEE ++Y++N            IL+GI+  IS +    L APF +EE++ ALKGM P+K P  DG  A+F+Q YW IVG  V
Subjt:  LTK------AEIELENLMEEE-EIYWKN------------ILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARV

Query:  THVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK
           CL VLNE  +    NST IVLI K   PT    FRPI+LC ++YKI+AK
Subjt:  THVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAK

A0A6P4B957 uncharacterized protein LOC1074292316.3e-5934.98Show/hide
Query:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKWWR
        MK+L WN RG+ NPRT RA     + +NP+I+F+ ET     ++E++R  +     F V      GGLA+FW+ D  +Q++SYSVGHID+ I    K W 
Subjt:  MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKWWR

Query:  F-TGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------FEGC-LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKHNS
        + TG YGNP  N+R   W LL +L   S+  W V  D NEIL          F G   TW     G   V+E LDR   + E + +F ++++RHL    S
Subjt:  F-TGLYGNPSTNKRKDLWNLLDKLSRDSNLHWIVDEDLNEIL----------FEGC-LTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKHNS

Query:  DHKAIVAVLEKRPQEN---------KKSWNKDM----LKGSI--QSAIRRKESEIGEIIAGSDVLKDLRLTKAEIEL---ENLMEE-EEIYWKNILDGIQ
        DH  I+  L+   + N         +  W K++    + G++   ++  ++ + I  +  G  +  D  ++  ++ L   +NL      +  +++L G+ 
Subjt:  DHKAIVAVLEKRPQEN---------KKSWNKDM----LKGSI--QSAIRRKESEIGEIIAGSDVLKDLRLTKAEIEL---ENLMEE-EEIYWKNILDGIQ

Query:  LKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLI
         ++S++    L  PF  +E+K +L  M+P+KAP  DG  A+F+Q YW IVG ++T  CL+VLN + D+  +N T I LIPK K P +   +RPI+LC +I
Subjt:  LKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLI

Query:  YKIIAK
        YK+I+K
Subjt:  YKIIAK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-0529.46Show/hide
Query:  ELENLMEEEEIYWKNILDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL
        +LENL E +       LD   L ++++++   L+ P    EI   +  +   K+P  DG  A FYQ Y + +   +  +   +  E           I+L
Subjt:  ELENLMEEEEIYWKNILDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL

Query:  IPKK-KNPTRFEKFRPINLCNLIYKIIAK
        IPK  ++ T+ E FRPI+L N+  KI+ K
Subjt:  IPKK-KNPTRFEKFRPINLCNLIYKIIAK

P08548 LINE-1 reverse transcriptase homolog7.8e-0626.35Show/hide
Query:  RRKESEIGEIIAGSDVLKDLRLTKAEIELENLMEE--EEIY---WKNI------LDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHA
        +R +S I  I  G+D      +T    E++ ++ E  +++Y   ++N+      L+   L ++S+ +   L+ P    EI + ++ +   K+P  DG  +
Subjt:  RRKESEIGEIIAGSDVLKDLRLTKAEIELENLMEE--EEIY---WKNI------LDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHA

Query:  MFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKK-KNPTRFEKFRPINLCNLIYKIIAK
         FYQ + + +   + ++   +  E           I LIPK  K+PTR E +RPI+L N+  KI+ K
Subjt:  MFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKK-KNPTRFEKFRPINLCNLIYKIIAK

P11369 LINE-1 retrotransposable element ORF2 protein1.3e-0833.33Show/hide
Query:  ELENLMEEEEIYWKNILDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL
        +LENL E ++      LD  Q+ K+++DQ   L++P   +EI+  +  +   K+P  DG  A FYQ + + +   +  +  ++  E T         I L
Subjt:  ELENLMEEEEIYWKNILDGIQL-KISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVL

Query:  IPK-KKNPTRFEKFRPINLCNLIYKIIAK
        IPK +K+PT+ E FRPI+L N+  KI+ K
Subjt:  IPK-KKNPTRFEKFRPINLCNLIYKIIAK

P14381 Transposon TX1 uncharacterized 149 kDa protein8.3e-0831.62Show/hide
Query:  KNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIV--LIPKKKNPTRFEK
        + + DG+ + +SE +   L+ P   +E+  AL+ M  +K+P  DG    F+Q +WD +G     V  E   +     PL+  + V  L+PKK +    + 
Subjt:  KNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIV--LIPKKKNPTRFEK

Query:  FRPINLCNLIYKIIAKA
        +RP++L +  YKI+AKA
Subjt:  FRPINLCNLIYKIIAKA

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.6e-0632.18Show/hide
Query:  EEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKII
        +EI  A+  M  +KAP  D   A F+   W +V         E       +   N+T I LIPK     +   FRP++ C ++YKII
Subjt:  EEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEVLNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATATTATGCTGGAACGTTCGAGGGATGGGGAACCCTCGAACAATCCGAGCCCTCCGACATGAGACCAGGAGACAAAACCCCAACATTATTTTCATTTCGGAGAC
TAAGTGTGGTCTTAACAAGGTGGAGAAAGTGAGAAGGGATCTCAAGTTTGATTTTGCGTTCAGTGTCCCCAGTGAAGGCAAGAGTGGTGGGTTGGCTATTTTTTGGCAGG
AGGATTCCACTTTGCAAATTAGATCGTATTCGGTAGGCCATATAGATACGACTATCACAGATGATTCTAAGTGGTGGAGATTCACAGGGTTATATGGAAACCCAAGTACT
AACAAAAGAAAGGACTTGTGGAACCTTCTGGATAAGCTTAGCAGGGACTCCAACCTCCATTGGATTGTCGACGAAGATTTAAACGAAATTCTCTTCGAAGGTTGTTTAAC
TTGGGAGAAGAAGGTTAGAGGCTCGAGGGTGGTTAAAGAAAGGTTGGATAGATTCCTAGCCACAAACGAGATCAAGAACATCTTTAAAAGCATCGACATTCGGCATTTAT
CCAAGCACAACTCAGATCACAAGGCCATTGTGGCAGTTCTGGAGAAAAGGCCGCAAGAGAACAAGAAAAGTTGGAACAAAGATATGCTTAAAGGATCGATCCAATCAGCC
ATCCGCAGGAAAGAGAGTGAGATTGGTGAGATCATCGCGGGAAGTGACGTCCTCAAAGATCTGAGACTTACCAAGGCAGAAATTGAGCTAGAAAACCTCATGGAGGAAGA
AGAAATCTATTGGAAGAACATTTTGGATGGTATCCAGTTGAAAATCTCAGAGGACCAGTGTAGATTTCTCGATGCCCCGTTTGTTAAAGAGGAGATTAAAACAGCTCTTA
AAGGAATGAACCCTAGCAAAGCCCCTGAGGAGGATGGAGCCCATGCCATGTTCTATCAAAACTATTGGGATATAGTGGGAGCAAGGGTAACTCATGTTTGTTTGGAGGTC
TTGAATGAGGACACGGATATTGGCCCGCTTAACAGTACCAAGATTGTTCTTATCCCGAAAAAAAAGAACCCTACCAGATTTGAGAAGTTTAGGCCTATAAACTTGTGCAA
TTTGATTTACAAGATTATTGCAAAAGCCTGGCTAACAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATATTATGCTGGAACGTTCGAGGGATGGGGAACCCTCGAACAATCCGAGCCCTCCGACATGAGACCAGGAGACAAAACCCCAACATTATTTTCATTTCGGAGAC
TAAGTGTGGTCTTAACAAGGTGGAGAAAGTGAGAAGGGATCTCAAGTTTGATTTTGCGTTCAGTGTCCCCAGTGAAGGCAAGAGTGGTGGGTTGGCTATTTTTTGGCAGG
AGGATTCCACTTTGCAAATTAGATCGTATTCGGTAGGCCATATAGATACGACTATCACAGATGATTCTAAGTGGTGGAGATTCACAGGGTTATATGGAAACCCAAGTACT
AACAAAAGAAAGGACTTGTGGAACCTTCTGGATAAGCTTAGCAGGGACTCCAACCTCCATTGGATTGTCGACGAAGATTTAAACGAAATTCTCTTCGAAGGTTGTTTAAC
TTGGGAGAAGAAGGTTAGAGGCTCGAGGGTGGTTAAAGAAAGGTTGGATAGATTCCTAGCCACAAACGAGATCAAGAACATCTTTAAAAGCATCGACATTCGGCATTTAT
CCAAGCACAACTCAGATCACAAGGCCATTGTGGCAGTTCTGGAGAAAAGGCCGCAAGAGAACAAGAAAAGTTGGAACAAAGATATGCTTAAAGGATCGATCCAATCAGCC
ATCCGCAGGAAAGAGAGTGAGATTGGTGAGATCATCGCGGGAAGTGACGTCCTCAAAGATCTGAGACTTACCAAGGCAGAAATTGAGCTAGAAAACCTCATGGAGGAAGA
AGAAATCTATTGGAAGAACATTTTGGATGGTATCCAGTTGAAAATCTCAGAGGACCAGTGTAGATTTCTCGATGCCCCGTTTGTTAAAGAGGAGATTAAAACAGCTCTTA
AAGGAATGAACCCTAGCAAAGCCCCTGAGGAGGATGGAGCCCATGCCATGTTCTATCAAAACTATTGGGATATAGTGGGAGCAAGGGTAACTCATGTTTGTTTGGAGGTC
TTGAATGAGGACACGGATATTGGCCCGCTTAACAGTACCAAGATTGTTCTTATCCCGAAAAAAAAGAACCCTACCAGATTTGAGAAGTTTAGGCCTATAAACTTGTGCAA
TTTGATTTACAAGATTATTGCAAAAGCCTGGCTAACAGATTAA
Protein sequenceShow/hide protein sequence
MKILCWNVRGMGNPRTIRALRHETRRQNPNIIFISETKCGLNKVEKVRRDLKFDFAFSVPSEGKSGGLAIFWQEDSTLQIRSYSVGHIDTTITDDSKWWRFTGLYGNPST
NKRKDLWNLLDKLSRDSNLHWIVDEDLNEILFEGCLTWEKKVRGSRVVKERLDRFLATNEIKNIFKSIDIRHLSKHNSDHKAIVAVLEKRPQENKKSWNKDMLKGSIQSA
IRRKESEIGEIIAGSDVLKDLRLTKAEIELENLMEEEEIYWKNILDGIQLKISEDQCRFLDAPFVKEEIKTALKGMNPSKAPEEDGAHAMFYQNYWDIVGARVTHVCLEV
LNEDTDIGPLNSTKIVLIPKKKNPTRFEKFRPINLCNLIYKIIAKAWLTD