; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022770 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022770
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:37595253..37598693
RNA-Seq ExpressionLag0022770
SyntenyLag0022770
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3453657.1 reverse transcriptase [Gossypium australe]4.7e-7236.85Show/hide
Query:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK--GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-GYF
        G RGGL+L W+EG+ + + S    HI + ++D      WRFT FY     + RK+SW+LL  L + +   W++ GDFNEIL ++EK+G   R +RQ   F
Subjt:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK--GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-GYF

Query:  RVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAA-SLGKHPWEEEEQYWRIRSREDWLLGGDRNTK
        R  ++ C L DLGF G   TW + R   N  RER+ R  A  E   +F   KV H     S+H  +   + G      EEQ    R+R +WL  GDRNT 
Subjt:  RVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAA-SLGKHPWEEEEQYWRIRSREDWLLGGDRNTK

Query:  WFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNS------------------------------------------------A
        +FH  A  R++KN +  + +  G+ V + + +  +AT++F ++F+S + ++                                                A
Subjt:  WFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNS------------------------------------------------A

Query:  TVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVG
           +KYW I+G EVT+  L+VLN  ++I  +NKT I LIPK   PK + ++RPISLC+VIYK+I+K L    +KVL   I  TQ AFV+G  ITDN+ + 
Subjt:  TVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVG

Query:  FECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        +E +H++  KR     +  LKLDMSK YDR+EW FL K ++G+GF
Subjt:  FECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]6.0e-7536.8Show/hide
Query:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKR-IQRQGYFR
        G  GGL L+W   + ++V S  + HI  VI +  G  WR T  YG+ ES ++K +W LL RL+  S L W+  GDFNEI    EK G ++R   R   FR
Subjt:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKR-IQRQGYFR

Query:  VAIDSCNLIDLGFKGHKFTWRKSRNDP-----NRTRERIG--------RFFATHELIDMFGS--TKVEHRFWHR--SDHVSLAASLGKHPWEEEEQYWRI
         A+  C L+DLG KG+ FTW   RN+       + RE +G         F    + ++   +    + H F H    D +    +   +  ++EE +W+ 
Subjt:  VAIDSCNLIDLGFKGHKFTWRKSRNDP-----NRTRERIG--------RFFATHELIDMFGS--TKVEHRFWHR--SDHVSLAASLGKHPWEEEEQYWRI

Query:  RSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNSATVE------------------------------
        RSR DWL  GD+NTK+FH++A+ARRKKN + G+LD  G W  D + +  I  + F  LF+++ P +  ++                              
Subjt:  RSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNSATVE------------------------------

Query:  --------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQT
                            +K+W  +   V    L +LN+  +++PLN T+I+LIPK  +PK + E+RPISLC+VIY++IAK++AN LK +L  I+S  
Subjt:  --------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQT

Query:  QAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        Q+AF+L  LITDN+++G+E ++ I   +  K+G   LKLD+SKAYDRVEW FLR AMQ LGF
Subjt:  QAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]4.4e-7840.27Show/hide
Query:  SIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKD--FKGWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-G
        S+GK GGL L W+EG+ V+VL S + HI  V+K      WW  T FYG+ ++ RR +SW LL+R+   S+L W+  GDFNEI    EK+G S R++RQ  
Subjt:  SIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKD--FKGWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-G

Query:  YFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH-RFWHRSDHVSLAASLGKHPWEEEEQYWRIRSREDWLLGGDRN
         F   I+ C L DLG+ G  FTW   R D  + RER+ R  A+ + +  F   K+ H   W     + + A LG   W+E+ Q ++          GD+N
Subjt:  YFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH-RFWHRSDHVSLAASLGKHPWEEEEQYWRIRSREDWLLGGDRN

Query:  TKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNS-----ATVEKK------------------------------------
        T++FH+RA++R KKN + G+LD  G W  +E  +GEI    + DLF S+ P         V+ K                                    
Subjt:  TKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNS-----ATVEKK------------------------------------

Query:  -------YWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVV
               +W  IGN VTK+ L+ LN G      N T + LIPK   PK + +YRPISLC+VI+K+ +K +AN LKK+L  IIS TQ+AFV G LITDN++
Subjt:  -------YWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVV

Query:  VGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        V FE +H I+ K+  + G   LKLDMSKAYDRVEW  L K M+ LGF
Subjt:  VGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina]4.0e-7937.75Show/hide
Query:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEK-QGRSKRIQRQGYFR
        G  GGL L+W E + V + S  + H+  V+    G +WR T  YG+ ES ++K +WELL RL+Q S L W+  GDFNE+L+  EK  G+ KR+     FR
Subjt:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEK-QGRSKRIQRQGYFR

Query:  VAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASLGKHPWEEEEQYWRIRSREDWLLGGDRNTKWF
         A+  C+L DLG  G+ FTW   R  P+   E++ RF    +    +      +     SDH+ +   +               SR DWL  GDRNTK+F
Subjt:  VAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASLGKHPWEEEEQYWRIRSREDWLLGGDRNTKWF

Query:  HSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN----SATVE-------------------------------------------
        HS+A+ARRKKN +QG+ D +G W  D E +  +   +F ++F +S P+    +A +E                                           
Subjt:  HSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN----SATVE-------------------------------------------

Query:  ---KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVG
           +K+W  +   V    L +LNEG +++ LN TFI+L+PKV +P+K+ E+RPI+LC+VIY+++AKT+AN LK VL  IIS  Q+AFV   LITDN+++G
Subjt:  ---KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVG

Query:  FECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        +EC++ I   R  ++G   +KLD+SKAYDRVEW F++  MQ LGF
Subjt:  FECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis]1.9e-7337.01Show/hide
Query:  IGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKGW--WRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-GY
        +G  GGL L W  G+ V + +   GH+  V++   G   WRFT FYG+ E  ++ DSWELL RL     L W+V  DFNEIL   EK G   R + Q   
Subjt:  IGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKGW--WRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-GY

Query:  FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASL---------------------------GK
        F+ A+  C+L DLGF G  FTW  +R       ER+ R  A    + +F   +V H     SDH+ +   L                           GK
Subjt:  FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASL---------------------------GK

Query:  HP---------WEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD---------------P
                    E EE  W  R+R +WL  GDRNT +FHS+A  R KK  + G+ D    W +  E +  I   +F +LF++                 P
Subjt:  HP---------WEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD---------------P

Query:  NSATVE---------------------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLI
            VE                                 +++W I+G +VT+  L VLNEGK ++ +N TFI LIPKV  PK+M ++RPISLC+V+YKL+
Subjt:  NSATVE---------------------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLI

Query:  AKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        +K LAN ++K+L  IIS  Q+AFV G LI+DN++  FE  H + +KR  K G+  LKLDMSKAYDRVEW FLR  M+ +GF
Subjt:  AKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

TrEMBL top hitse value%identityAlignment
A0A2N9EVP1 Reverse transcriptase domain-containing protein3.2e-7435.74Show/hide
Query:  DQQKIKGNNSKIGVKE--REKSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK-GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFN
        D  +++G   K+G +      S+G+ GGL L+W+   ++ + +  + HI   ++  +   WR T FYG  E HR K+SW LL+ LS      W+  GDFN
Subjt:  DQQKIKGNNSKIGVKE--REKSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK-GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFN

Query:  EILTTYEKQGRSKRIQRQGY-FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGS--TKVEHRFWHRSDHV------------
        E+L  +EK+G + R  RQ   F+ ++++CN +DLG++G  +TW  +R+D    + R+ R  AT  L+D F     K+    W++   V            
Subjt:  EILTTYEKQGRSKRIQRQGY-FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGS--TKVEHRFWHRSDHV------------

Query:  ---------------SLAASL----------------GKHP-------------WEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDH
                       +LA  +                G+H                ++E +WR RSRE WL  GDRNTK+FH RA  RR KNT++G+LD 
Subjt:  ---------------SLAASL----------------GKHP-------------WEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDH

Query:  DGNWVADEERMGEIATKFFIDLFNSSDPNSATVEKKYWDIIGNEVTKISLEVLN----------------------EGKDISPLNKTFISLIPKVHQPKK
         G W  DE  MG+IA ++F ++F+SS      VE   W I     T ++ ++L+                      +G  +   N + I LIPK   P+ 
Subjt:  DGNWVADEERMGEIATKFFIDLFNSSDPNSATVEKKYWDIIGNEVTKISLEVLN----------------------EGKDISPLNKTFISLIPKVHQPKK

Query:  MEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        + +YRPISL +V+YK+++K LAN LK VL  IIS++Q+AF+ G  ITDNV V FE IHA+ S+RK K     +KLDMSKAYDRVEW FL + MQ +GF
Subjt:  MEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

A0A2N9F6W6 Reverse transcriptase domain-containing protein5.8e-7636.21Show/hide
Query:  SIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQ-RQGY
        S G  GGL L+W + +++ + +  + HI   I+      WRFT FYG+ ESHR ++SW LL RL+   +L W+  GDFNEIL+  E+ G    +  +   
Subjt:  SIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQ-RQGY

Query:  FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASLG---------------KHPWE--------
        F   ++ C LID+GF+G  FTW   R+     ++R+ R  AT   +D F  + V H     SDHV +   L                +  W         
Subjt:  FRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAASLG---------------KHPWE--------

Query:  ------------------EEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN---------
                           EE +WR RSR  WL  GD NTK+FH+ A  RR+ N M G+L+    W   +++   +A ++F  LF SS+P          
Subjt:  ------------------EEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN---------

Query:  ---------------------------------------SATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSV
                                               S+   +K+W +IG  V+   L VLN  + +  +N T ISLIPK   P+ M +YRPISLC+V
Subjt:  ---------------------------------------SATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSV

Query:  IYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        +YK+I+K +AN LK VL  IIS  Q+AFV G LITDNV V FE IH    KRK K G   LKLDMSKAYDRVEW FL   ++ LGF
Subjt:  IYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

A0A2N9GKW3 Reverse transcriptase domain-containing protein9.0e-7736.65Show/hide
Query:  KSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK-GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-G
        +S  K GGL L W++ ++++V S    HI  ++ + +   WRFT FYG  E+H+R++SW LL RL+   KL W   GDFNE++   EK GR  R +RQ  
Subjt:  KSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFK-GWWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQ-G

Query:  YFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH--------RFWHRS---------------------------DH
         FR  +D C  +DLGF G KFTW  +R   + T ER+ R  AT   +  F S +V H        R W R+                           DH
Subjt:  YFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH--------RFWHRS---------------------------DH

Query:  VSLAASLGK--HPWEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD-----------PNS
          ++   G+      +EE+ WR RSR +WL  GDRNT++FH RA  R+++N +  +   DG W     ++  +  +++  +F +++           P  
Subjt:  VSLAASLGK--HPWEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD-----------PNS

Query:  ATVE-------------------------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYK
         T E                                     +KYW +IG +VTK  L  LN GK +  +N T+I+LIPKV  P+++ E+RPISLC+VIYK
Subjt:  ATVE-------------------------------------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYK

Query:  LIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        LI+K LAN LK +L  IIS++Q+AF+ G LITDN++V FE +H +  +R  + G+  +KLDMSKAYDRVEW +L+  MQ +GF
Subjt:  LIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

A0A7N2R0C3 Reverse transcriptase domain-containing protein2.5e-7433.63Show/hide
Query:  QQKIKGNNSKIGVKE--REKSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKGW--WRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFN
        Q++IKG   K+G+ +     S G+ GGL ++W EG+ V + S    HI +V+    G   WR T FYG+ ++  R  SW+LLE LS+   + W+V GDFN
Subjt:  QQKIKGNNSKIGVKE--REKSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKGW--WRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFN

Query:  EILTTYEKQGRSKRIQRQ-GYFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAAS---------
        EIL + EK G  +R  RQ   FR  + +C L+DLGF G +FTW   R    RT  R+ R  A  E +++F   KV HR    SDH  L+ S         
Subjt:  EILTTYEKQGRSKRIQRQ-GYFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAAS---------

Query:  ---------------------------LGKHP-----------------WE----------------------------------------------EEE
                                   LG +P                 W                                                EE
Subjt:  ---------------------------LGKHP-----------------WE----------------------------------------------EEE

Query:  QYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN------------------------------
          W  RSR  W+  GDRNT++FH+ AN RR+KN ++G+LD +G W  + E + EI  ++F ++++S+ P                               
Subjt:  QYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPN------------------------------

Query:  ------------------SATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFII
                          S    +KYWD++G +V +  +  L  G     +N+T+I LIPKV  P+K+ EYRPISLC+VIYKL++K LAN LK VL  ++
Subjt:  ------------------SATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFII

Query:  SQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
         + Q+AFV G  ITDNV+V FE +H IN +RK K G   +KLDMSKAYDRVEW +L   M+ +GF
Subjt:  SQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

A0A803PRV5 Uncharacterized protein5.5e-7434.12Show/hide
Query:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEK-QGRSKRIQRQGYFR
        GK GGL L+W   I+ QVLS  + HI   I+   G WWRFT FYG+ +  +R  SW+LL+RL++     W+VGGDFNEIL+  EK  G  K       FR
Subjt:  GKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKG-WWRFTRFYGNLESHRRKDSWELLERLSQASKLSWIVGGDFNEILTTYEK-QGRSKRIQRQGYFR

Query:  VAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH------------------------------RF--------------
         A+D C L D+G++G+ +TW   R + +   ER+ R     +  DMF S KV H                              RF              
Subjt:  VAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEH------------------------------RF--------------

Query:  -------------------------------WHRS-------------DHVS-LAASLGKHPW--------------EEEEQYWRIRSREDWLLGGDRNT
                                       W++S             D +S L+ S     W              ++EE++W+ RSR  WL  GD+NT
Subjt:  -------------------------------WHRS-------------DHVS-LAASLGKHPW--------------EEEEQYWRIRSREDWLLGGDRNT

Query:  KWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNSATVE--------------------------------------------
        K+FH +A+ R+ KNT++G++D    W+ + + MG++A  +F  LF S  PN   +E                                            
Subjt:  KWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNSATVE--------------------------------------------

Query:  ------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNV
              +KYW IIG EV+ + L +LNEG  I  +N T I LIPK+ +P +M E+RPISLC+VIYK+IAK LA  +K  +  +IS+ Q+AFV G LI DN 
Subjt:  ------KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNV

Query:  VVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        ++GFE +  + +KR       TLKLDMSKAYDRVEW FLR  M+GLG+
Subjt:  VVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-0627.92Show/hide
Query:  SSDPNSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKV-HQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLG
        S  P+  T E  ++Y + +   + K+   +  EG   +   +  I LIPK      K E +RPISL ++  K++ K LAN +++ ++ +I   Q  F+ G
Subjt:  SSDPNSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKV-HQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLG

Query:  MLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLG
        M    N+      I  IN  R     +  + +D  KA+D+++  F+ K +  LG
Subjt:  MLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLG

P08548 LINE-1 reverse transcriptase homolog6.1e-0629.3Show/hide
Query:  SSDPNSATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTF----ISLIPKV-HQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFV
        S  P+  T E  ++     E+  I L +    +    L  TF    I+LIPK    P + E YRPISL ++  K++ K L N +++ ++ II   Q  F+
Subjt:  SSDPNSATVEKKYWDIIGNEVTKISLEVLNEGKDISPLNKTF----ISLIPKV-HQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFV

Query:  LGMLITDNVVVGFECIHAINSKRKLKS-GNATLKLDMSKAYDRVEWFFLRKAMQGLG
         G     N+      I  IN   KLK+  +  L +D  KA+D ++  F+ + ++ +G
Subjt:  LGMLITDNVVVGFECIHAINSKRKLKS-GNATLKLDMSKAYDRVEWFFLRKAMQGLG

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-0836.94Show/hide
Query:  ISLIPKVHQ-PKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGN-ATLKLDMSKAYDRVEW
        I+LIPK  + P K+E +RPISL ++  K++ K LAN +++ ++ II   Q  F+ GM    N+      IH IN   KLK  N   + LD  KA+D+++ 
Subjt:  ISLIPKVHQ-PKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGN-ATLKLDMSKAYDRVEW

Query:  FFLRKAMQGLG
         F+ K ++  G
Subjt:  FFLRKAMQGLG

P14381 Transposon TX1 uncharacterized 149 kDa protein4.4e-1224.33Show/hide
Query:  IRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDP-----------------------------------
        +RSR   L   DR +++F++    +  +  +  +   DG  + D E + + A  F+ +LF S DP                                   
Subjt:  IRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDP-----------------------------------

Query:  -------------NSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQ
                     +  T+E  + +WD +G +  ++  E   +G+      +  +SL+PK    + ++ +RP+SL S  YK++AK ++  LK VL  +I  
Subjt:  -------------NSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQ

Query:  TQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
         Q+  V G  I DNV +  + +H     R+     A L LD  KA+DRV+  +L   +Q   F
Subjt:  TQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.2e-0622.75Show/hide
Query:  EQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD-------------------------------
        E ++R +SR  WL  GD NT++FH    A + KN ++ +   D   V +  ++ E+   ++  L  S                                 
Subjt:  EQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSD-------------------------------

Query:  -------------------PNSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLI
                           P+S T E   + W ++ +       E    G  +   N T I+LIPKV    ++  +RP+S C+V+YK+I
Subjt:  -------------------PNSATVE--KKYWDIIGNEVTKISLEVLNEGKDISPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.1e-0837.18Show/hide
Query:  LANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF
        +   LK ++  +I   QA+F+ G + TDN+V   E +H++  K+ +K G   LKLD+ KAYDR+ W +L   +   GF
Subjt:  LANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRKAMQGLGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATGAAACGGCTGCTCCAAACCACCAAGAACATGAAGAACTTCGTCTCCTTGAGTGGAATGCAAATGGTCTTCTTAGATCTCCATTCTGCCGTCAATTCGGCCTC
CGCCGATTCATTCGGTCTCCAATCGAAATACGCAGAAGTTCTAGGAAATGCTATTGGCATCTTTGAAGAGCAGACACATATGAGTGGGGAAGATCGAAGGGAGACTCTGA
GGGTGAAGGTCAGAATAGATGTCAACAAACCGATAAAAGAGGCACTCACGAGAAGATTGGGTCAAAAGAGGACATGGTGTGGATTCCAATCACCTTCGAGAAATTTCCGA
ATTTTTGTTATTATTGTGACAAGGAAACTCAAAGTAGCAAAGGAGTTTACAGAGGATACATTCCAAAACCAAGGGAAGAGATTTTTGGGAAGAGGAAGGGGAAGAGGAGA
CAGAGGAGGTCAGAGAAGAGGCAAGAACTGGAGACCTAACCAAAGAGAGTCTCCAGGGAAACTGCAGCAAGAGGAAGAAGGGGAATTGCTGGAAAAGCAAAACCGACCGA
AAAGTCACTCACCTGAGAAGAAGAGGGGTACAGAACAAGGCACGATGGAGAGAGAATCTGACAAAGGAAAGCAGGTGGCAGAAAGCAGGCGTCTGATGGAAAAGATAGAA
CTAGAAAAGGACAAAAGGGGTCAAACAAACGTCCAAATAGAAAAGGCTCCTGATCCATGTGATAATAAGAAAGAAAGCTCTGAGACAGTGAAAATCGACGATCAACAGAA
GATAAAAGGTAATAACAGTAAAATAGGCGTCAAGGAAAGGGAAAAAAGTATTGGTAAAAGAGGTGGTTTAACTCTGATGTGGGAAGAGGGCATTCAGGTTCAAGTCCTAT
CTTCATTAGAAGGGCATATTTATATGGTGATAAAAGACTTTAAAGGGTGGTGGAGATTTACCAGGTTCTACGGGAATCTGGAAAGTCATAGAAGGAAGGATTCGTGGGAG
CTTCTAGAGAGGCTTAGTCAAGCTTCAAAGCTCTCGTGGATCGTTGGAGGTGATTTTAATGAAATTCTCACGACATATGAAAAACAAGGACGATCGAAAAGAATTCAAAG
GCAGGGATATTTTAGAGTTGCCATTGATTCGTGCAATCTTATTGACCTTGGGTTTAAAGGTCACAAGTTCACTTGGAGAAAATCTAGAAATGACCCAAACAGGACTCGTG
AGAGGATTGGTAGATTTTTTGCCACTCATGAGCTCATTGACATGTTCGGTTCCACAAAAGTTGAGCACAGATTTTGGCATAGATCTGATCATGTGTCGTTAGCAGCGTCT
CTGGGGAAGCATCCCTGGGAAGAAGAGGAGCAATACTGGAGGATTAGATCAAGGGAAGATTGGTTGTTAGGAGGAGATAGGAACACCAAATGGTTCCATTCTAGAGCAAA
TGCCAGAAGGAAAAAGAATACCATGCAAGGAGTGCTCGACCATGATGGTAATTGGGTGGCTGATGAGGAGAGAATGGGAGAAATTGCCACAAAATTTTTCATTGATCTTT
TTAACTCCTCTGATCCTAATTCTGCAACAGTGGAGAAGAAATATTGGGACATTATTGGGAATGAGGTGACAAAGATTTCCTTGGAGGTTCTAAACGAGGGTAAAGACATT
AGTCCTCTGAACAAAACCTTTATATCCTTAATCCCAAAAGTGCATCAACCAAAGAAAATGGAGGAATATAGGCCAATTAGCCTATGCAGTGTGATCTACAAATTGATTGC
CAAAACATTAGCGAACATGTTGAAAAAAGTGCTGAGGTTTATCATATCTCAAACTCAAGCAGCTTTTGTTCTAGGAATGTTGATTACTGACAACGTAGTGGTTGGGTTCG
AATGTATTCATGCCATTAATAGCAAAAGGAAATTAAAAAGTGGTAACGCAACCCTTAAGCTCGACATGAGCAAGGCATACGACAGAGTGGAATGGTTTTTCCTAAGGAAA
GCTATGCAAGGCTTGGGTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGATGAAACGGCTGCTCCAAACCACCAAGAACATGAAGAACTTCGTCTCCTTGAGTGGAATGCAAATGGTCTTCTTAGATCTCCATTCTGCCGTCAATTCGGCCTC
CGCCGATTCATTCGGTCTCCAATCGAAATACGCAGAAGTTCTAGGAAATGCTATTGGCATCTTTGAAGAGCAGACACATATGAGTGGGGAAGATCGAAGGGAGACTCTGA
GGGTGAAGGTCAGAATAGATGTCAACAAACCGATAAAAGAGGCACTCACGAGAAGATTGGGTCAAAAGAGGACATGGTGTGGATTCCAATCACCTTCGAGAAATTTCCGA
ATTTTTGTTATTATTGTGACAAGGAAACTCAAAGTAGCAAAGGAGTTTACAGAGGATACATTCCAAAACCAAGGGAAGAGATTTTTGGGAAGAGGAAGGGGAAGAGGAGA
CAGAGGAGGTCAGAGAAGAGGCAAGAACTGGAGACCTAACCAAAGAGAGTCTCCAGGGAAACTGCAGCAAGAGGAAGAAGGGGAATTGCTGGAAAAGCAAAACCGACCGA
AAAGTCACTCACCTGAGAAGAAGAGGGGTACAGAACAAGGCACGATGGAGAGAGAATCTGACAAAGGAAAGCAGGTGGCAGAAAGCAGGCGTCTGATGGAAAAGATAGAA
CTAGAAAAGGACAAAAGGGGTCAAACAAACGTCCAAATAGAAAAGGCTCCTGATCCATGTGATAATAAGAAAGAAAGCTCTGAGACAGTGAAAATCGACGATCAACAGAA
GATAAAAGGTAATAACAGTAAAATAGGCGTCAAGGAAAGGGAAAAAAGTATTGGTAAAAGAGGTGGTTTAACTCTGATGTGGGAAGAGGGCATTCAGGTTCAAGTCCTAT
CTTCATTAGAAGGGCATATTTATATGGTGATAAAAGACTTTAAAGGGTGGTGGAGATTTACCAGGTTCTACGGGAATCTGGAAAGTCATAGAAGGAAGGATTCGTGGGAG
CTTCTAGAGAGGCTTAGTCAAGCTTCAAAGCTCTCGTGGATCGTTGGAGGTGATTTTAATGAAATTCTCACGACATATGAAAAACAAGGACGATCGAAAAGAATTCAAAG
GCAGGGATATTTTAGAGTTGCCATTGATTCGTGCAATCTTATTGACCTTGGGTTTAAAGGTCACAAGTTCACTTGGAGAAAATCTAGAAATGACCCAAACAGGACTCGTG
AGAGGATTGGTAGATTTTTTGCCACTCATGAGCTCATTGACATGTTCGGTTCCACAAAAGTTGAGCACAGATTTTGGCATAGATCTGATCATGTGTCGTTAGCAGCGTCT
CTGGGGAAGCATCCCTGGGAAGAAGAGGAGCAATACTGGAGGATTAGATCAAGGGAAGATTGGTTGTTAGGAGGAGATAGGAACACCAAATGGTTCCATTCTAGAGCAAA
TGCCAGAAGGAAAAAGAATACCATGCAAGGAGTGCTCGACCATGATGGTAATTGGGTGGCTGATGAGGAGAGAATGGGAGAAATTGCCACAAAATTTTTCATTGATCTTT
TTAACTCCTCTGATCCTAATTCTGCAACAGTGGAGAAGAAATATTGGGACATTATTGGGAATGAGGTGACAAAGATTTCCTTGGAGGTTCTAAACGAGGGTAAAGACATT
AGTCCTCTGAACAAAACCTTTATATCCTTAATCCCAAAAGTGCATCAACCAAAGAAAATGGAGGAATATAGGCCAATTAGCCTATGCAGTGTGATCTACAAATTGATTGC
CAAAACATTAGCGAACATGTTGAAAAAAGTGCTGAGGTTTATCATATCTCAAACTCAAGCAGCTTTTGTTCTAGGAATGTTGATTACTGACAACGTAGTGGTTGGGTTCG
AATGTATTCATGCCATTAATAGCAAAAGGAAATTAAAAAGTGGTAACGCAACCCTTAAGCTCGACATGAGCAAGGCATACGACAGAGTGGAATGGTTTTTCCTAAGGAAA
GCTATGCAAGGCTTGGGTTTCTGA
Protein sequenceShow/hide protein sequence
MGMKRLLQTTKNMKNFVSLSGMQMVFLDLHSAVNSASADSFGLQSKYAEVLGNAIGIFEEQTHMSGEDRRETLRVKVRIDVNKPIKEALTRRLGQKRTWCGFQSPSRNFR
IFVIIVTRKLKVAKEFTEDTFQNQGKRFLGRGRGRGDRGGQRRGKNWRPNQRESPGKLQQEEEGELLEKQNRPKSHSPEKKRGTEQGTMERESDKGKQVAESRRLMEKIE
LEKDKRGQTNVQIEKAPDPCDNKKESSETVKIDDQQKIKGNNSKIGVKEREKSIGKRGGLTLMWEEGIQVQVLSSLEGHIYMVIKDFKGWWRFTRFYGNLESHRRKDSWE
LLERLSQASKLSWIVGGDFNEILTTYEKQGRSKRIQRQGYFRVAIDSCNLIDLGFKGHKFTWRKSRNDPNRTRERIGRFFATHELIDMFGSTKVEHRFWHRSDHVSLAAS
LGKHPWEEEEQYWRIRSREDWLLGGDRNTKWFHSRANARRKKNTMQGVLDHDGNWVADEERMGEIATKFFIDLFNSSDPNSATVEKKYWDIIGNEVTKISLEVLNEGKDI
SPLNKTFISLIPKVHQPKKMEEYRPISLCSVIYKLIAKTLANMLKKVLRFIISQTQAAFVLGMLITDNVVVGFECIHAINSKRKLKSGNATLKLDMSKAYDRVEWFFLRK
AMQGLGF