; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016474 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016474
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:38125165..38129151
RNA-Seq ExpressionLag0016474
SyntenyLag0016474
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017613541.1 PREDICTED: uncharacterized protein LOC108458646 [Gossypium arboreum]2.5e-6636.34Show/hide
Query:  IKSVTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSD
        IK      AWRF G YG+P       +W  LR+LG+    PW+V  DFNEI+ + EK GG+ R    M+ F E +    L+D G++G  FTW   +    
Subjt:  IKSVTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSD

Query:  HIWERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQDE-------------GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHR
        +I ERLDR + N     L  +  + HL     DHRPL++  + D+  + DE              E++ D++YW QRA   WL++G+KNT +FH+ A+ R
Subjt:  HIWERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQDE-------------GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHR

Query:  KKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQT-------------------------SNP-NLDDIECILKTIPTTITPNQNPVPT-KCFTQGKF
        K+ NTI  L  D G      +E+  +   +FQ +F T                         S+P  +++I+  LK +  T  P  +  P  +       
Subjt:  KKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQT-------------------------SNP-NLDDIECILKTIPTTITPNQNPVPT-KCFTQGKF

Query:  LKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI
        +   K+ +     +  PISLC+VIYK++AKT+A RL+ V+   I   QSAFVPGRLISNN ++ +E +H  + +  GKK  +A+KLDMSKAYD VEW ++
Subjt:  LKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI

Query:  WKIMQKMGFCDRWVNLIMKCV
          +M +MGF   WV LIMKC+
Subjt:  WKIMQKMGFCDRWVNLIMKCV

XP_030924992.1 uncharacterized protein LOC115952038 [Quercus lobata]2.7e-6532.84Show/hide
Query:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR
        AWRF G YG P+     ++WT LR L    S PW+   DFNEI    EK GG LRSD+ MQ F +C+D+C   D GF+G  FTWCN  F    +W RLDR
Subjt:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR

Query:  FLLNHHMQDLCSVFRVRHLALIAYDHRPLVV--------------------EWSLDQ-------------------------------------------
         + +     +    R+ HL+  + DH+P+ +                     W+ D+                                           
Subjt:  FLLNHHMQDLCSVFRVRHLALIAYDHRPLVV--------------------EWSLDQ-------------------------------------------

Query:  ----NSQ------QDEGELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECI
            NSQ      +    L+ ++  W QRA  +WL++G++N+K+FH +A  R K N I GL D  GNW   +  +  ++  ++  +F +S P   +   +
Subjt:  ----NSQ------QDEGELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECI

Query:  LKTIPTTITPNQNPVPTKCFTQGK---------------------------FLKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQ
        L+ + + ++ + N V  + F   +                            +  +K   H + +   PISLC+ +YK+I+K LANRLK  L  +IS +Q
Subjt:  LKTIPTTITPNQNPVPTKCFTQGK---------------------------FLKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQ

Query:  SAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES
        SAF+PGRLI++N +I  E++H +K++R GK+G++ALKLDMSKAYD VEW  + KIM KMGF  RW+ LI  C+ S
Subjt:  SAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata]5.2e-6431.42Show/hide
Query:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR
        AWRF G YG P+     ++W+ LR L      PW+   DFNEI S  EK GG LRSDR MQ F +C+D+C   D GFTG  FTWCN  F    +W RLDR
Subjt:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR

Query:  FLLNHHMQDLCSVFRVRHLALIAYDHRPLVV--------------------EWSLDQNSQ----------------------------------------
         L +          R+ HLA  + DH+P+ +                     W  D+  +                                        
Subjt:  FLLNHHMQDLCSVFRVRHLALIAYDHRPLVV--------------------EWSLDQNSQ----------------------------------------

Query:  ----------------------QDEGELED-----------DKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVI
                              Q+ G+++D           ++  W QRA  +WL++G++N+K+FH +A+ R K N I GL DD G W  ++  +  ++ 
Subjt:  ----------------------QDEGELED-----------DKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVI

Query:  QYFQDMFQTSNP--------------------------NLDDIECILKTIPTTITPNQNPVPTKCFTQ---------------------------GKFLK
        +Y+  +F +SNP                             ++E  L  +    +P  +  P   + Q                             FL 
Subjt:  QYFQDMFQTSNP--------------------------NLDDIECILKTIPTTITPNQNPVPTKCFTQ---------------------------GKFLK

Query:  WL-KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIW
         + KI + +      PISL +V+YK+IAK LANRLK  L ++IS TQSAFVPGRLI++N +I  E++H +K++R GK G++ALKLDMSKAYD VEW ++ 
Subjt:  WL-KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIW

Query:  KIMQKMGFCDRWVNLIMKCVES
        KIM+ +GF  RW++LI  C+ S
Subjt:  KIMQKMGFCDRWVNLIMKCVES

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata]2.7e-6532.03Show/hide
Query:  WRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDRF
        WR  G YG+P   +   +W  +R L      PW++  DFNEIV + EK G L R  R M+ F +C+  C L+D GF G  +TWCN          RLDR 
Subjt:  WRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDRF

Query:  LLNHHMQDLCSVFRVRHLALIAYDH------------------RPLVVEWS------------LDQNSQQD-----------------EGELED----DK
        + N    ++    +V H A+ A DH                  +  +  W+            L QN  Q+                 + E+ +    ++
Subjt:  LLNHHMQDLCSVFRVRHLALIAYDH------------------RPLVVEWS------------LDQNSQQD-----------------EGELED----DK

Query:  IYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPN-------------LDD--------------
        + W+QR+   W+K G++NTK+FH  A++R++ N I+GL D+ G W     E+E +++ YFQ++F TS P+              DD              
Subjt:  IYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPN-------------LDD--------------

Query:  ----------------------------------IECILKTIPTTITPNQNPVPTKCFTQGKFLKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLK
                                          ++C+L T+ T + PN       C          K+   Q      PISLC+VIYK+++K LANRLK
Subjt:  ----------------------------------IECILKTIPTTITPNQNPVPTKCFTQGKFLKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLK

Query:  TVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVESDYRWRIGIGINVEA
         VL  IIS  QSAFVPGR I++N ++ FE++H +  +R GKKG++A+KLDMSKAYD VEW Y+  IM ++GF +RW++L+M CV S       + +N E 
Subjt:  TVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVESDYRWRIGIGINVEA

Query:  ARDPWIPKGRLL
              PKGR+L
Subjt:  ARDPWIPKGRLL

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]2.6e-6330.21Show/hide
Query:  SAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLD
        +AWR  G YG P+    HE W  LR L       W    DFNE++  ++K GG+ RS   MQ F + +D C  +D GF+GPEFTW  +  + + IWERLD
Subjt:  SAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLD

Query:  RFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQD---------------------------------------------------------
        R + N+         RV+HL     DHRPL++  SLD N ++                                                          
Subjt:  RFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQD---------------------------------------------------------

Query:  --------------------------------------EGELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMEL
                                               G LE ++  WHQR+  +WL+ G++NT++FH  A  RK+ N I+GL D+ G W  ++     
Subjt:  --------------------------------------EGELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMEL

Query:  VVIQYFQDMFQTSNP--------------------------NLDDIECILKTIPTTITPNQNPVP-------------------TKCFTQGKFLKWL---
        ++  +++ +F++SNP                          + D++E  +K +     P  + +P                     C   G  LK +   
Subjt:  VVIQYFQDMFQTSNP--------------------------NLDDIECILKTIPTTITPNQNPVP-------------------TKCFTQGKFLKWL---

Query:  ------KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWI
              K+   +      PISLC+V+YK+++K +ANRLK +LN IIS TQSAF+  RLI++N +I FES+H +KN   GK G +ALKLDMSKAYD VEW 
Subjt:  ------KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWI

Query:  YIWKIMQKMGFCDRWVNLIMKCV
        ++ K++ K+GF + WV+LIM+C+
Subjt:  YIWKIMQKMGFCDRWVNLIMKCV

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein3.0e-6531.93Show/hide
Query:  SSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERL
        + AWR  G YG P+  L  E+WT LRRL    S PW    DFNE+    EK G + RS+  MQ+F + ID C  +D G+ GP FTW N     D  WERL
Subjt:  SSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERL

Query:  DRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVE-----------------WSLDQNSQQDEG----ELED--------DKIY-----------------
        DR L       L    +V HL     DH+P+++                  W+ D   ++        L+D        DKI+                 
Subjt:  DRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVE-----------------WSLDQNSQQDEG----ELED--------DKIY-----------------

Query:  --------------------------------------------WHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQ
                                                    W QR+  EWL+ G+KNT++FH +A  R++ N I  L D  G+W    +++  V I 
Subjt:  --------------------------------------------WHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQ

Query:  YFQDMFQTSNPNLDDIECILKTIPTTITPNQNPVPTK-----------------------------------------------CFTQGKFLKWL-----
        ++ D+F ++NPN   IE +++ IP  +T   N   TK                                               C   GK LK +     
Subjt:  YFQDMFQTSNPNLDDIECILKTIPTTITPNQNPVPTK-----------------------------------------------CFTQGKFLKWL-----

Query:  ----KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI
            K+   +S     PISLC+VIYK+I+K L NRLK++L  I+S +QSAFVPGRLI++N ++ FE++H +  +R+GK G VALKLDMSKAYD VEW Y+
Subjt:  ----KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI

Query:  WKIMQKMGFCDRWVNLIMKCVES
         ++MQ+MGF ++WV ++M+C+ +
Subjt:  WKIMQKMGFCDRWVNLIMKCVES

A0A2N9EX83 Reverse transcriptase domain-containing protein1.7e-6535.31Show/hide
Query:  WRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDRF
        WRF G YG+P        W  LR L    + PW  G DFNE++   EK+G + R +  M++F   +D C  +D GF G  +TW NK   +  + ERLDR 
Subjt:  WRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDRF

Query:  LLNHHMQDLCSVFRVRHLALIAYDHRPLVVE--------------------WSLDQNS------------------QQDEGELED----DKIYWHQRAWE
        L            RV HL  +  DH+PL VE                    W+L                      Q+  GEL D    ++  W QR+  
Subjt:  LLNHHMQDLCSVFRVRHLALIAYDHRPLVVE--------------------WSLDQNS------------------QQDEGELED----DKIYWHQRAWE

Query:  EWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNP-NLDDIECILKTIPTTITPNQN------------------
         WL+ G++NTK+FH +A +RK+ N I G+ D VG W     E+E  +++Y++D+F TS P N D+   IL  +   IT + N                  
Subjt:  EWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNP-NLDDIECILKTIPTTITPNQN------------------

Query:  ------PVP-----------------------TKCFTQGKFLKWL---------KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSA
              P P                         C   G  LK +         K+   +S     PISLC+VIYK+IAK LANRLK +L  IIS +QSA
Subjt:  ------PVP-----------------------TKCFTQGKFLKWL---------KICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSA

Query:  FVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES
        FVPGRLIS+N +I FE++H +K+ +  K+G +ALKLDMSKAYD VEW ++ +IM  MGF + WV++IM+CV +
Subjt:  FVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES

A0A2N9F6W6 Reverse transcriptase domain-containing protein1.0e-6535.32Show/hide
Query:  VTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIW
        V PS  WRF G YG+P+   H E+W  L RL      PW+   DFNEI+S  E+ G +      MQ F E ++ C LID GF G  FTW N+      + 
Subjt:  VTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIW

Query:  ERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVE--------------------WSL-------DQNS------QQDEGELEDDKIYWHQRAWEEWL
        +RLDR L      D  S+  V H+     DH P++V+                    W+L         NS       +    L  ++++W QR+   WL
Subjt:  ERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVE--------------------WSL-------DQNS------QQDEGELEDDKIYWHQRAWEEWL

Query:  KWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECILKTIPTTIT------------------------PN
          G+ NTK+FH  A  R++ N + G+L+D   W   D + + V ++YFQ +F +SNP    I   L+ +  T+T                        P+
Subjt:  KWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECILKTIPTTIT------------------------PN

Query:  QNPVPTKCFTQGKFLKWLKI-----------------------CTHQSSWSR----------WPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVP
        + P P  C +   F K+  +                        TH S   +           PISLC+V+YK+I+K +ANRLK VL  IIS  QSAFVP
Subjt:  QNPVPTKCFTQGKFLKWLKI-----------------------CTHQSSWSR----------WPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVP

Query:  GRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES
        GRLI++N  + FE IH  K +RKGKKG +ALKLDMSKAYD VEW ++  I++K+GF ++WV +IM CV S
Subjt:  GRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVES

A0A2N9IZB6 Reverse transcriptase domain-containing protein7.8e-6638.14Show/hide
Query:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR
        +WR  G YG+P+  L  +TW  LRRL    + PWMV +DFNE +S  E+ G   RS   M  F E +    L D GF GPEFTW N+    D +  RLDR
Subjt:  AWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSDHIWERLDR

Query:  FLLNHHMQDLCSVFRVRHLALIAYDH---------------------RPLVVEWSLDQNSQQDE--GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKAN
         + N     L    +V H+ + + DH                     +P+V   +   N+ + E    L  +++ W QR+   WL  G++NT +FH  A 
Subjt:  FLLNHHMQDLCSVFRVRHLALIAYDH---------------------RPLVVEWSLDQNSQQDE--GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKAN

Query:  HRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECILKTIPTTITPNQNPVPTKCFTQGKFLKWL-----------KICTHQSSWS
         RK+ NTI+GLLD    W     E+E +   YF  +F +S P   D++ +++ + + +T + N    + F+  +  + L            + + +S   
Subjt:  HRKKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQTSNPNLDDIECILKTIPTTITPNQNPVPTKCFTQGKFLKWL-----------KICTHQSSWS

Query:  RWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWV
          PISLC+VIYK+I+K L NR+K VL  +IS +Q AFVPGR+I++N II FE+IH +KN R GK   +A KLDMSKAYD VEW Y+  +M+K+GF  RWV
Subjt:  RWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWV

Query:  NLIMKCVES
        +LIM CV S
Subjt:  NLIMKCVES

A0A6P4MF11 uncharacterized protein LOC1084586461.2e-6636.34Show/hide
Query:  IKSVTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSD
        IK      AWRF G YG+P       +W  LR+LG+    PW+V  DFNEI+ + EK GG+ R    M+ F E +    L+D G++G  FTW   +    
Subjt:  IKSVTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWCNKHFQSD

Query:  HIWERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQDE-------------GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHR
        +I ERLDR + N     L  +  + HL     DHRPL++  + D+  + DE              E++ D++YW QRA   WL++G+KNT +FH+ A+ R
Subjt:  HIWERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQDE-------------GELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHR

Query:  KKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQT-------------------------SNP-NLDDIECILKTIPTTITPNQNPVPT-KCFTQGKF
        K+ NTI  L  D G      +E+  +   +FQ +F T                         S+P  +++I+  LK +  T  P  +  P  +       
Subjt:  KKVNTIRGLLDDVGNWNIKDSEMELVVIQYFQDMFQT-------------------------SNP-NLDDIECILKTIPTTITPNQNPVPT-KCFTQGKF

Query:  LKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI
        +   K+ +     +  PISLC+VIYK++AKT+A RL+ V+   I   QSAFVPGRLISNN ++ +E +H  + +  GKK  +A+KLDMSKAYD VEW ++
Subjt:  LKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYI

Query:  WKIMQKMGFCDRWVNLIMKCV
          +M +MGF   WV LIMKC+
Subjt:  WKIMQKMGFCDRWVNLIMKCV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.2e-0428.43Show/hide
Query:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGK-KGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVN
        PISL ++  K++ K LANR++  +  +I   Q  F+PG     N     +SI+++++  + K K  V + +D  KA+D ++  ++ K + K+G    ++ 
Subjt:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGK-KGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVN

Query:  LI
        +I
Subjt:  LI

P11369 LINE-1 retrotransposable element ORF2 protein3.9e-0631.68Show/hide
Query:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNL
        PISL ++  K++ K LANR++  +  II P Q  F+PG     N       IH + N+ K K  ++ + LD  KA+D ++  ++ K++++ G    ++N+
Subjt:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNL

Query:  I
        I
Subjt:  I

P14381 Transposon TX1 uncharacterized 149 kDa protein2.9e-0937.62Show/hide
Query:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNL
        P+SL S  YK++AK ++ RLK+VL ++I P QS  VPGR I +N  +  + +H    RR G   +  L LD  KA+D V+  Y+   +Q   F  ++V  
Subjt:  PISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNL

Query:  I
        +
Subjt:  I

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.9e-1237.35Show/hide
Query:  LANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWV
        +  RLK ++ ++I P Q++F+PGR+ ++N +   E++H ++ R+KG KG + LKLD+ KAYD + W Y+   +   GF + W+
Subjt:  LANRLKTVLNDIISPTQSAFVPGRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATATGGTCCTTGGCTTCGAGAATCGATTAAATTAAAAGTCATGGAAATTATGGATTATGAAAAAAAGAAAGTATTGTCAAAGGAATGTCAGGATACAACAGTGAT
GAAGGTGGATTTACCAACAGCTGAACCCGGAGTAAAGGAAGTCATGCAAAAGGATTCGAAATTAAATTCTGAGGAAACTATCCTTGGGGAAGAGAGGAAAGAGAAGTCTA
AAAAAATGGAGAAGATTACCACGTATACAGGTATCAGTAAAAAGAAAATCGTGATGGTAGAGGAAAATTCAACTGCAAGATCGGTGGAGGCTGCGAGTCAGCCCGCCGGG
CACAATGAAAACCATAAGTTGGAATGTCCAAGGGTTGGGGAACCCTCGGACATTAAGAGCGTTACGCCATCTAGTGCGTGGAGGTTCATAGGGATTTATGGTAATCCAAA
GAGAGAATTACATCATGAAACTTGGACGTTCCTAAGGCGGTTAGGAGAAGGAGTCTCTTATCCTTGGATGGTAGGCAATGATTTTAATGAAATCGTTTCCAATACAGAGA
AATTTGGTGGTTTGCTTCGTTCTGATAGAGATATGCAAAAGTTTTGGGAGTGCATCGACTACTGTAGCTTGATAGACCCTGGTTTTACGGGGCCAGAATTCACGTGGTGT
AACAAACATTTCCAGTCTGATCATATTTGGGAAAGATTGGATAGGTTTTTACTCAACCATCATATGCAGGACCTGTGTAGTGTGTTCAGAGTTAGACATCTAGCTCTCAT
TGCATATGATCATCGGCCACTTGTAGTTGAATGGAGTTTGGACCAGAATAGCCAACAAGATGAAGGTGAGCTTGAGGATGATAAGATCTATTGGCATCAACGAGCCTGGG
AGGAGTGGCTAAAATGGGGCAACAAGAACACGAAGTGGTTCCACAGGAAAGCCAACCATCGAAAAAAGGTTAACACTATCCGGGGTTTGTTGGATGATGTTGGGAACTGG
AATATAAAGGATTCAGAGATGGAGTTGGTGGTGATCCAGTACTTTCAGGACATGTTCCAAACATCTAATCCAAACTTAGATGATATTGAATGTATTCTGAAGACTATACC
AACAACCATCACGCCAAATCAAAATCCAGTGCCGACTAAATGCTTTACTCAGGGGAAATTTTTGAAGTGGTTAAAAATATGCACCCACCAAAGCTCCTGGAGCAGATGGC
CTATTAGCCTTTGCTCGGTGATTTACAAAGTGATTGCAAAAACGCTTGCCAATAGGCTGAAAACAGTGTTAAATGATATTATTTCTCCTACTCAATCTGCTTTTGTGCCT
GGAAGACTAATATCAAATAATACTATTATCGACTTTGAATCTATTCATTTGGTTAAGAACCGTAGGAAGGGTAAGAAAGGTATAGTTGCGTTAAAACTGGATATGAGCAA
GGCTTATGATCATGTGGAGTGGATATATATCTGGAAAATTATGCAAAAGATGGGCTTCTGCGACAGATGGGTTAATTTGATAATGAAATGTGTTGAATCCGATTATCGTT
GGAGAATTGGTATTGGAATTAACGTTGAGGCTGCTAGGGATCCTTGGATTCCTAAAGGAAGGCTCCTGAAATACACTGAATCAACAACCCATTTATTCTGGGAGTGCAAA
GCGATCTGGACTCACAGGAACGAGGTGACTCAAAGCAATTACAAGGAAAATGTGCAGATGTTGGAGGCTAAAATCAAAAGATACATAACAAAATTTCTAAATCAAGATGC
GATTGGGAGTTATTCGGGTATCTGGTCCCTCGAGGTTAATCGTCATCTGGCACCTAATTTGCAACCTCAGATCGCGAGCATGGAGAACCTACAATCTCAAATCGATGTTA
TTCAACCTCGAGTGGCTTCGTGGATTCCTCCCTCAGATGGTATCTGCAAGCTTAAATATGATGCCACTTGGAGCACGACATACCGGTGCGATAGGATTGGATGGATCCTA
AGAGATTGGTTTGGACGTCTATTGCGAAGGGGGTACAAATGCGTGAATCACCGATGGAAAATCAGATGGTTGGAGGCGTTCTCCGTTTGTGAAGCGTTGCGTGCTCTCCC
TCAGGATTCCCCCATTTTCAGCTTGAGTTGGATGCCCTTCAGGTGGCAAAGCAGACCAACGAGGATATGGATGCCACGGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATATGGTCCTTGGCTTCGAGAATCGATTAAATTAAAAGTCATGGAAATTATGGATTATGAAAAAAAGAAAGTATTGTCAAAGGAATGTCAGGATACAACAGTGAT
GAAGGTGGATTTACCAACAGCTGAACCCGGAGTAAAGGAAGTCATGCAAAAGGATTCGAAATTAAATTCTGAGGAAACTATCCTTGGGGAAGAGAGGAAAGAGAAGTCTA
AAAAAATGGAGAAGATTACCACGTATACAGGTATCAGTAAAAAGAAAATCGTGATGGTAGAGGAAAATTCAACTGCAAGATCGGTGGAGGCTGCGAGTCAGCCCGCCGGG
CACAATGAAAACCATAAGTTGGAATGTCCAAGGGTTGGGGAACCCTCGGACATTAAGAGCGTTACGCCATCTAGTGCGTGGAGGTTCATAGGGATTTATGGTAATCCAAA
GAGAGAATTACATCATGAAACTTGGACGTTCCTAAGGCGGTTAGGAGAAGGAGTCTCTTATCCTTGGATGGTAGGCAATGATTTTAATGAAATCGTTTCCAATACAGAGA
AATTTGGTGGTTTGCTTCGTTCTGATAGAGATATGCAAAAGTTTTGGGAGTGCATCGACTACTGTAGCTTGATAGACCCTGGTTTTACGGGGCCAGAATTCACGTGGTGT
AACAAACATTTCCAGTCTGATCATATTTGGGAAAGATTGGATAGGTTTTTACTCAACCATCATATGCAGGACCTGTGTAGTGTGTTCAGAGTTAGACATCTAGCTCTCAT
TGCATATGATCATCGGCCACTTGTAGTTGAATGGAGTTTGGACCAGAATAGCCAACAAGATGAAGGTGAGCTTGAGGATGATAAGATCTATTGGCATCAACGAGCCTGGG
AGGAGTGGCTAAAATGGGGCAACAAGAACACGAAGTGGTTCCACAGGAAAGCCAACCATCGAAAAAAGGTTAACACTATCCGGGGTTTGTTGGATGATGTTGGGAACTGG
AATATAAAGGATTCAGAGATGGAGTTGGTGGTGATCCAGTACTTTCAGGACATGTTCCAAACATCTAATCCAAACTTAGATGATATTGAATGTATTCTGAAGACTATACC
AACAACCATCACGCCAAATCAAAATCCAGTGCCGACTAAATGCTTTACTCAGGGGAAATTTTTGAAGTGGTTAAAAATATGCACCCACCAAAGCTCCTGGAGCAGATGGC
CTATTAGCCTTTGCTCGGTGATTTACAAAGTGATTGCAAAAACGCTTGCCAATAGGCTGAAAACAGTGTTAAATGATATTATTTCTCCTACTCAATCTGCTTTTGTGCCT
GGAAGACTAATATCAAATAATACTATTATCGACTTTGAATCTATTCATTTGGTTAAGAACCGTAGGAAGGGTAAGAAAGGTATAGTTGCGTTAAAACTGGATATGAGCAA
GGCTTATGATCATGTGGAGTGGATATATATCTGGAAAATTATGCAAAAGATGGGCTTCTGCGACAGATGGGTTAATTTGATAATGAAATGTGTTGAATCCGATTATCGTT
GGAGAATTGGTATTGGAATTAACGTTGAGGCTGCTAGGGATCCTTGGATTCCTAAAGGAAGGCTCCTGAAATACACTGAATCAACAACCCATTTATTCTGGGAGTGCAAA
GCGATCTGGACTCACAGGAACGAGGTGACTCAAAGCAATTACAAGGAAAATGTGCAGATGTTGGAGGCTAAAATCAAAAGATACATAACAAAATTTCTAAATCAAGATGC
GATTGGGAGTTATTCGGGTATCTGGTCCCTCGAGGTTAATCGTCATCTGGCACCTAATTTGCAACCTCAGATCGCGAGCATGGAGAACCTACAATCTCAAATCGATGTTA
TTCAACCTCGAGTGGCTTCGTGGATTCCTCCCTCAGATGGTATCTGCAAGCTTAAATATGATGCCACTTGGAGCACGACATACCGGTGCGATAGGATTGGATGGATCCTA
AGAGATTGGTTTGGACGTCTATTGCGAAGGGGGTACAAATGCGTGAATCACCGATGGAAAATCAGATGGTTGGAGGCGTTCTCCGTTTGTGAAGCGTTGCGTGCTCTCCC
TCAGGATTCCCCCATTTTCAGCTTGAGTTGGATGCCCTTCAGGTGGCAAAGCAGACCAACGAGGATATGGATGCCACGGAATTAA
Protein sequenceShow/hide protein sequence
MPYGPWLRESIKLKVMEIMDYEKKKVLSKECQDTTVMKVDLPTAEPGVKEVMQKDSKLNSEETILGEERKEKSKKMEKITTYTGISKKKIVMVEENSTARSVEAASQPAG
HNENHKLECPRVGEPSDIKSVTPSSAWRFIGIYGNPKRELHHETWTFLRRLGEGVSYPWMVGNDFNEIVSNTEKFGGLLRSDRDMQKFWECIDYCSLIDPGFTGPEFTWC
NKHFQSDHIWERLDRFLLNHHMQDLCSVFRVRHLALIAYDHRPLVVEWSLDQNSQQDEGELEDDKIYWHQRAWEEWLKWGNKNTKWFHRKANHRKKVNTIRGLLDDVGNW
NIKDSEMELVVIQYFQDMFQTSNPNLDDIECILKTIPTTITPNQNPVPTKCFTQGKFLKWLKICTHQSSWSRWPISLCSVIYKVIAKTLANRLKTVLNDIISPTQSAFVP
GRLISNNTIIDFESIHLVKNRRKGKKGIVALKLDMSKAYDHVEWIYIWKIMQKMGFCDRWVNLIMKCVESDYRWRIGIGINVEAARDPWIPKGRLLKYTESTTHLFWECK
AIWTHRNEVTQSNYKENVQMLEAKIKRYITKFLNQDAIGSYSGIWSLEVNRHLAPNLQPQIASMENLQSQIDVIQPRVASWIPPSDGICKLKYDATWSTTYRCDRIGWIL
RDWFGRLLRRGYKCVNHRWKIRWLEAFSVCEALRALPQDSPIFSLSWMPFRWQSRPTRIWMPRN