; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010629 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010629
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:2512596..2520729
RNA-Seq ExpressionLag0010629
SyntenyLag0010629
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010693052.1 PREDICTED: uncharacterized protein LOC104906048 [Beta vulgaris subsp. vulgaris]6.6e-4226.74Show/hide
Query:  IYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGW
        I ERLDR   S SW  +YPN +V H    K DH ++ L  N + R    +QR   F+ +WL     ++++R++W  S+     D+L  R     Q +  W
Subjt:  IYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGW

Query:  GRRR--------------------------NCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQT
           R                          NC     LE    E         +  +P +G       L  V P V E  NR LL+PFT++++  AL Q 
Subjt:  GRRR--------------------------NCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQT

Query:  HPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA---------------------------------------------
        HP KAP  DG+   FY+  W+I+G DV      +L+   PP  +N T I LIPK                                              
Subjt:  HPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA---------------------------------------------

Query:  -------------------------------------------------------------WVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLS
                                                                     WV+ I+ CVSSV++SF +NG   G V P RGLRQGDPLS
Subjt:  -------------------------------------------------------------WVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLS

Query:  PYLFLFCAEGLSSLLREKA----------------------------------FEGTVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILS
        PYLF+  A+  S ++++K                                    E T+I D+L  YE+ASG+ INYEKS V+FS       ++ ++ IL 
Subjt:  PYLFLFCAEGLSSLLREKA----------------------------------FEGTVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILS

Query:  VSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQAMEKQIEKSL
        +     H +YLG+PS   R+R+ +F   +D + K ++   E      LL +A ++ + KS+
Subjt:  VSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQAMEKQIEKSL

XP_019199913.1 PREDICTED: uncharacterized protein LOC109193526 [Ipomoea nil]7.1e-4426.21Show/hide
Query:  SKTTPGALF-SETKVSSNRMNYVKQVLGFDCCFSVDCLGRSGGLALLLDS-----AVSFANDLLDFMVSHRLRPALDIVPLV-----RRQLNGWN-VRIQ
        SK  P  +F  ETKV+ +    ++  +GF+  F VD  G SGGLALL         +S++ + +D  V+    PA  +          R+   W+ +R+ 
Subjt:  SKTTPGALF-SETKVSSNRMNYVKQVLGFDCCFSVDCLGRSGGLALLLDS-----AVSFANDLLDFMVSHRLRPALDIVPLV-----RRQLNGWN-VRIQ

Query:  SSKKDLGHPYSLASSKGSVTTI----FFQLIFVALSILCNRRPG-GETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQR
        + +K  G+P+  +  +G   TI      QL  +       +  G    I ERLD+   ++ W+ I P+  V +L   K DH ++   +        +R+R
Subjt:  SSKKDLGHPYSLASSKGSVTTI----FFQLIFVALSILCNRRPG-GETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQR

Query:  IKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDE
          RF+ TWL     +  V  +W           L    + C   +  WG  R         + G   G GT   F            FG ++   P V +
Subjt:  IKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDE

Query:  GMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK------------------------
          N  LLRPF  +++  AL   +P KAP  DG++  FY++ W++V  DV    L  L  G  P  LN+T IVLIPK                        
Subjt:  GMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK------------------------

Query:  ----------------------------------------------------------------------------------AWVDFIIQCVSSVTFSFN
                                                                                           WVD I++CV++V++S  
Subjt:  ----------------------------------------------------------------------------------AWVDFIIQCVSSVTFSFN

Query:  LNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTINYEK
        +NG +   ++PTRGLRQGDPLSPYLF+ CAEGLS LL++    G +                                  I+  L +YE  SG+ +NY K
Subjt:  LNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTINYEK

Query:  SVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTR
        S + +S NT N  ++ ++Q+L V  +P   +YLGLPSF  R
Subjt:  SVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTR

XP_030478037.1 uncharacterized protein LOC115695084 [Cannabis sativa]4.3e-4126.12Show/hide
Query:  ETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVEL----------VLNPLPRCWAIRQR--IKRFDETWLRYSDLQDSVRS-SWASSSSGTHPDN
        +TI ERLD CF+++SW   +   V +HLDY   DHR++ +           +  L  C    Q+   ++F +     S +Q SV + + A+  S +H   
Subjt:  ETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVEL----------VLNPLPRCWAIRQR--IKRFDETWLRYSDLQDSVRS-SWASSSSGTHPDN

Query:  LANRAKCCMQSMAG----WGRR-----------------------------RNCTGNNDLESCGCEKGTGTPVGFINVL-PTEG--KQIRFGALQDVSPC
        L N      + +A     W +R                             +  T ++ +   G    T     + + L  +EG         L  +   
Subjt:  LANRAKCCMQSMAG----WGRR-----------------------------RNCTGNNDLESCGCEKGTGTPVGFINVL-PTEG--KQIRFGALQDVSPC

Query:  VDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA--------------------
        +   MN+ L RPFT D+I+ AL+   P K+P  DG+S  FY+N+WNIVG  V +  L VLN+G     LN ++I LIPK                     
Subjt:  VDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA--------------------

Query:  --------------------------------------------------------------------------------------WVDFIIQCVSSVTF
                                                                                              WV  I+ C+++ +F
Subjt:  --------------------------------------------------------------------------------------WVDFIIQCVSSVTF

Query:  SFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTIN
        SF+LNGE +GHV P RGLRQGDPLSPYLFL C+EGLS LL  +   G +                                  I+  L +Y RASG+ +N
Subjt:  SFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTIN

Query:  YEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHK
          KS ++FS NT    Q + +Q L++  + CH +YLGLPS+  R++ E+F    +++ K
Subjt:  YEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHK

XP_030936142.1 uncharacterized protein LOC115961277 [Quercus lobata]6.6e-4226.09Show/hide
Query:  SETKVSSNRMNYVKQVLGFDCCFSVDCLGRSGGLALLLDSAV-----SFANDLLDFMV-------------------SHRLRPALDIVPLVRRQ------
        SETKV   RM  +K+ +GF     V C GRSGGLALL          SF++  +D ++                   +H  + + D++  +  Q      
Subjt:  SETKVSSNRMNYVKQVLGFDCCFSVDCLGRSGGLALLLDSAV-----SFANDLLDFMV-------------------SHRLRPALDIVPLVRRQ------

Query:  -LNGWNVRIQSSKKDLGHPYSLASSKGSVTTI----FFQLIFVALSIL-CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNP
            +N  +   +K  G   S          +    F  L F+      CN + G   +Y RLDR F++S W   +    VNHL     DH ++ +    
Subjt:  -LNGWNVRIQSSKKDLGHPYSLASSKGSVTTI----FFQLIFVALSIL-CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNP

Query:  LPRCWAIRQRIKR--FDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWG------------RRRNCTGN---NDLES-CGCEKG----
         P+    R R  R  F+  W +  + +D ++++W+S+S+    + +      C   +  W              +R    N   +D++   G E      
Subjt:  LPRCWAIRQRIKR--FDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWG------------RRRNCTGN---NDLES-CGCEKG----

Query:  TGTPVGFINVLPTEGKQIRFGALQDVSPC-VDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHW---NIV--------------GSDVI
            + +   + +     +  A+ D+ P  V E MN EL R FT+++++ ALKQ HP K+   D    +F  N     N++              G+D  
Subjt:  TGTPVGFINVLPTEGKQIRFGALQDVSPC-VDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHW---NIV--------------GSDVI

Query:  QSCLVVLNQGFPPCSLNDTMIVL----IPKAWVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLF--------LFCAEGLSSLLREKAFEG
         +  + +++ F     +   +++      ++W+  +++C++SVT+S  +NG   G +VPTRGLRQGDPLSPYLF        LF  +      +    E 
Subjt:  QSCLVVLNQGFPPCSLNDTMIVL----IPKAWVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLF--------LFCAEGLSSLLREKAFEG

Query:  TVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCE
          ++++L  YE ASG+ IN +KS + FS N+  EC+  I  IL       H +YLGL   + R++  +F E
Subjt:  TVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCE

XP_030946032.1 uncharacterized protein LOC115970553 [Quercus lobata]1.9e-4127.95Show/hide
Query:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAK
        CN  P G T++ERLDR  ++  W   +P   V HL+Y   DH+ + + LN +P+    R +  RF+  WL     +D+V ++W           +   + 
Subjt:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAK

Query:  C-CMQSMAGWGRRRN---CTGNNDLESCGCEKGTGTPV--GFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGL
             S A    RRN      N+  E C  E G    +   + N+  +        AL+     V E MN+ L+ PF   ++ LALKQ  P KA   DG+
Subjt:  C-CMQSMAGWGRRRN---CTGNNDLESCGCEKGTGTPV--GFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGL

Query:  SGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA--------------------------------------------------------
           F+++ W  +G DV+++ L  LN G  P  +N T I LIPK                                                         
Subjt:  SGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA--------------------------------------------------------

Query:  --------------------------------------------------WVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGL
                                                          WV  + +C+SSV++S  +NGE  G + P+RGLRQGDPLSPYLFL C+EGL
Subjt:  --------------------------------------------------WVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGL

Query:  SSLLREKAFEG--TVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILSV
        + +L+  A E    VI+D+L LYE+ASG+ +N EK+ + FS     + +  +S  L V
Subjt:  SSLLREKAFEG--TVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILSV

TrEMBL top hitse value%identityAlignment
A0A2N9G3M1 Uncharacterized protein2.6e-4428.54Show/hide
Query:  NRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKC
        NRR G    + RLDR   S +W +++PN  + H+     DH ++ L      + +  R+   RF++ W++    +  V  +W +   GT    +  + K 
Subjt:  NRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKC

Query:  CMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKN
        C   +  W R R        +S   ++     +  +      G QI   A   +   V + MN  LL P TE+++ +AL Q +P KAP  DG+S  F++ 
Subjt:  CMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKN

Query:  HWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA------------------------------------------------WVDFIIQCVSSVTFS
        +W+++G+D+  + L  ++      S+N T I LIPKA                                                WVD +++C+S+V++S
Subjt:  HWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKA------------------------------------------------WVDFIIQCVSSVTFS

Query:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLRE----------------------------------KAFEGTVIRDLLLLYERASGRTINY
          LNG   GH++P+RGLRQGDPLSPYLFL CAEG +SL+R+                                  K  E   +R++L LYE ASG+ +N 
Subjt:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLRE----------------------------------KAFEGTVIRDLLLLYERASGRTINY

Query:  EKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLP
        EK+ + FS NT    ++ I   + V       +YLGLP
Subjt:  EKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLP

A0A2N9GW00 Uncharacterized protein1.5e-4428.63Show/hide
Query:  DIVPLVRRQLNGWNVRIQSSKKDLGHPYSLASSKGSVTTIFFQLIFVALSILCNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELV
        D   +VR  +     R+QS ++  G P  +    G +   F  L F      CN R G  T++ RLDR  +++ W   +   V++HL+    DH+ + L 
Subjt:  DIVPLVRRQLNGWNVRIQSSKKDLGHPYSLASSKGSVTTIFFQLIFVALSILCNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELV

Query:  LNP--LPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWGRRRN-------CTGNNDLESCGCEKGTGTPVGFINV
          P  +PR    R+R+ R ++ W      + +V  +W S + G+    + ++ + C + +  W R  N                   +K TG  V   N+
Subjt:  LNP--LPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWGRRRN-------CTGNNDLESCGCEKGTGTPVGFINV

Query:  LPTEGK---QIRFGA---------LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQG------
        + T+     Q  F A         L  + PCV   MN+ L   FTE++++ A+KQ  P KAP  DG+   FY+++W++VG DV  + L  L+ G      
Subjt:  LPTEGK---QIRFGA---------LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQG------

Query:  -----------FPPCSLNDTMIVL-IPKAWVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTVIRDLLLLY
                        L   M+ +   + W+  I++C+S V++S  +NGE  GH+ PTRGLRQ D L     LFC        R   F+   I+++L++Y
Subjt:  -----------FPPCSLNDTMIVL-IPKAWVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTVIRDLLLLY

Query:  ERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSE
        E+ASG+ +N  K+ + F  NT    Q+ I  IL V     + +YLGLPSF+ + +   F +  + +   VK   E
Subjt:  ERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSE

A0A2N9HZ73 Reverse transcriptase domain-containing protein5.3e-4527.68Show/hide
Query:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAK
        CN R G  T + RLDR  +++ W   +P+ VV HLD       S E                  F++ W      +++V  +W   +       + +   
Subjt:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAK

Query:  CCMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYK
        CC + +  W      +    +      K     +       T G+ ++   L+ +   V   +N +LLRP+TE ++ +ALKQ  P KAP  D +   FY+
Subjt:  CCMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYK

Query:  NHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK-------------------------------------AWVDFIIQCVSSVTFSFNLNGEKLGH
        ++W  +G+DV+Q+ L  +N G    S+N T + LIPK                                      WV  I++C+++V++S  +NGE  GH
Subjt:  NHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK-------------------------------------AWVDFIIQCVSSVTFSFNLNGEKLGH

Query:  VVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTINYEKSVVAFSLN
        +  +RGLRQGDP+SPYLFL CAEGL+ LL ++A  G +                                  I+++L +YE+ASG+ +N  K+ + FS N
Subjt:  VVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLLLYERASGRTINYEKSVVAFSLN

Query:  TGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQA
        T    Q+ I  IL V     + +YLGLPS + + + + F +  + +   VK   E      LL QA
Subjt:  TGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQA

A0A2N9J3G2 Fe2OG dioxygenase domain-containing protein4.2e-5027.77Show/hide
Query:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPL--PRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANR
        CN R G  T + RLDR  +++ W + + +  V+H++    DH+ + +  +P+  PR    RQ++ RF++ W  + D +  V  +W   + G+    +  +
Subjt:  CNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPL--PRCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANR

Query:  AKCCMQSMAGWGRRR--NCT-------------------------GNNDLESCGCEKG---TGTPVGFINVLPTEGKQIRFGA---------LQDVSPCV
         + C + ++ W R +  N T                           N +++   E G   TG+    I    T+  Q  F A         L  + PCV
Subjt:  AKCCMQSMAGWGRRR--NCT-------------------------GNNDLESCGCEKG---TGTPVGFINVLPTEGKQIRFGA---------LQDVSPCV

Query:  DEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQG-----------------FPPCSLNDTMIVL-IPKAWVD
         + MN+ L+  FTE++++ A+KQ  P KAP  DG+   FY+++W++VG D+  + L  L+ G                      L   M+ +     W+ 
Subjt:  DEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQG-----------------FPPCSLNDTMIVL-IPKAWVD

Query:  FIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLL
         I++C+S+VT+S  +NG   GH+ PTRGLRQGDP+SPYLFL CAEGL+ L+R+ + +G +                                  I+++L 
Subjt:  FIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEGTV----------------------------------IRDLLL

Query:  LYERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQAMEKQIEKSL
        +YE+ASG+ +N  K+ + FS NT    Q+ + +IL V     + +YLGLPS + + +   F +  + +   VK   E      LL QA  + + K++
Subjt:  LYERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMSELCQELNLLDQAMEKQIEKSL

A0A803PVM0 Uncharacterized protein6.9e-4526.64Show/hide
Query:  RPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIK----RFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRA
        R   + I ERLD CF ++ W E +   V  HLDY + DHR+  L +  LP+    +Q+ K    RF++ WL+  +  + ++ +W   S+        N  
Subjt:  RPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLPRCWAIRQRIK----RFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRA

Query:  KCCMQSMAGWGRRR-------------------NCTG-------------------------------------------------------NNDLESCG
          C  S+  W +++                   N T                                                        +N ++S  
Subjt:  KCCMQSMAGWGRRR-------------------NCTG-------------------------------------------------------NNDLESCG

Query:  CEKGT--GTPVGFINVLPTEGKQIRFGA-----------LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQ
          +GT   +  G  +++ +    + F A           L  +   V   MN  L++PFT ++I  ALK  +P K+P  DG+S  FY  +W IVG  V +
Subjt:  CEKGT--GTPVGFINVLPTEGKQIRFGA-----------LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQ

Query:  SCLVVLNQGFPPCSLNDTMIVLIPKA--------------------------------------------------------WVDFIIQCVSSVTFSFNL
          L VLN G    ++N ++I LIPK                                                         WVD I+ C++S  FSF L
Subjt:  SCLVVLNQGFPPCSLNDTMIVLIPKA--------------------------------------------------------WVDFIIQCVSSVTFSFNL

Query:  NGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAF----------------------------------EGTVIRDLLLLYERASGRTINYEKS
        NGE++GHV PTRGLRQGDPLSPYLFL C+EGLS LL+ +                                        I+ +L +Y +ASG+ +N  KS
Subjt:  NGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAF----------------------------------EGTVIRDLLLLYERASGRTINYEKS

Query:  VVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCE
        V++FS NT    Q +    L +  S CH +YLGLP+F  R++ E+F +
Subjt:  VVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCE

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein2.6e-0431.4Show/hide
Query:  LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK
        L D  P V E     L  P T D++  AL+    +K+P LDGL+  F++  W+ +G D  +       +G  P S    ++ L+PK
Subjt:  LQDVSPCVDEGMNRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPK

P92555 Uncharacterized mitochondrial protein AtMg012501.9e-0758.7Show/hide
Query:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEG
        F +NG   G V P+RGLRQGDPLSPYLF+ C E LS L R    +G
Subjt:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEG

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-0858.7Show/hide
Query:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEG
        F +NG   G V P+RGLRQGDPLSPYLF+ C E LS L R    +G
Subjt:  FNLNGEKLGHVVPTRGLRQGDPLSPYLFLFCAEGLSSLLREKAFEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGTCAAGAGCATGGCAGGGGAGGCAGAGGTTGGGGAAGGGGTCATGGGCATATTGGGGGACCAACATATAGATTTTGTTAGTGAACGCGCTCAGATTGAGGTCGT
GCCTACTGAAGTCCTATCTACTAATGTTTTACCCACTGAGGTTTTACCCACTGAGGTTTTGCCTACTAAGGGTCTACTGACTGAGGTCCAAAATACTGAGGTCTTTCCTA
CTGAGGGTCTGCCTACCAGGTCTTCTTCCCATCTTGATAAAGGAAAGGCGGTGGTTGGGGAGGCTGTCTCTCTAGCGGAGAAGGTAGTAAGCACAGCATCGGGTAAGAAG
AGTTGGAAAAGAATGGCCAGGGGAATTTGGCGGAGGCTGGTTGCTAGCCCCGCCGAGGATTATGAGTATGATATTTTGGAATGTTTGAAGTCTGAGGTCACCTCAAGCGC
TCCGACGCTTGACCAAGTTGGTACAAGCAAAACGACCCCTGGTGCTCTTTTTTCTGAAACTAAGGTATCTTCGAACAGGATGAATTATGTGAAACAAGTTTTGGGCTTTG
ATTGTTGCTTTAGCGTTGATTGTTTGGGTAGGAGTGGTGGTTTAGCTCTTCTGTTGGATTCTGCTGTCTCTTTTGCTAACGACTTACTGGATTTTATGGTCTCCCATCGG
CTGAGACCTGCATTAGACATCGTCCCTTTGGTGCGTCGTCAGCTGAATGGATGGAATGTTAGAATTCAGTCATCTAAGAAGGACCTCGGGCATCCCTACTCATTGGCGTC
GAGCAAGGGTTCTGTCACCACCATTTTCTTCCAATTGATCTTTGTGGCCTTGTCGATTTTGTGCAACAGGAGGCCAGGGGGTGAAACGATTTATGAGCGTTTGGACCGAT
GTTTTAGTTCGTCTTCATGGCAGGAGATTTATCCAAATTTTGTAGTAAATCATCTGGATTACTGCAAATATGATCATCGATCTGTGGAATTAGTTTTAAATCCTCTACCC
CGATGTTGGGCCATTCGTCAGCGAATAAAAAGGTTTGATGAAACTTGGCTTAGATATTCCGACCTCCAGGATTCGGTTCGATCTTCATGGGCCTCGTCTTCCTCAGGTAC
TCATCCAGATAATCTTGCAAATAGGGCTAAATGTTGTATGCAATCTATGGCAGGTTGGGGGAGGAGGAGGAATTGTACTGGAAACAACGATCTTGAGAGTTGTGGTTGCG
AGAAGGGGACAGGAACTCCCGTTGGTTTCATCAACGTGCTTCCTACAGAAGGAAAACAAATCAGATTTGGGGCCTTACAGGATGTTAGCCCATGTGTTGATGAAGGTATG
AATCGTGAATTGTTGCGCCCTTTTACAGAAGATGACATCCTTTTGGCATTGAAACAGACCCACCCTCATAAGGCTCCTAGGCTCGATGGCTTGTCAGGTAGCTTCTATAA
GAACCATTGGAATATTGTGGGTTCAGATGTGATTCAGAGTTGTTTGGTCGTACTTAATCAAGGTTTTCCTCCATGTTCCCTGAATGATACTATGATTGTCCTCATTCCTA
AGGCATGGGTGGATTTCATTATTCAATGTGTTAGTTCGGTCACTTTCTCATTTAATCTGAATGGTGAGAAGCTTGGGCATGTGGTTCCAACGAGAGGCCTTAGGCAGGGG
GATCCCTTGTCGCCTTATCTTTTCTTATTCTGTGCGGAAGGGCTTTCTAGTTTGCTGCGAGAGAAGGCATTTGAGGGCACGGTTATCCGAGACTTATTGTTGCTTTATGA
GAGAGCCTCTGGGCGAACCATTAATTATGAGAAGTCTGTAGTTGCTTTCAGTCTGAATACTGGGAATGAATGTCAACAGTATATTAGCCAGATTTTATCGGTGTCTTGCA
GTCCTTGCCACCATCAGTATCTAGGCCTTCCCTCATTTATGACTCGTAATCGGTCAGAAGTCTTTTGTGAAAATTTGGATGAAATGCATAAATCAGTGAAGTACATGTCT
GAATTATGTCAAGAGTTGAATTTGTTGGACCAAGCAATGGAGAAACAAATCGAAAAGTCCCTTCAAGAAGAAGAAGATTGTCCCATGCCTCAAGATGATGTTTCAAATGA
TGAAGTTGTTATTGATGATGTGGCAACCAACAAGGAAAGGCAACAAATGAAGCGATGTTTGTATTTTGCTTCATCGTCCATTCATGGTCGACAGTCAAAAGTGAAGAACT
TGAGGAGAGCAAGATTGCGAGATGAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCGAGAAGAATCCTCTTGAAGAAGGGGAGGATGATACGAATCGAGGT
GTCTTAATCATGCTAAGAAAACTTCAAGCTAGTGGTCATTATAGCATTGATGGACATGAAAGTTACAACCCTTCATATGACCATAGGCTCACACAATCTCATATCTGGAT
GAGGCGAGGTAGACTCGGTAGTCCAGGATCGCGTAATGTGATGGGCTCAAGCCAAATGAGGCGAGGTAAACTCGATAGTCCAGGCTCGCGTAATGTGATGGGCTCAAGCC
AGATGAGGCGGGGTAGACCCGTGGTCAAGGCTCACATAATCTCAAATCTGGATGAGGCGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGTCAAGAGCATGGCAGGGGAGGCAGAGGTTGGGGAAGGGGTCATGGGCATATTGGGGGACCAACATATAGATTTTGTTAGTGAACGCGCTCAGATTGAGGTCGT
GCCTACTGAAGTCCTATCTACTAATGTTTTACCCACTGAGGTTTTACCCACTGAGGTTTTGCCTACTAAGGGTCTACTGACTGAGGTCCAAAATACTGAGGTCTTTCCTA
CTGAGGGTCTGCCTACCAGGTCTTCTTCCCATCTTGATAAAGGAAAGGCGGTGGTTGGGGAGGCTGTCTCTCTAGCGGAGAAGGTAGTAAGCACAGCATCGGGTAAGAAG
AGTTGGAAAAGAATGGCCAGGGGAATTTGGCGGAGGCTGGTTGCTAGCCCCGCCGAGGATTATGAGTATGATATTTTGGAATGTTTGAAGTCTGAGGTCACCTCAAGCGC
TCCGACGCTTGACCAAGTTGGTACAAGCAAAACGACCCCTGGTGCTCTTTTTTCTGAAACTAAGGTATCTTCGAACAGGATGAATTATGTGAAACAAGTTTTGGGCTTTG
ATTGTTGCTTTAGCGTTGATTGTTTGGGTAGGAGTGGTGGTTTAGCTCTTCTGTTGGATTCTGCTGTCTCTTTTGCTAACGACTTACTGGATTTTATGGTCTCCCATCGG
CTGAGACCTGCATTAGACATCGTCCCTTTGGTGCGTCGTCAGCTGAATGGATGGAATGTTAGAATTCAGTCATCTAAGAAGGACCTCGGGCATCCCTACTCATTGGCGTC
GAGCAAGGGTTCTGTCACCACCATTTTCTTCCAATTGATCTTTGTGGCCTTGTCGATTTTGTGCAACAGGAGGCCAGGGGGTGAAACGATTTATGAGCGTTTGGACCGAT
GTTTTAGTTCGTCTTCATGGCAGGAGATTTATCCAAATTTTGTAGTAAATCATCTGGATTACTGCAAATATGATCATCGATCTGTGGAATTAGTTTTAAATCCTCTACCC
CGATGTTGGGCCATTCGTCAGCGAATAAAAAGGTTTGATGAAACTTGGCTTAGATATTCCGACCTCCAGGATTCGGTTCGATCTTCATGGGCCTCGTCTTCCTCAGGTAC
TCATCCAGATAATCTTGCAAATAGGGCTAAATGTTGTATGCAATCTATGGCAGGTTGGGGGAGGAGGAGGAATTGTACTGGAAACAACGATCTTGAGAGTTGTGGTTGCG
AGAAGGGGACAGGAACTCCCGTTGGTTTCATCAACGTGCTTCCTACAGAAGGAAAACAAATCAGATTTGGGGCCTTACAGGATGTTAGCCCATGTGTTGATGAAGGTATG
AATCGTGAATTGTTGCGCCCTTTTACAGAAGATGACATCCTTTTGGCATTGAAACAGACCCACCCTCATAAGGCTCCTAGGCTCGATGGCTTGTCAGGTAGCTTCTATAA
GAACCATTGGAATATTGTGGGTTCAGATGTGATTCAGAGTTGTTTGGTCGTACTTAATCAAGGTTTTCCTCCATGTTCCCTGAATGATACTATGATTGTCCTCATTCCTA
AGGCATGGGTGGATTTCATTATTCAATGTGTTAGTTCGGTCACTTTCTCATTTAATCTGAATGGTGAGAAGCTTGGGCATGTGGTTCCAACGAGAGGCCTTAGGCAGGGG
GATCCCTTGTCGCCTTATCTTTTCTTATTCTGTGCGGAAGGGCTTTCTAGTTTGCTGCGAGAGAAGGCATTTGAGGGCACGGTTATCCGAGACTTATTGTTGCTTTATGA
GAGAGCCTCTGGGCGAACCATTAATTATGAGAAGTCTGTAGTTGCTTTCAGTCTGAATACTGGGAATGAATGTCAACAGTATATTAGCCAGATTTTATCGGTGTCTTGCA
GTCCTTGCCACCATCAGTATCTAGGCCTTCCCTCATTTATGACTCGTAATCGGTCAGAAGTCTTTTGTGAAAATTTGGATGAAATGCATAAATCAGTGAAGTACATGTCT
GAATTATGTCAAGAGTTGAATTTGTTGGACCAAGCAATGGAGAAACAAATCGAAAAGTCCCTTCAAGAAGAAGAAGATTGTCCCATGCCTCAAGATGATGTTTCAAATGA
TGAAGTTGTTATTGATGATGTGGCAACCAACAAGGAAAGGCAACAAATGAAGCGATGTTTGTATTTTGCTTCATCGTCCATTCATGGTCGACAGTCAAAAGTGAAGAACT
TGAGGAGAGCAAGATTGCGAGATGAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCGAGAAGAATCCTCTTGAAGAAGGGGAGGATGATACGAATCGAGGT
GTCTTAATCATGCTAAGAAAACTTCAAGCTAGTGGTCATTATAGCATTGATGGACATGAAAGTTACAACCCTTCATATGACCATAGGCTCACACAATCTCATATCTGGAT
GAGGCGAGGTAGACTCGGTAGTCCAGGATCGCGTAATGTGATGGGCTCAAGCCAAATGAGGCGAGGTAAACTCGATAGTCCAGGCTCGCGTAATGTGATGGGCTCAAGCC
AGATGAGGCGGGGTAGACCCGTGGTCAAGGCTCACATAATCTCAAATCTGGATGAGGCGAGGTAG
Protein sequenceShow/hide protein sequence
MGVKSMAGEAEVGEGVMGILGDQHIDFVSERAQIEVVPTEVLSTNVLPTEVLPTEVLPTKGLLTEVQNTEVFPTEGLPTRSSSHLDKGKAVVGEAVSLAEKVVSTASGKK
SWKRMARGIWRRLVASPAEDYEYDILECLKSEVTSSAPTLDQVGTSKTTPGALFSETKVSSNRMNYVKQVLGFDCCFSVDCLGRSGGLALLLDSAVSFANDLLDFMVSHR
LRPALDIVPLVRRQLNGWNVRIQSSKKDLGHPYSLASSKGSVTTIFFQLIFVALSILCNRRPGGETIYERLDRCFSSSSWQEIYPNFVVNHLDYCKYDHRSVELVLNPLP
RCWAIRQRIKRFDETWLRYSDLQDSVRSSWASSSSGTHPDNLANRAKCCMQSMAGWGRRRNCTGNNDLESCGCEKGTGTPVGFINVLPTEGKQIRFGALQDVSPCVDEGM
NRELLRPFTEDDILLALKQTHPHKAPRLDGLSGSFYKNHWNIVGSDVIQSCLVVLNQGFPPCSLNDTMIVLIPKAWVDFIIQCVSSVTFSFNLNGEKLGHVVPTRGLRQG
DPLSPYLFLFCAEGLSSLLREKAFEGTVIRDLLLLYERASGRTINYEKSVVAFSLNTGNECQQYISQILSVSCSPCHHQYLGLPSFMTRNRSEVFCENLDEMHKSVKYMS
ELCQELNLLDQAMEKQIEKSLQEEEDCPMPQDDVSNDEVVIDDVATNKERQQMKRCLYFASSSIHGRQSKVKNLRRARLRDEKISSLHYKGILGFEKNPLEEGEDDTNRG
VLIMLRKLQASGHYSIDGHESYNPSYDHRLTQSHIWMRRGRLGSPGSRNVMGSSQMRRGKLDSPGSRNVMGSSQMRRGRPVVKAHIISNLDEAR