; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029126 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029126
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153210:3461096..3466346
RNA-Seq ExpressionSgr029126
SyntenySgr029126
Gene Ontology termsGO:0010468 - regulation of gene expression (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR039349 - Protein PLASTID REDOX INSENSITIVE 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.8e-8864.75Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP+V  S NGFRYY+SFVD YSR+TWIYFL SKSD +  F  F+T +EK LG  I+ +QTDGG EF+   P+L  +GI HR +CPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        N IVERKHR+I++MGLTLLSQA+LPL FWD+AFST+VY INRLP+ VL +ISP+E+LF  KP +  L+ FGC C+P LRPY SHKL  RS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV
         HKGYKC+ S GR+FISRHV FDENSFPY    S S  PKS  V
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV

KAG8502752.1 hypothetical protein CXB51_000614 [Gossypium anomalum]4.0e-8242.93Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ +D+WGP+   S NGFRYYV+F D ++R+TW+YFL  KS+V + F  F   +E+ LG  ++ +QTDGGGEF+ L  YL   GI HR SCPY+S Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGIVERKHR IV+ GL++L+ A++PL +W DAFSTAVY INRLPS  L ++SP E+LF   P YS L+TFGC+CFP LRPYN+HKL FRS+PCTFLG S 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYS--------ISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPAMLSSCDADNSNSLNSNSTGSTSTLPA
         HKGY+C D++GRI++SRHV F+E  FP+         IS  P++S      ++ +S  P +      T+Q P++++SC    S S N  S  S +    
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYS--------ISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPAMLSSCDADNSNSLNSNSTGSTSTLPA

Query:  VDNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQPFVSIISNGQSNIHPMTS--QGRRPKSDWVAAQDHAMDPMRAFQIDAHTKLGLSLGVGYPIQWRYT
        + +T   L SPP+ +  +P        + PC +  F  +I+  ++ +    +        S+ V A  HA     ++++  H +L       + +    T
Subjt:  VDNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQPFVSIISNGQSNIHPMTS--QGRRPKSDWVAAQDHAMDPMRAFQIDAHTKLGLSLGVGYPIQWRYT

Query:  IALCSRPLPS
         +LC  PLPS
Subjt:  IALCSRPLPS

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]1.8e-8566.07Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        P  ++ SD+WGP+  PSRNG RYY+SFVD Y+R+TWIYFL+ KS+V  TFI+F+ + E      I+ +QTDGGGEFR+LT Y +SNGI HRFSCPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+VERKHRH+VD GL+LL+ ASLP EFW+DAF +AVY INRLPS  L   SP   L+  +P YS L+ FGCLCFPCLRPYN+HKL FRS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDE
         HKGYKC+ SSGR++ISRHVQF+E
Subjt:  IHKGYKCMDSSGRIFISRHVQFDE

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.3e-8250.29Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LFN+KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSH----LPITPM--TCEVTTQKPAM-LSSCDADNSNSLNSN-----------
         HKGYKC++  GR+FISR V FDE  FP++   +            +VSH    LP  P+    E  +  P++ L +  A +S+ L+ N           
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSH----LPITPM--TCEVTTQKPAM-LSSCDADNSNSLNSN-----------

Query:  --STGSTSTLPAVDNTLPALQSPPNSNLTA-PFNADVPTGTD
          +T S+ST+P ++    +   P +SNL A P    + T +D
Subjt:  --STGSTSTLPAVDNTLPALQSPPNSNLTA-PFNADVPTGTD

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]6.4e-8864.75Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP+V  S NGFRYY+SFVD YSR+TWIYFL SKSD +  F  F+T +EK LG  I+ +QTDGG EF+   P+L  +GI HR +CPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        N IVERKHR+I++MGLTLLSQA+LPL FWD+AFST+VY INRLP+ VL +ISP+E+LF  KP +  L+ FGC C+P LRPY SHKL  RS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV
         HKGYKC+ S GR+FISRHV FDENSFPY    S S  PKS  V
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein8.5e-8666.07Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        P  ++ SD+WGP+  PSRNG RYY+SFVD Y+R+TWIYFL+ KS+V  TFI+F+ + E      I+ +QTDGGGEFR+LT Y +SNGI HRFSCPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+VERKHRH+VD GL+LL+ ASLP EFW+DAF +AVY INRLPS  L   SP   L+  +P YS L+ FGCLCFPCLRPYN+HKL FRS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDE
         HKGYKC+ SSGR++ISRHVQF+E
Subjt:  IHKGYKCMDSSGRIFISRHVQFDE

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-8250.29Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGP+   S  GF YYVSFVD YSR+TW+YFL++KS     F+ F+   E   G  ++  QTD GGEFR+L  Y   NGI HR SCP+TS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NGI+ERKHRHIV++GLTLL+QASLPL++W DAFSTAV+ INRLP+ VL    P E LFN+KP YS LK FGCLCFP LRPYN HKL FRSSPCTFLGYS+
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSH----LPITPM--TCEVTTQKPAM-LSSCDADNSNSLNSN-----------
         HKGYKC++  GR+FISR V FDE  FP++   +            +VSH    LP  P+    E  +  P++ L +  A +S+ L+ N           
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSH----LPITPM--TCEVTTQKPAM-LSSCDADNSNSLNSN-----------

Query:  --STGSTSTLPAVDNTLPALQSPPNSNLTA-PFNADVPTGTD
          +T S+ST+P ++    +   P +SNL A P    + T +D
Subjt:  --STGSTSTLPAVDNTLPALQSPPNSNLTA-PFNADVPTGTD

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-8864.75Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP+V  S NGFRYY+SFVD YSR+TWIYFL SKSD +  F  F+T +EK LG  I+ +QTDGG EF+   P+L  +GI HR +CPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        N IVERKHR+I++MGLTLLSQA+LPL FWD+AFST+VY INRLP+ VL +ISP+E+LF  KP +  L+ FGC C+P LRPY SHKL  RS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV
         HKGYKC+ S GR+FISRHV FDENSFPY    S S  PKS  V
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-8864.75Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL LI  D+WGP+V  S NGFRYY+SFVD YSR+TWIYFL SKSD +  F  F+T +EK LG  I+ +QTDGG EF+   P+L  +GI HR +CPYTS+Q
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        N IVERKHR+I++MGLTLLSQA+LPL FWD+AFST+VY INRLP+ VL +ISP+E+LF  KP +  L+ FGC C+P LRPY SHKL  RS+PCTFLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV
         HKGYKC+ S GR+FISRHV FDENSFPY    S S  PKS  V
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPY----SISYKPKSSYV

A0A803Q9W1 Uncharacterized protein4.7e-8461.38Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL L+ SD+WGPS  PS NG++YY+ FVD YSRFTWIY L+ KSD   TF  F+   E  LG  I+ +QTD GGEFR+ T +L  NGI HR  CP T QQ
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHRHIV+ GL LL+QASLPL+FWD+AF TAVY  NRLP+ +L   SP+E LF+TKP Y+  KTFGC C+P +RPYN HKL+FRSSPCTF+GYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYK---PKSSYVTRC
         HKGYKC+DS+GR++ISR V FDE SFPY  + K   P  S++T C
Subjt:  IHKGYKCMDSSGRIFISRHVQFDENSFPYSISYK---PKSSYVTRC

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.8e-2530.14Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTS
        PL ++ SDV GP    + +   Y+V FVD ++ +   Y ++ KSDV+S F  F    E    L +  +  D G E+    +  +    GIS+  + P+T 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTS

Query:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDIS--PMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFL
        Q NG+ ER  R I +   T++S A L   FW +A  TA Y INR+PS  L D S  P E   N KP    L+ FG   +  ++     K   +S    F+
Subjt:  QQNGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDIS--PMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFL

Query:  GYSNIHKGYKCMDSSGRIFI-SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPAMLSSCD-----ADNSNSLNSN
        GY     G+K  D+    FI +R V  DE +   S + K ++ ++     +   + P       + T+ P     CD      D+  S N N
Subjt:  GYSNIHKGYKCMDSSGRIFI-SRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPAMLSSCD-----ADNSNSLNSN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-3637.78Show/hide
Query:  LIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTSQQN
        L+ SDV GP    S  G +Y+V+F+D  SR  W+Y L++K  V+  F  F   +E+  G  ++ +++D GGE+  R    Y  S+GI H  + P T Q N
Subjt:  LIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEF--RALTPYLRSNGISHRFSCPYTSQQN

Query:  GIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNI
        G+ ER +R IV+   ++L  A LP  FW +A  TA Y INR PS  L    P     N +  YS LK FGC  F  +      KL  +S PC F+GY + 
Subjt:  GIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNI

Query:  HKGYKCMDS-SGRIFISRHVQFDEN
          GY+  D    ++  SR V F E+
Subjt:  HKGYKCMDS-SGRIFISRHVQFDEN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-6041.47Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL  I SDVW   +  S + +RYYV FVD ++R+TW+Y L+ KS V  TFI+F+  +E      I    +D GGEF AL  Y   +GISH  S P+T + 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHRHIV+ GLTLLS AS+P  +W  AF+ AVY INRLP+ +L   SP ++LF T P Y  L+ FGC C+P LRPYN HKL  +S  C FLGYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS---------ISYKPKSSYVTRCDNAVVSHLPITPM-TCEVTTQKPAMLSSCDADNSNSLNSNS---TGS
            Y C+   + R++ISRHV+FDEN FP+S            + +SS V      + +  P+ P  +C          SS  A   NS  S+S   +  
Subjt:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS---------ISYKPKSSYVTRCDNAVVSHLPITPM-TCEVTTQKPAMLSSCDADNSNSLNSNS---TGS

Query:  TSTLPAVDNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQ
        +S+ P+        Q+ P    T P      T +   TSQ
Subjt:  TSTLPAVDNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQ

Q9XIK0 Protein PLASTID REDOX INSENSITIVE 2, chloroplastic2.2e-2253.85Show/hide
Query:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW
        ET KFR+ +  KL+K R+ F + +D +V VC++IF  +L  EYGGPGTLLV PF +M   LNER+LPG P AAR ++ WAQ+H+D+DW  W
Subjt:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-6343.93Show/hide
Query:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ
        PL  I SDVW   +  S + +RYYV FVD ++R+TW+Y L+ KS V  TFI F++ +E      I  + +D GGEF  L  YL  +GISH  S P+T + 
Subjt:  PLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQ

Query:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN
        NG+ ERKHRHIV+MGLTLLS AS+P  +W  AFS AVY INRLP+ +L   SP ++LF   P Y  LK FGC C+P LRPYN HKL+ +S  C F+GYS 
Subjt:  NGIVERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSN

Query:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS-ISYKPKSSYVTRCDNAV--VSH--LPITPMTCEV-TTQKPAMLSSCDADNSNSLNSNSTGSTSTLPAV
            Y C+   +GR++ SRHVQFDE  FP+S  ++   +S   R D+A    SH  LP TP+         P + +S    +S S    +  S+S LP+ 
Subjt:  IHKGYKCMD-SSGRIFISRHVQFDENSFPYS-ISYKPKSSYVTRCDNAV--VSH--LPITPMTCEV-TTQKPAMLSSCDADNSNSLNSNSTGSTSTLPAV

Query:  DNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQPFVSIISNGQSNI
             ++ SP +S  TAP +     G  P T+QP  +  SN  S I
Subjt:  DNTLPALQSPPNSNLTAPFNADVPTGTDPCTSQPFVSIISNGQSNI

Arabidopsis top hitse value%identityAlignment
AT1G10522.1 unknown protein1.6e-2353.85Show/hide
Query:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW
        ET KFR+ +  KL+K R+ F + +D +V VC++IF  +L  EYGGPGTLLV PF +M   LNER+LPG P AAR ++ WAQ+H+D+DW  W
Subjt:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW

AT1G10522.2 unknown protein1.6e-2353.85Show/hide
Query:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW
        ET KFR+ +  KL+K R+ F + +D +V VC++IF  +L  EYGGPGTLLV PF +M   LNER+LPG P AAR ++ WAQ+H+D+DW  W
Subjt:  ETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDWNIW

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.7e-0738.24Show/hide
Query:  HRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCF
        +R I++   ++L +  LP  F  DA +TAV+ IN+ PST ++   P E  F + P YS+L+ FGC+ +
Subjt:  HRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTTCTTCTTATTGAATCTGATGTATGGGGTCCTTCTGTCAAACCATCTCGCAATGGTTTTCGATATTATGTTAGCTTTGTTGATGTGTATTCCAGATTT
ACATGGATATATTTTCTTCAATCTAAGTCAGATGTCTATTCCACTTTTATCTCTTTTCGTACTCATATAGAAAAACTTCTTGGATTACCCATTCGCATGATTCAA
ACTGATGGAGGGGGGGAATTTCGCGCTCTCACCCCTTATTTACGCTCCAATGGTATTTCTCATCGCTTTTCTTGTCCTTATACGTCTCAGCAAAACGGTATTGTT
GAACGGAAACATCGTCATATAGTTGACATGGGTCTAACCTTACTTTCTCAAGCATCCTTACCACTTGAGTTTTGGGATGATGCTTTCTCCACAGCAGTTTATACC
ATTAATCGGCTGCCTTCTACAGTCCTGCATGACATCAGTCCAATGGAGCGTTTGTTCAATACGAAACCCCTTTACTCTTTTCTTAAAACTTTTGGCTGCCTGTGT
TTTCCTTGCTTACGTCCATATAACTCTCATAAACTTCAATTTCGTTCTTCTCCTTGTACCTTTCTTGGCTATAGCAATATCCATAAAGGCTATAAATGCATGGAT
TCTTCTGGGCGAATTTTCATTTCCAGACATGTTCAATTTGATGAAAATTCTTTTCCTTATAGTATATCCTACAAGCCCAAGTCTTCTTATGTGACTAGATGTGAT
AATGCGGTAGTTTCTCACTTACCTATTACTCCTATGACATGTGAAGTTACTACCCAAAAACCTGCTATGCTCTCATCTTGTGATGCTGACAATTCAAACTCACTT
AATAGTAATAGTACTGGTTCCACTAGTACTTTGCCAGCTGTTGATAATACTTTACCAGCTCTTCAGTCTCCTCCAAACAGCAATTTAACAGCTCCATTTAATGCT
GATGTGCCTACTGGTACAGACCCCTGTACCTCACAGCCTTTTGTATCTATAATATCCAATGGTCAATCTAATATACATCCAATGACTTCACAGGGTAGGAGGCCC
AAAAGTGATTGGGTTGCGGCCCAAGATCATGCAATGGACCCTATGCGGGCTTTTCAAATCGACGCACACACGAAGTTAGGGCTTAGCTTAGGGGTGGGATACCCC
ATCCAATGGCGCTACACGATTGCGCTCTGCTCTCGGCCTCTGCCGTCCCTCCCGCCAAGATCGCACGCCTCAACGTTGCAGAGCCACTCCGCTCCCGCCCAAGTT
CGTCTATCCCGATCCGATACCGGAATTTGCAGAAGTTGTCGTTTTTCGCAGGAAACCCTGAAATTTAGGGAACAGCTATCGAAGAAGCTCGCAAAGGATCGCGAG
ACATTTGGGAACGACCTCGATTCGGTTGTGGAGGTTTGCTCGAAGATATTTGGTGAATATTTGCATGTGGAGTACGGAGGTCCTGGGACATTATTGGTGGAGCCT
TTCACCAATATGTTCATTGCTCTCAACGAGAGGAAATTACCTGGAGCGCCTTTGGCCGCAAGAACTTCGCTACTATGGGCTCAAAATCATCTAGATCGCGATTGG
AACATTTGGAACTCAAAAAGGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTTCTTCTTATTGAATCTGATGTATGGGGTCCTTCTGTCAAACCATCTCGCAATGGTTTTCGATATTATGTTAGCTTTGTTGATGTGTATTCCAGATTT
ACATGGATATATTTTCTTCAATCTAAGTCAGATGTCTATTCCACTTTTATCTCTTTTCGTACTCATATAGAAAAACTTCTTGGATTACCCATTCGCATGATTCAA
ACTGATGGAGGGGGGGAATTTCGCGCTCTCACCCCTTATTTACGCTCCAATGGTATTTCTCATCGCTTTTCTTGTCCTTATACGTCTCAGCAAAACGGTATTGTT
GAACGGAAACATCGTCATATAGTTGACATGGGTCTAACCTTACTTTCTCAAGCATCCTTACCACTTGAGTTTTGGGATGATGCTTTCTCCACAGCAGTTTATACC
ATTAATCGGCTGCCTTCTACAGTCCTGCATGACATCAGTCCAATGGAGCGTTTGTTCAATACGAAACCCCTTTACTCTTTTCTTAAAACTTTTGGCTGCCTGTGT
TTTCCTTGCTTACGTCCATATAACTCTCATAAACTTCAATTTCGTTCTTCTCCTTGTACCTTTCTTGGCTATAGCAATATCCATAAAGGCTATAAATGCATGGAT
TCTTCTGGGCGAATTTTCATTTCCAGACATGTTCAATTTGATGAAAATTCTTTTCCTTATAGTATATCCTACAAGCCCAAGTCTTCTTATGTGACTAGATGTGAT
AATGCGGTAGTTTCTCACTTACCTATTACTCCTATGACATGTGAAGTTACTACCCAAAAACCTGCTATGCTCTCATCTTGTGATGCTGACAATTCAAACTCACTT
AATAGTAATAGTACTGGTTCCACTAGTACTTTGCCAGCTGTTGATAATACTTTACCAGCTCTTCAGTCTCCTCCAAACAGCAATTTAACAGCTCCATTTAATGCT
GATGTGCCTACTGGTACAGACCCCTGTACCTCACAGCCTTTTGTATCTATAATATCCAATGGTCAATCTAATATACATCCAATGACTTCACAGGGTAGGAGGCCC
AAAAGTGATTGGGTTGCGGCCCAAGATCATGCAATGGACCCTATGCGGGCTTTTCAAATCGACGCACACACGAAGTTAGGGCTTAGCTTAGGGGTGGGATACCCC
ATCCAATGGCGCTACACGATTGCGCTCTGCTCTCGGCCTCTGCCGTCCCTCCCGCCAAGATCGCACGCCTCAACGTTGCAGAGCCACTCCGCTCCCGCCCAAGTT
CGTCTATCCCGATCCGATACCGGAATTTGCAGAAGTTGTCGTTTTTCGCAGGAAACCCTGAAATTTAGGGAACAGCTATCGAAGAAGCTCGCAAAGGATCGCGAG
ACATTTGGGAACGACCTCGATTCGGTTGTGGAGGTTTGCTCGAAGATATTTGGTGAATATTTGCATGTGGAGTACGGAGGTCCTGGGACATTATTGGTGGAGCCT
TTCACCAATATGTTCATTGCTCTCAACGAGAGGAAATTACCTGGAGCGCCTTTGGCCGCAAGAACTTCGCTACTATGGGCTCAAAATCATCTAGATCGCGATTGG
AACATTTGGAACTCAAAAAGGGCTTAA
Protein sequenceShow/hide protein sequence
MPLLLIESDVWGPSVKPSRNGFRYYVSFVDVYSRFTWIYFLQSKSDVYSTFISFRTHIEKLLGLPIRMIQTDGGGEFRALTPYLRSNGISHRFSCPYTSQQNGIV
ERKHRHIVDMGLTLLSQASLPLEFWDDAFSTAVYTINRLPSTVLHDISPMERLFNTKPLYSFLKTFGCLCFPCLRPYNSHKLQFRSSPCTFLGYSNIHKGYKCMD
SSGRIFISRHVQFDENSFPYSISYKPKSSYVTRCDNAVVSHLPITPMTCEVTTQKPAMLSSCDADNSNSLNSNSTGSTSTLPAVDNTLPALQSPPNSNLTAPFNA
DVPTGTDPCTSQPFVSIISNGQSNIHPMTSQGRRPKSDWVAAQDHAMDPMRAFQIDAHTKLGLSLGVGYPIQWRYTIALCSRPLPSLPPRSHASTLQSHSAPAQV
RLSRSDTGICRSCRFSQETLKFREQLSKKLAKDRETFGNDLDSVVEVCSKIFGEYLHVEYGGPGTLLVEPFTNMFIALNERKLPGAPLAARTSLLWAQNHLDRDW
NIWNSKRA