; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013207 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013207
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00153756:37591..44103
RNA-Seq ExpressionSgr013207
SyntenySgr013207
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
DAD44881.1 TPA_asm: hypothetical protein HUJ06_003111 [Nelumbo nucifera]8.5e-6939.65Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS
        GP+ +    N++  +ADVL IG+ K+S YV S S+ ++++T      + W +                        F       VC GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR
         S NR     QL+HSNLMG T TPSY G RY+M++VDDFS +  VYFLE+                                           K++GI+R
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        Q TCP+TPQQN VA++KLAHL ++ LSWLH+KNLP+ELWA A+  A      LP W  ++ S F++L++ KP+VSY R+FGS+CYVHV K+ R KLDPK 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDN-ASNDKENNTTVGGNIQSEMK---VQGSLFEASNLSF
        R     GYD HRKGW+CMDP +K++V SRDV+FDE+S     +ST+    +  PF   N  S   E  ++  G+++S  +    QG L + + L++
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDN-ASNDKENNTTVGGNIQSEMK---VQGSLFEASNLSF

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]8.8e-8248.59Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS
        GP+D+K  SNIK  EADVL  GKRKDS YV SASD ++EKT        W +                        FKEIH DVVC GCQ+GKS  L F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLV----HSN------LMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLE-SKKNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEK
        NS N+A  ALQLV    H +      +    +    FG     + +D+   +    FL   +++ I+ +MTCP+TPQQN VA+RKLAHLTSMCLSWLH K
Subjt:  NSNNRAIVALQLV----HSN------LMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLE-SKKNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEK

Query:  NLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKVR----CGYDTHRKGWRCMDPNTKEVVVSRDVV
        +LP+ELWAAA+  A      LP W  +  S F+ LYH KPNVSY RVF S+CYVHVSK   TK DP+ R     GY+TH+KGWRCMDP TK+V+VS DVV
Subjt:  NLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKVR----CGYDTHRKGWRCMDPNTKEVVVSRDVV

Query:  FDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQGSLFE
        FD++SS+Q++A+T +G+ADLSPFFS++AS++K +NT+  G    + +V G+  +
Subjt:  FDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQGSLFE

KAG6433862.1 hypothetical protein SASPL_105481 [Salvia splendens]2.3e-7442Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYW------------RSSSFK----------EIHLDVVCLGCQFGKSRRLLFS
        GPND+K   N+K   ADV  IG++K S +V S  + +++KT      + W            R  S K           +  DV+C GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYW------------RSSSFK----------EIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESKK-------------------------------------------NGIQR
         S NR     +LVH++LMG TRTPS    RYVM++VDD S FT V FL+ K                                            NG QR
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESKK-------------------------------------------NGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        QMTCPDTPQQN VA+RKLAHLTS+CLSWLH+KNLP+ELWA A+  A      L LW ++  S F++LY   P+VSY RVFGSICYVHV+K+KRTKLDPK 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISS---HQMDASTKKGVADLSPFFSD-----NASNDKENNTTVGGNIQSEMKVQGSLFEASNLSF
        +     GYD  RKGWRCMDP T +   SRDVVFDEISS    Q  A+      +L P F D     ++  + EN++ +  +I     V G +  A N   
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISS---HQMDASTKKGVADLSPFFSD-----NASNDKENNTTVGGNIQSEMKVQGSLFEASNLSF

Query:  GSLSELASPSIRL-DEIEH
          +S      IR  D ++H
Subjt:  GSLSELASPSIRL-DEIEH

KAG6437849.1 hypothetical protein SASPL_102779 [Salvia splendens]1.5e-7343.32Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYWRS-------------SSFK---------EIHLDVVCLGCQFGKSRRLLFS
        GPND+K   N+K   ADV  IG++K S +V SA + +++KT      + W +             SS K          +  DV+C GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTE-----AYWRS-------------SSFK---------EIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESKK-------------------------------------------NGIQR
         S NR     +LVH++LMG TRTPS    RYVM++VDD S FT V FL+ K                                            NGIQR
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESKK-------------------------------------------NGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        QMTCPDTPQQN VA+RKLAHLTS+CLSWLH+KNLP+ELWA AV  A      LP W  +  S F+++Y   P+VSY RVFGSICYVHV+K+ RTKLDPK 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISS---HQMDASTKKGVADLSPFFSD-----NASNDKENNTTVGGNIQSEMKVQGSLFEASN
        +     GYD  RKGWRCMDP T +    RDVVFDEISS    Q  AS      +L P F D     ++  + EN++ +   I     V G +  A N
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISS---HQMDASTKKGVADLSPFFSD-----NASNDKENNTTVGGNIQSEMKVQGSLFEASN

RWR74934.1 Integrase, catalytic core [Cinnamomum micranthum f. kanehirae]4.8e-8851.65Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS
        GP +++  SNIK  EADVL  G+RK+S YV SASD ++EKT        W S                        FKEIH DVVC GCQ+ KS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR
         S NRA   LQLVHS+LMG T+T SY   RYVMI+VDDFS FT VYFLE+                                           K++GIQ 
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        QMTCP+TPQQN VA+RKLAHLTSMCLSWLH KNLP+ELWAAAV SA      LP W  +  S F+ LYH KPNVSY +VFGS CYVH+SK  RTKLDP+ 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDK
        R     GYD HRKGW+CMDP TK+V VSRDVVFDE+SS Q+D  TK+G  D SPF    +  D+
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDK

TrEMBL top hitse value%identityAlignment
A0A1J3CK86 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-6038.86Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEK-----TEAYWRS-------SSFK-EIHLDVV--------------CLGCQFGKSRRLLFS
        GP D+KF  NI++ +ADV+  G R    YV SAS+ +IEK      +  W +       +  K  ++ D+V              C GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEK-----TEAYWRS-------SSFK-EIHLDVV--------------CLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR
        NS +R    L+ VHS+LMG TRT SY G RY+++ VDDFS +T VYF++ K                                           K GI+R
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        + TCP TPQQN VA+RK+ HL+  C SWLH KNLPK LW      AA V + +PL   +  S +++++ +KP V + R+FGSICYVHV  ++RTKL+ K 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH---------QMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSE
        +     GYD  RKGWRCMDP T    +SRDVVFDE+SS+         Q  A + KG        SD  S++ E     G   Q E
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH---------QMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSE

A0A443N8T5 Integrase, catalytic core2.3e-8851.65Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS
        GP +++  SNIK  EADVL  G+RK+S YV SASD ++EKT        W S                        FKEIH DVVC GCQ+ KS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR
         S NRA   LQLVHS+LMG T+T SY   RYVMI+VDDFS FT VYFLE+                                           K++GIQ 
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLES-------------------------------------------KKNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        QMTCP+TPQQN VA+RKLAHLTSMCLSWLH KNLP+ELWAAAV SA      LP W  +  S F+ LYH KPNVSY +VFGS CYVH+SK  RTKLDP+ 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDK
        R     GYD HRKGW+CMDP TK+V VSRDVVFDE+SS Q+D  TK+G  D SPF    +  D+
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDK

A0A5J5C3K7 Uncharacterized protein4.2e-8248.59Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS
        GP+D+K  SNIK  EADVL  GKRKDS YV SASD ++EKT        W +                        FKEIH DVVC GCQ+GKS  L F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKT-----EAYWRS----------------------SSFKEIHLDVVCLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLV----HSN------LMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLE-SKKNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEK
        NS N+A  ALQLV    H +      +    +    FG     + +D+   +    FL   +++ I+ +MTCP+TPQQN VA+RKLAHLTSMCLSWLH K
Subjt:  NSNNRAIVALQLV----HSN------LMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLE-SKKNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEK

Query:  NLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKVR----CGYDTHRKGWRCMDPNTKEVVVSRDVV
        +LP+ELWAAA+  A      LP W  +  S F+ LYH KPNVSY RVF S+CYVHVSK   TK DP+ R     GY+TH+KGWRCMDP TK+V+VS DVV
Subjt:  NLPKELWAAAVHSA------LPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKVR----CGYDTHRKGWRCMDPNTKEVVVSRDVV

Query:  FDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQGSLFE
        FD++SS+Q++A+T +G+ADLSPFFS++AS++K +NT+  G    + +V G+  +
Subjt:  FDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQGSLFE

A0A6D2JED0 Uncharacterized protein1.0e-5940.71Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTEA-----YWRS-------SSFKE-IHLDVV--------------CLGCQFGKSRRLLFS
        GP D+KF  NI++ +ADV+  G R    YV SAS+ +IEK         W +       +  K  ++ D+V              C GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTEA-----YWRS-------SSFKE-IHLDVV--------------CLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR
        NS +R    L+ +HS+LMG TRT SY G RY+++ VDD+S +T VYF++ K                                           K+GI+R
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        + +CP TPQQN VA+RK+ HL+  C SWLH KNLPK LW      AA V + +PL   +  S +++++ +KP V +LR+FGSICYVHV  ++RTKL+ K 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH
        +     GYD  RKGWRCMDP T    VSRDVVFDE+SS+
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH

A0A6D2KZV0 Uncharacterized protein1.0e-5940.71Show/hide
Query:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTEA-----YWRS-------SSFKE-IHLDVV--------------CLGCQFGKSRRLLFS
        GP D+KF  NI++ +ADV+  G R    YV SAS+ +IEK         W +       +  K  ++ D+V              C GCQ+GKS RL F 
Subjt:  GPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTEA-----YWRS-------SSFKE-IHLDVV--------------CLGCQFGKSRRLLFS

Query:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR
        NS +R    L+ +HS+LMG TRT SY G RY+++ VDD+S +T VYF++ K                                           K+GI+R
Subjt:  NSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------------------KNGIQR

Query:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV
        + +CP TPQQN VA+RK+ HL+  C SWLH KNLPK LW      AA V + +PL   +  S +++++ +KP V +LR+FGSICYVHV  ++RTKL+ K 
Subjt:  QMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELW------AAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNKRTKLDPKV

Query:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH
        +     GYD  RKGWRCMDP T    VSRDVVFDE+SS+
Subjt:  R----CGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSH

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.6e-1524.83Show/hide
Query:  VCLGCQFGKSRRLLFSNSNNRAIV--ALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------
        +C  C  GK  RL F    ++  +   L +VHS++ G     +     Y +I VD F+ +   Y ++ K                               
Subjt:  VCLGCQFGKSRRLLFSNSNNRAIV--ALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK-------------------------------

Query:  ------------KNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSALPL--------WVASNLSSFDVLYHRKPNVSYLRVF
                    K GI   +T P TPQ N V++R +  +T    + +    L K  W  AV +A  L         V S+ + +++ +++KP + +LRVF
Subjt:  ------------KNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSALPL--------WVASNLSSFDVLYHRKPNVSYLRVF

Query:  GSICYVHVSKNKRTKLDPK----VRCGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDKEN
        G+  YVH+ KNK+ K D K    +  GY+ +  G++  D   ++ +V+RDVV DE      +    + V   + F  D+  ++ +N
Subjt:  GSICYVHVSKNKRTKLDPK----VRCGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDKEN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-2029.29Show/hide
Query:  CLGCQFGKSRRLLFSNSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK----------------------------------
        C  C FGK  R+ F  S+ R +  L LV+S++ G     S  G +Y +  +DD S    VY L++K                                  
Subjt:  CLGCQFGKSRRLLFSNSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVMIIVDDFSLFTEVYFLESK----------------------------------

Query:  ---------KNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSALPLWVASNL--SSFD----VLYHRKPNVSYLRVFGSICY
                  +GI+ + T P TPQ N VA+R    +     S L    LPK  W  AV +A  L   S     +F+    V  +++ + S+L+VFG   +
Subjt:  ---------KNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSALPLWVASNL--SSFD----VLYHRKPNVSYLRVFGSICY

Query:  VHVSKNKRTKLD----PKVRCGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQG
         HV K +RTKLD    P +  GY     G+R  DP  K+V+ SRDVVF E         ++K    + P F    S    NN T   +   E+  QG
Subjt:  VHVSKNKRTKLD----PKVRCGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCTTTGGGTCCAAATGATATGAAATTTTTTTCTAATATTAAGCAATTTGAAGCTGATGTTTTGTTAATTGGAAAGAGGAAAGATTCCTTCTACGTTTTCTCTGC
AAGTGATGTACATATTGAAAAGACAGAGGCTTATTGGCGGAGTTCCTCTTTTAAAGAAATTCACCTAGATGTGGTTTGTCTTGGTTGCCAATTTGGGAAATCACGTCGTC
TTCTTTTCTCGAATTCAAATAATAGGGCTATTGTTGCATTGCAACTAGTTCATTCAAATTTGATGGGACTAACTAGAACACCCAGTTATTTTGGTTGTCGCTATGTTATG
ATTATTGTGGACGATTTCTCTCTGTTTACTGAGGTGTATTTCTTGGAAAGTAAAAAGAATGGCATTCAGCGCCAAATGACATGTCCTGACACTCCACAGCAGAATGAAGT
TGCTAAACGTAAATTGGCACATCTTACATCTATGTGCTTGTCTTGGCTGCATGAGAAGAACCTTCCAAAGGAGCTTTGGGCAGCGGCTGTTCATTCAGCTCTACCTTTAT
GGGTGGCATCAAATCTATCTTCTTTTGACGTATTATATCATCGTAAACCCAATGTGAGTTATCTTCGAGTTTTTGGGTCAATTTGTTATGTTCATGTTTCTAAGAATAAG
CGGACTAAACTTGACCCAAAGGTAAGATGTGGTTATGATACTCATAGAAAAGGATGGAGATGTATGGATCCAAATACAAAGGAAGTAGTTGTCTCTCGAGATGTGGTGTT
TGACGAAATTTCGTCACATCAAATGGATGCAAGTACAAAGAAAGGTGTTGCTGATCTGTCACCTTTCTTTAGTGATAATGCGTCAAATGATAAGGAGAACAATACTACTG
TCGGAGGAAATATTCAATCAGAGATGAAGGTACAGGGATCGTTGTTCGAAGCTTCGAATCTTTCATTCGGATCCCTCTCTGAATTGGCAAGCCCCTCCATTAGGTTGGAT
GAAATTGAACACATATGCGGCGTGCAAATCAACATGCCCTCATTTTGGAATAGAAGCTGCAGAGGAGGCAATAAGGTGGCTCACACCATTGCGTCTTCTGCTCTTGTTTC
AAAGAAATCTCTTGTTTGGAAGAGATTTTTTCTTGAATGGTTGCTTAGTATAGTTGATGAGGATAAACACTTCAGTGTAGCCCATGTGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCTTTGGGTCCAAATGATATGAAATTTTTTTCTAATATTAAGCAATTTGAAGCTGATGTTTTGTTAATTGGAAAGAGGAAAGATTCCTTCTACGTTTTCTCTGC
AAGTGATGTACATATTGAAAAGACAGAGGCTTATTGGCGGAGTTCCTCTTTTAAAGAAATTCACCTAGATGTGGTTTGTCTTGGTTGCCAATTTGGGAAATCACGTCGTC
TTCTTTTCTCGAATTCAAATAATAGGGCTATTGTTGCATTGCAACTAGTTCATTCAAATTTGATGGGACTAACTAGAACACCCAGTTATTTTGGTTGTCGCTATGTTATG
ATTATTGTGGACGATTTCTCTCTGTTTACTGAGGTGTATTTCTTGGAAAGTAAAAAGAATGGCATTCAGCGCCAAATGACATGTCCTGACACTCCACAGCAGAATGAAGT
TGCTAAACGTAAATTGGCACATCTTACATCTATGTGCTTGTCTTGGCTGCATGAGAAGAACCTTCCAAAGGAGCTTTGGGCAGCGGCTGTTCATTCAGCTCTACCTTTAT
GGGTGGCATCAAATCTATCTTCTTTTGACGTATTATATCATCGTAAACCCAATGTGAGTTATCTTCGAGTTTTTGGGTCAATTTGTTATGTTCATGTTTCTAAGAATAAG
CGGACTAAACTTGACCCAAAGGTAAGATGTGGTTATGATACTCATAGAAAAGGATGGAGATGTATGGATCCAAATACAAAGGAAGTAGTTGTCTCTCGAGATGTGGTGTT
TGACGAAATTTCGTCACATCAAATGGATGCAAGTACAAAGAAAGGTGTTGCTGATCTGTCACCTTTCTTTAGTGATAATGCGTCAAATGATAAGGAGAACAATACTACTG
TCGGAGGAAATATTCAATCAGAGATGAAGGTACAGGGATCGTTGTTCGAAGCTTCGAATCTTTCATTCGGATCCCTCTCTGAATTGGCAAGCCCCTCCATTAGGTTGGAT
GAAATTGAACACATATGCGGCGTGCAAATCAACATGCCCTCATTTTGGAATAGAAGCTGCAGAGGAGGCAATAAGGTGGCTCACACCATTGCGTCTTCTGCTCTTGTTTC
AAAGAAATCTCTTGTTTGGAAGAGATTTTTTCTTGAATGGTTGCTTAGTATAGTTGATGAGGATAAACACTTCAGTGTAGCCCATGTGGAAGATTGA
Protein sequenceShow/hide protein sequence
MLSLGPNDMKFFSNIKQFEADVLLIGKRKDSFYVFSASDVHIEKTEAYWRSSSFKEIHLDVVCLGCQFGKSRRLLFSNSNNRAIVALQLVHSNLMGLTRTPSYFGCRYVM
IIVDDFSLFTEVYFLESKKNGIQRQMTCPDTPQQNEVAKRKLAHLTSMCLSWLHEKNLPKELWAAAVHSALPLWVASNLSSFDVLYHRKPNVSYLRVFGSICYVHVSKNK
RTKLDPKVRCGYDTHRKGWRCMDPNTKEVVVSRDVVFDEISSHQMDASTKKGVADLSPFFSDNASNDKENNTTVGGNIQSEMKVQGSLFEASNLSFGSLSELASPSIRLD
EIEHICGVQINMPSFWNRSCRGGNKVAHTIASSALVSKKSLVWKRFFLEWLLSIVDEDKHFSVAHVED