; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018052 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018052
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr5:14834029..14840928
RNA-Seq ExpressionLag0018052
SyntenyLag0018052
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]6.9e-15146.57Show/hide
Query:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN
        V + +  +C  +L      E K+DLLV+ETC+VEND  +WILDSGAT+HVCSS +  SS++QL+  EMTL+VGTG++ISA+ VGD K+FF  + +M L+N
Subjt:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN

Query:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV
        +  +  +   L   ++ + EH+ ++ ++ N       Y   +             +A LEN                   + ++ P              
Subjt:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV

Query:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-
                                                                  + +  ++H                 EMF+TA TQ KRQ++S 
Subjt:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-

Query:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------
          +TYLWHLRLG+INL+ IGRLVKNGLL++L+D +LP CES LEG                                                       
Subjt:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------

Query:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV
                               K IK LRSDRGGEY+ L+FQ+Y+IEH I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVE AV
Subjt:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV

Query:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL
        +ILN V SKSVS+TPFELWRGRK SL HFRIWGCPAHVLV NPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLEEDH+R HKP SK+VL
Subjt:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL

Query:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV
        S    EAT+E      STRVVDE     GPS+R VDE +TSGQ+ PSQ  RMPRRSGRVV+QP RYL L ET  VI DDG+EDPL+++QAMNDVD D+WV
Subjt:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV

Query:  KAMDLEMESMYFNQV
        KAMDLEMESMYFN V
Subjt:  KAMDLEMESMYFNQV

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]6.4e-14946.15Show/hide
Query:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN
        V + +  +C  +L      E+K+DLLV+ETC+VEND  +WILDSGAT+HVCSS +  SS++QL+  EMTL+VGTG++ISA+ VGD K+FF  + +M L+N
Subjt:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN

Query:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV
        +  +  +   L   ++ + EH+ ++ ++ N       Y   +             +A LEN                   + ++ P              
Subjt:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV

Query:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-
                                                                  + +  ++H                 EMF+TA TQ KRQ++S 
Subjt:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-

Query:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------
          +TYLWHLRLG+INL+ IGRLVK+GLL++L+D +LP CES LEG                                                       
Subjt:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------

Query:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV
                               K IK  RSDRGGEY+ L FQ+Y+IEH I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVE AV
Subjt:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV

Query:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL
        +ILN V SKSVS+TPFELWRGRK SL HFRIWGCPAHVLV NPKKL+ RS+LC FVGYPKETRGG FFDP+EN+VFVSTNATFLEEDH+R HKP SK+VL
Subjt:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL

Query:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV
        S    EAT+E      STRVVDE     GPS+R VDE +TSGQ+ PSQ  RMPRRSGRVV+QP RYL L ET  VI DDG+EDPL+++QAMNDVD D+WV
Subjt:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV

Query:  KAMDLEMESMYFNQV
        KAMDLEMESMYFN V
Subjt:  KAMDLEMESMYFNQV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-14347.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-14347.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-14347.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein8.8e-14447.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

A0A5A7T2V9 Gag/pol protein3.1e-14946.15Show/hide
Query:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN
        V + +  +C  +L      E+K+DLLV+ETC+VEND  +WILDSGAT+HVCSS +  SS++QL+  EMTL+VGTG++ISA+ VGD K+FF  + +M L+N
Subjt:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN

Query:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV
        +  +  +   L   ++ + EH+ ++ ++ N       Y   +             +A LEN                   + ++ P              
Subjt:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV

Query:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-
                                                                  + +  ++H                 EMF+TA TQ KRQ++S 
Subjt:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-

Query:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------
          +TYLWHLRLG+INL+ IGRLVK+GLL++L+D +LP CES LEG                                                       
Subjt:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------

Query:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV
                               K IK  RSDRGGEY+ L FQ+Y+IEH I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVE AV
Subjt:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV

Query:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL
        +ILN V SKSVS+TPFELWRGRK SL HFRIWGCPAHVLV NPKKL+ RS+LC FVGYPKETRGG FFDP+EN+VFVSTNATFLEEDH+R HKP SK+VL
Subjt:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL

Query:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV
        S    EAT+E      STRVVDE     GPS+R VDE +TSGQ+ PSQ  RMPRRSGRVV+QP RYL L ET  VI DDG+EDPL+++QAMNDVD D+WV
Subjt:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV

Query:  KAMDLEMESMYFNQV
        KAMDLEMESMYFN V
Subjt:  KAMDLEMESMYFNQV

A0A5A7TZD0 Gag/pol protein3.3e-15146.57Show/hide
Query:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN
        V + +  +C  +L      E K+DLLV+ETC+VEND  +WILDSGAT+HVCSS +  SS++QL+  EMTL+VGTG++ISA+ VGD K+FF  + +M L+N
Subjt:  VHDEYDIDCLAHL-----NENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDN

Query:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV
        +  +  +   L   ++ + EH+ ++ ++ N       Y   +             +A LEN                   + ++ P              
Subjt:  VSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRV

Query:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-
                                                                  + +  ++H                 EMF+TA TQ KRQ++S 
Subjt:  FGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEMFKTAETQPKRQKVS-

Query:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------
          +TYLWHLRLG+INL+ IGRLVKNGLL++L+D +LP CES LEG                                                       
Subjt:  -QSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------

Query:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV
                               K IK LRSDRGGEY+ L+FQ+Y+IEH I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVE AV
Subjt:  -----------------------KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAV

Query:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL
        +ILN V SKSVS+TPFELWRGRK SL HFRIWGCPAHVLV NPKKL+ RS+LC FVGYPKETRGG FFDPQEN+VFVSTNATFLEEDH+R HKP SK+VL
Subjt:  YILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVL

Query:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV
        S    EAT+E      STRVVDE     GPS+R VDE +TSGQ+ PSQ  RMPRRSGRVV+QP RYL L ET  VI DDG+EDPL+++QAMNDVD D+WV
Subjt:  SGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWV

Query:  KAMDLEMESMYFNQV
        KAMDLEMESMYFN V
Subjt:  KAMDLEMESMYFNQV

A0A5A7V4M1 Gag/pol protein8.8e-14447.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

A0A5D3CPJ6 Gag/pol protein8.8e-14447.06Show/hide
Query:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK
        + K+DLLV+ETC+VENDD++WI+DSGAT+HVCSSF+G+SSW+QL+TGEMT+RVGTG ++SA  VG +++   +  ++LL+NV                  
Subjt:  ENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDTGEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVK

Query:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD
         +VV  L  + +L+S                    +  +LE                        S T NV     +KV ++    E  S +L       
Subjt:  EHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPIDMPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVAD

Query:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM
              E N                                           V   L S   +N EMFKTA TQ KR K+S  ++ +LWHLRLG+INLN 
Subjt:  VGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVN-EMFKTAETQPKRQKVS--QSTYLWHLRLGNINLNM

Query:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------
        I RLVKNGLLS+LE+ +LP CES LEG                                                                         
Subjt:  IGRLVKNGLLSQLEDGTLPSCESGLEG-------------------------------------------------------------------------

Query:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL
             KTIKT RSDRGGEY+ LKFQNYL+E  I SQL APGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+ AVYILN V SKSVS+TP +L
Subjt:  -----KTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFEL

Query:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST
        W GRK SL HFRIWGCPAHVL  NPKKL+ RSKLCLFVGYPK TRGGYF+DP++NKVFVSTNATFLEEDH+REHKP SK+VL+ LS E T       PST
Subjt:  WRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNESTRVGPST

Query:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV
        RVV+EPS      TRVV   S++   QP Q  R PRRSGRV   P RY+SL ET  VISD  IEDPLTF++AM DVD DEW+KAM+LE+ESMYFN V
Subjt:  RVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEMESMYFNQV

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-2435.87Show/hide
Query:  DRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSV---SKTPFELWRGRKASLPH
        D G EYL  + + + ++  IS  L  P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV  A Y++N + S+++   SKTP+E+W  +K  L H
Subjt:  DRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSV---SKTPFELWRGRKASLPH

Query:  FRIWGCPAHVLVANPK-KLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNES
         R++G   +V + N + K D +S   +FVGY  E  G   +D    K  V+ +    E + V       + V    S E+ N++
Subjt:  FRIWGCPAHVLVANPK-KLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSGLSGEATNES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-3734.75Show/hide
Query:  GKTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRG
        G+ +K LRSD GGEY   +F+ Y   H I  +   PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+ A Y++N   S  ++ + P  +W  
Subjt:  GKTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRG

Query:  RKASLPHFRIWGCP--AHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSG-LSGEATNESTRVGPST
        ++ S  H +++GC   AHV      KLD +S  C+F+GY  E  G   +DP + KV  S +  F  E  VR     S+ V +G +    T  ST   P++
Subjt:  RKASLPHFRIWGCP--AHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLEEDHVREHKPHSKMVLSG-LSGEATNESTRVGPST

Query:  --RVVDEPSTQTGPSTRVVDE--------PSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEME
             DE S Q      V+++               Q  +Q +  RRS R   +  RY S      +ISDD   +P + ++ ++  + ++ +KAM  EME
Subjt:  --RVVDEPSTQTGPSTRVVDE--------PSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVDHDEWVKAMDLEME

Query:  SMYFN
        S+  N
Subjt:  SMYFN

Q851Q6 Probable GTP-binding protein OBGC23.9e-0842.86Show/hide
Query:  IDMPE-RRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVADVGLV----AREGNKVSYVCCSSFSPLDVDKAT---HVGRGGGGAFGDGETGVS
        ID+PE RR   M  + N+MRD SD+V + GQ GE+VSLELILRVVADVGLV    A +   +S +  +     D    T   ++GR G    GD   G  
Subjt:  IDMPE-RRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVADVGLV----AREGNKVSYVCCSSFSPLDVDKAT---HVGRGGGGAFGDGETGVS

Query:  HLLSAIVCTDLP
           S     DLP
Subjt:  HLLSAIVCTDLP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-1930.67Show/hide
Query:  IKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKA
        I T  SD GGE++ L    Y  +H IS     P TP+ NG+SER++R +++   +++S+A +P ++W YA  +AVY++N + +  +  ++PF+   G   
Subjt:  IKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKA

Query:  SLPHFRIWGCPAHVLVA--NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE
        +    R++GC  +  +   N  KLD +S+ C+F+GY            Q +++++S +  F E
Subjt:  SLPHFRIWGCPAHVLVA--NPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-1935.29Show/hide
Query:  IKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKA
        I TL SD GGE++ L+  +YL +H IS     P TP+ NG+SER++R +++M  +++S+A +P ++W YA  +AVY++N + +  +  ++PF+   G+  
Subjt:  IKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKA

Query:  SLPHFRIWGCPAHVLVA--NPKKLDSRSKLCLFVGY
        +    +++GC  +  +   N  KL+ +SK C F+GY
Subjt:  SLPHFRIWGCPAHVLVA--NPKKLDSRSKLCLFVGY

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.1e-0532.93Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSK
        NRT+++ VRSM+    LP +F   A   AV+I+N   S +++   P E+W     +  + R +GC A++   +  KL  R+K
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVS-KTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGTTCTTTGTATGCCTCGAATAACGCCAACTACAAAACTAGTATTTCTAATACAACTAAGAAAATGAACTTAAACATTAATTGCGTGGAATTGGTCTTAAATGA
GGACGATCCTCAAGCCCACATGTCTAATTCATATGAAATTGTTCATGATGAATATGACATAGACTGCTTAGCACATCTCAATGAAAATAAATTTGATTTATTGGTCATTG
AAACGTGTATAGTGGAAAATGATGATACATCCTGGATTCTTGATTCAGGTGCCACTCATCACGTATGCTCTTCCTTTGAGGGAATGAGCTCTTGGCAACAGTTGGATACT
GGGGAAATGACTCTAAGAGTTGGGACTGGGGAGCTTATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACACTATATGTTATTAGATAATGTGTC
GAAGATAATGTTGATGTCGACTGTGTTATTTGAAGATATCACTAGTGTGAAAGAACACGTTGTCAATGTGTTGTATGCTCGAAACTCGTTGGTGTCGTGGTTGGCGTATG
GCACCACAATAGAGGCCAGAAGAAGGACCACAAGGTATAGCTTCCCCATCACCGCCATGCTTGAGAATCGATGTTGCAAATACGCCATTGTGCAGATTATCCCGATAGAT
ATGCCAGAGAGAAGGATAAATATTATGTCTCCAACCTCCAATGTTATGAGAGATGAAAGTGATAAGGTTCGGGTATTCGGTCAACTTGGGGAAAAGGTCAGTCTCGAGTT
GATTCTTCGAGTAGTTGCTGACGTTGGTCTAGTGGCTAGGGAAGGAAACAAGGTCAGCTATGTTTGTTGTAGTAGCTTCAGTCCCCTCGATGTTGACAAAGCAACACATG
TTGGCAGAGGGGGTGGAGGAGCATTTGGGGATGGAGAGACTGGTGTGAGCCACTTACTTAGTGCCATTGTTTGTACTGACCTACCATCAAATTTTGCAGTGAATGAAATG
TTTAAAACAGCCGAGACTCAACCAAAAAGGCAGAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGAAACATAAATCTCAATATGATTGGGAGATTGGTTAA
GAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTTCGTGTGAGTCAGGTCTCGAAGGTAAGACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATTTGG
GCTTAAAATTCCAGAATTATTTGATAGAACATGAAATCTCGTCTCAACTCTTAGCACCAGGTACACCTCAACAGAATGGTGTATCTGAAAGGAGAAATAGAACCTTGTTG
GACATGGTTCGATCTATGATGAGCTATGCTCAACTGCCTAGTTCTTTTTGGGGTTATGCAGTAGAGATTGCAGTTTATATTTTGAATGTAGTTTCCTCAAAGAGTGTTTC
TAAAACACCATTTGAGCTCTGGAGAGGGCGTAAAGCTAGTTTACCTCATTTCAGAATTTGGGGTTGTCCTGCACATGTGCTTGTGGCAAACCCAAAGAAATTGGATTCAC
GTTCAAAATTATGCCTATTCGTAGGCTACCCAAAAGAAACAAGGGGTGGATACTTCTTCGATCCACAAGAAAATAAAGTTTTTGTTTCGACAAATGCTACTTTCTTAGAA
GAAGACCACGTTCGAGAACACAAGCCACACAGTAAAATGGTGTTAAGTGGACTTTCTGGTGAAGCTACGAATGAATCAACAAGAGTTGGACCTTCAACAAGGGTTGTTGA
TGAGCCCAGTACACAGACTGGACCTTCAACAAGAGTTGTAGATGAGCCCAGTACATCTGGTCAAGCCCAACCTTCTCAACAGTTTAGAATGCCTCGACGTAGTGGGAGGG
TTGTTACTCAACCTGAACGCTATTTGAGTTTAGCAGAAACTCCAGCTGTCATATCTGATGATGGAATCGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGAC
CATGATGAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTTTCACTACTACAAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACGTTCTTTGTATGCCTCGAATAACGCCAACTACAAAACTAGTATTTCTAATACAACTAAGAAAATGAACTTAAACATTAATTGCGTGGAATTGGTCTTAAATGA
GGACGATCCTCAAGCCCACATGTCTAATTCATATGAAATTGTTCATGATGAATATGACATAGACTGCTTAGCACATCTCAATGAAAATAAATTTGATTTATTGGTCATTG
AAACGTGTATAGTGGAAAATGATGATACATCCTGGATTCTTGATTCAGGTGCCACTCATCACGTATGCTCTTCCTTTGAGGGAATGAGCTCTTGGCAACAGTTGGATACT
GGGGAAATGACTCTAAGAGTTGGGACTGGGGAGCTTATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACACTATATGTTATTAGATAATGTGTC
GAAGATAATGTTGATGTCGACTGTGTTATTTGAAGATATCACTAGTGTGAAAGAACACGTTGTCAATGTGTTGTATGCTCGAAACTCGTTGGTGTCGTGGTTGGCGTATG
GCACCACAATAGAGGCCAGAAGAAGGACCACAAGGTATAGCTTCCCCATCACCGCCATGCTTGAGAATCGATGTTGCAAATACGCCATTGTGCAGATTATCCCGATAGAT
ATGCCAGAGAGAAGGATAAATATTATGTCTCCAACCTCCAATGTTATGAGAGATGAAAGTGATAAGGTTCGGGTATTCGGTCAACTTGGGGAAAAGGTCAGTCTCGAGTT
GATTCTTCGAGTAGTTGCTGACGTTGGTCTAGTGGCTAGGGAAGGAAACAAGGTCAGCTATGTTTGTTGTAGTAGCTTCAGTCCCCTCGATGTTGACAAAGCAACACATG
TTGGCAGAGGGGGTGGAGGAGCATTTGGGGATGGAGAGACTGGTGTGAGCCACTTACTTAGTGCCATTGTTTGTACTGACCTACCATCAAATTTTGCAGTGAATGAAATG
TTTAAAACAGCCGAGACTCAACCAAAAAGGCAGAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGAAACATAAATCTCAATATGATTGGGAGATTGGTTAA
GAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTTCGTGTGAGTCAGGTCTCGAAGGTAAGACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATTTGG
GCTTAAAATTCCAGAATTATTTGATAGAACATGAAATCTCGTCTCAACTCTTAGCACCAGGTACACCTCAACAGAATGGTGTATCTGAAAGGAGAAATAGAACCTTGTTG
GACATGGTTCGATCTATGATGAGCTATGCTCAACTGCCTAGTTCTTTTTGGGGTTATGCAGTAGAGATTGCAGTTTATATTTTGAATGTAGTTTCCTCAAAGAGTGTTTC
TAAAACACCATTTGAGCTCTGGAGAGGGCGTAAAGCTAGTTTACCTCATTTCAGAATTTGGGGTTGTCCTGCACATGTGCTTGTGGCAAACCCAAAGAAATTGGATTCAC
GTTCAAAATTATGCCTATTCGTAGGCTACCCAAAAGAAACAAGGGGTGGATACTTCTTCGATCCACAAGAAAATAAAGTTTTTGTTTCGACAAATGCTACTTTCTTAGAA
GAAGACCACGTTCGAGAACACAAGCCACACAGTAAAATGGTGTTAAGTGGACTTTCTGGTGAAGCTACGAATGAATCAACAAGAGTTGGACCTTCAACAAGGGTTGTTGA
TGAGCCCAGTACACAGACTGGACCTTCAACAAGAGTTGTAGATGAGCCCAGTACATCTGGTCAAGCCCAACCTTCTCAACAGTTTAGAATGCCTCGACGTAGTGGGAGGG
TTGTTACTCAACCTGAACGCTATTTGAGTTTAGCAGAAACTCCAGCTGTCATATCTGATGATGGAATCGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGAC
CATGATGAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTTTCACTACTACAAATTTGA
Protein sequenceShow/hide protein sequence
MKRSLYASNNANYKTSISNTTKKMNLNINCVELVLNEDDPQAHMSNSYEIVHDEYDIDCLAHLNENKFDLLVIETCIVENDDTSWILDSGATHHVCSSFEGMSSWQQLDT
GEMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNVSKIMLMSTVLFEDITSVKEHVVNVLYARNSLVSWLAYGTTIEARRRTTRYSFPITAMLENRCCKYAIVQIIPID
MPERRINIMSPTSNVMRDESDKVRVFGQLGEKVSLELILRVVADVGLVAREGNKVSYVCCSSFSPLDVDKATHVGRGGGGAFGDGETGVSHLLSAIVCTDLPSNFAVNEM
FKTAETQPKRQKVSQSTYLWHLRLGNINLNMIGRLVKNGLLSQLEDGTLPSCESGLEGKTIKTLRSDRGGEYLGLKFQNYLIEHEISSQLLAPGTPQQNGVSERRNRTLL
DMVRSMMSYAQLPSSFWGYAVEIAVYILNVVSSKSVSKTPFELWRGRKASLPHFRIWGCPAHVLVANPKKLDSRSKLCLFVGYPKETRGGYFFDPQENKVFVSTNATFLE
EDHVREHKPHSKMVLSGLSGEATNESTRVGPSTRVVDEPSTQTGPSTRVVDEPSTSGQAQPSQQFRMPRRSGRVVTQPERYLSLAETPAVISDDGIEDPLTFRQAMNDVD
HDEWVKAMDLEMESMYFNQVSLLQI