CuGenDBv2

Moc06g33140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g33140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag-Pol polyprotein
Genome location: chr6:25089157..25096013
RNA-Seq Expression: Moc06g33140
Synteny: Moc06g33140
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
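
The genome location in this record is written as chromosome:start..end. The following is a minimal Python sketch of splitting such a string and deriving the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `parse_location` is hypothetical rather than part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr6:25089157..25096013")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 6857 bp on chr6.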


Homology
GenBank top hits (e value, % identity, alignment)
BAD34493.1 Gag-Pol [Ipomoea batatas] (e value: 3.2e-82, % identity: 48.32)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        MAAK+EIEKFNG NFSLWK+K+KAILRKDN LA  S RPVD TDD KW+EM+++A+A+L+L                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MSEST +TEH+NTLNTLFSQLTSL CKIEP ER       LPDSYDQL+INLTNN+LTDYL F+ +  AV+EEE+R KNK D+  + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKS------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD------------------
         V RGRSTERG   S  +GR+KS                        +NSNPQGNVASTS+ G  LCCEA+   EGRK  AD                  
Subjt:  AVTRGRSTERGSSGSQNQGRAKS------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD------------------

Query:  ----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH--------------------------------KRFE-----VEGKKVATNLYMLEGETL
                   + S D HAL+I+GIGTIKLK +D TV+T+                                 K F+     ++G+K+A NLYML+GETL
Subjt:  ----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH--------------------------------KRFE-----VEGKKVATNLYMLEGETL

Query:  QDGEASVASRSPSEKL
        Q+ EASVA+ SP   L
Subjt:  QDGEASVASRSPSEKL

BAD34493.1 Gag-Pol [Ipomoea batatas] (e value: 2.4e-05, % identity: 33.33)
Query:  VIFMEDKRQAVE-DDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQEPKGSQVPETRRSDRLTKPPVGSLITLWRGVWSRRNDKAF---NPDLDRTT--
        VIF+ED+ Q  E DDST  +  ETT + VE+E E+DSSE EP HE  EP+ S  P TR+SDR  + P       W   +    + A+     D + +T  
Subjt:  VIFMEDKRQAVE-DDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQEPKGSQVPETRRSDRLTKPPVGSLITLWRGVWSRRNDKAF---NPDLDRTT--

Query:  -----NDLSAWASSYITAFWTANSN------LLPRDHPPRETRWLPPIER
             +D+S W ++        + N       LP+   P   +W+  I+R
Subjt:  -----NDLSAWASSYITAFWTANSN------LLPRDHPPRETRWLPPIER

BAD34493.1 Gag-Pol [Ipomoea batatas] (e value: 3.2e-82, % identity: 42.33)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNG NFSLWK+KMKAILRKD  LA  + RP+D  DD KWNEMD NA+ N HL                            A SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL M EST +TEH+NTLNTLFSQLTSL CKI   ER       LPDSYDQL+INLTN+ +T  L F+ +  A+++EEN+ KNK DK  + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSR-------------------------NSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD-----------------
          TR RSTERG S S   GR+KSR                         NSNPQGN A+TS+ GD LCCEA+TTVEGRK  AD                 
Subjt:  AVTRGRSTERGSSGSQNQGRAKSR-------------------------NSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD-----------------

Query:  -----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH-----KRFE--------------------------------VEGKKVATNLYMLEGET
                    + S + HAL IVG+ TIKLK +D T++ +      K F+                                ++G+K+A NLYML+GET
Subjt:  -----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH-----KRFE--------------------------------VEGKKVATNLYMLEGET

Query:  LQDGEASVASRSPS------EKLSMIWV----------------------------------------------VIFMEDKRQAVEDDSTVNKNSETTTV
        L + EASVAS S        +KL  I V                                              VIF+EDK Q  EDD +  K SETT +
Subjt:  LQDGEASVASRSPS------EKLSMIWV----------------------------------------------VIFMEDKRQAVEDDSTVNKNSETTTV

Query:  HVEKESEE-DSSEGEPTHEIQEPKGSQVPETRRSDRLTKPP
        +VEK+ E+ DSSE EP  + QEPK S+ P TR+SDR+ K P
Subjt:  HVEKESEE-DSSEGEPTHEIQEPKGSQVPETRRSDRLTKPP

KAA0026163.1 Gag-Pol [Cucumis melo var. makuwa] (e value: 9.3e-82, % identity: 50)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHLA----------------------------RSLHNKIFLK
        MAAK+EIEKFNGTNFSLW +KMK +LR DN L      P ++TDD KWNEMD NA+ N+HLA                            +SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHLA----------------------------RSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MSEST MTEH+NTLNTLFSQL  LG KIEPNER       L DSYDQLVINL NN+L DYL+F+ +  AV+EEENR KNK DKL + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSRNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD----------------------------DLCSADGHALKIVG
         VTRGR  E   +    +G++  ++SNPQGNVAST N+G  L CEA TT EG+K +AD                             L S + HALKIV 
Subjt:  AVTRGRSTERGSSGSQNQGRAKSRNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD----------------------------DLCSADGHALKIVG

Query:  IGTIKLKFHDNTVRTIHKRFEVE-------------------------------------GKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
        IGTIKLK HDN V TI +   VE                                     G+KV  NLYMLEGETLQ+GEASVAS S  E L M+W
Subjt:  IGTIKLKFHDNTVRTIHKRFEVE-------------------------------------GKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

KAA0044949.1 hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa] (e value: 5.7e-79, % identity: 52.1)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNGTNFSLWK+KMKAI RKDN L     RP ++TDD KWNEMD NA+AN+HL                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MS ST MTEH+NTLNTLFSQLT LG KIEPN       + LPDSYDQLVINLTNN+LTDYL+F+ +   V+EEENR KNK DKL SSQQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSRNS--------------------------NPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG
         VTR R  E  SSGS+NQGR+KS++                           NPQGNVAST N+G  L CEA TT+EG+    D     +   +K++   
Subjt:  AVTRGRSTERGSSGSQNQGRAKSRNS--------------------------NPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG

Query:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
         + +K                 +KV  NLYMLEGETLQ+GEASVAS S  E LSM+W
Subjt:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

KAA0044949.1 hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa] (e value: 6.6e-03, % identity: 69.57)
Query:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ
        VIFMEDK+Q V DDSTVN++S   TVHVEKES +DS E  P HEIQ
Subjt:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ

TYK16527.1 hypothetical protein E5676_scaffold21G003420 [Cucumis melo var. makuwa] (e value: 1.5e-79, % identity: 53.22)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNGTNFSLWK+KMKAI RKDN L     RP ++TDD KWNEMD NA+AN+HL                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MS ST MTEH+NTLNTLFSQLT LG KIEPN       + LPDSYDQLVINLTNN+LTDYL+F+ +   V+EEENR KNK DKL SSQQAEVL
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKS--------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG
         VTR R  E  SSGS+NQGR+KS                          ++SNPQGNVAST N+G VL C+A TT+EG+    D     +   +K++   
Subjt:  AVTRGRSTERGSSGSQNQGRAKS--------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG

Query:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
                  V  + KR     +KV  NLYMLEGETLQ+GEASVAS S  E LSM+W
Subjt:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TUN0 Uncharacterized protein (e value: 3.2e-03, % identity: 69.57)
Query:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ
        VIFMEDK+Q V DDSTVN++S   TVHVEKES +DS E  P HEIQ
Subjt:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ

A0A5D3CXA6 Uncharacterized protein (e value: 7.2e-80, % identity: 53.22)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNGTNFSLWK+KMKAI RKDN L     RP ++TDD KWNEMD NA+AN+HL                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MS ST MTEH+NTLNTLFSQLT LG KIEPN       + LPDSYDQLVINLTNN+LTDYL+F+ +   V+EEENR KNK DKL SSQQAEVL
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKS--------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG
         VTR R  E  SSGS+NQGR+KS                          ++SNPQGNVAST N+G VL C+A TT+EG+    D     +   +K++   
Subjt:  AVTRGRSTERGSSGSQNQGRAKS--------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG

Query:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
                  V  + KR     +KV  NLYMLEGETLQ+GEASVAS S  E LSM+W
Subjt:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

A0A5D3CXA6 Uncharacterized protein (e value: 5.5e-03, % identity: 69.57)
Query:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ
        VIFMEDK+Q V DDSTVN++S   TVHVEKES +DS E  P HEIQ
Subjt:  VIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQ

A0A5D3CXA6 Uncharacterized protein (e value: 2.7e-79, % identity: 52.1)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNGTNFSLWK+KMKAI RKDN L     RP ++TDD KWNEMD NA+AN+HL                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MS ST MTEH+NTLNTLFSQLT LG KIEPN       + LPDSYDQLVINLTNN+LTDYL+F+ +   V+EEENR KNK DKL SSQQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPN-------ERLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSRNS--------------------------NPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG
         VTR R  E  SSGS+NQGR+KS++                           NPQGNVAST N+G  L CEA TT+EG+    D     +   +K++   
Subjt:  AVTRGRSTERGSSGSQNQGRAKSRNS--------------------------NPQGNVASTSNKGDVLCCEAATTVEGRKSLADDLCSADGHALKIVGIG

Query:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
         + +K                 +KV  NLYMLEGETLQ+GEASVAS S  E LSM+W
Subjt:  TIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

A0A6A2WWQ1 Uncharacterized protein (e value: 1.6e-82, % identity: 42.33)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        M  K++IEKFNG NFSLWK+KMKAILRKD  LA  + RP+D  DD KWNEMD NA+ N HL                            A SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL M EST +TEH+NTLNTLFSQLTSL CKI   ER       LPDSYDQL+INLTN+ +T  L F+ +  A+++EEN+ KNK DK  + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSR-------------------------NSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD-----------------
          TR RSTERG S S   GR+KSR                         NSNPQGN A+TS+ GD LCCEA+TTVEGRK  AD                 
Subjt:  AVTRGRSTERGSSGSQNQGRAKSR-------------------------NSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD-----------------

Query:  -----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH-----KRFE--------------------------------VEGKKVATNLYMLEGET
                    + S + HAL IVG+ TIKLK +D T++ +      K F+                                ++G+K+A NLYML+GET
Subjt:  -----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH-----KRFE--------------------------------VEGKKVATNLYMLEGET

Query:  LQDGEASVASRSPS------EKLSMIWV----------------------------------------------VIFMEDKRQAVEDDSTVNKNSETTTV
        L + EASVAS S        +KL  I V                                              VIF+EDK Q  EDD +  K SETT +
Subjt:  LQDGEASVASRSPS------EKLSMIWV----------------------------------------------VIFMEDKRQAVEDDSTVNKNSETTTV

Query:  HVEKESEE-DSSEGEPTHEIQEPKGSQVPETRRSDRLTKPP
        +VEK+ E+ DSSE EP  + QEPK S+ P TR+SDR+ K P
Subjt:  HVEKESEE-DSSEGEPTHEIQEPKGSQVPETRRSDRLTKPP

Q6BCY1 Gag-Pol (e value: 1.6e-82, % identity: 48.32)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK
        MAAK+EIEKFNG NFSLWK+K+KAILRKDN LA  S RPVD TDD KW+EM+++A+A+L+L                            A+SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHL----------------------------ARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MSEST +TEH+NTLNTLFSQLTSL CKIEP ER       LPDSYDQL+INLTNN+LTDYL F+ +  AV+EEE+R KNK D+  + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKS------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD------------------
         V RGRSTERG   S  +GR+KS                        +NSNPQGNVASTS+ G  LCCEA+   EGRK  AD                  
Subjt:  AVTRGRSTERGSSGSQNQGRAKS------------------------RNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD------------------

Query:  ----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH--------------------------------KRFE-----VEGKKVATNLYMLEGETL
                   + S D HAL+I+GIGTIKLK +D TV+T+                                 K F+     ++G+K+A NLYML+GETL
Subjt:  ----------DLCSADGHALKIVGIGTIKLKFHDNTVRTIH--------------------------------KRFE-----VEGKKVATNLYMLEGETL

Query:  QDGEASVASRSPSEKL
        Q+ EASVA+ SP   L
Subjt:  QDGEASVASRSPSEKL

Q6BCY1 Gag-Pol (e value: 1.2e-05, % identity: 33.33)
Query:  VIFMEDKRQAVE-DDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQEPKGSQVPETRRSDRLTKPPVGSLITLWRGVWSRRNDKAF---NPDLDRTT--
        VIF+ED+ Q  E DDST  +  ETT + VE+E E+DSSE EP HE  EP+ S  P TR+SDR  + P       W   +    + A+     D + +T  
Subjt:  VIFMEDKRQAVE-DDSTVNKNSETTTVHVEKESEEDSSEGEPTHEIQEPKGSQVPETRRSDRLTKPPVGSLITLWRGVWSRRNDKAF---NPDLDRTT--

Query:  -----NDLSAWASSYITAFWTANSN------LLPRDHPPRETRWLPPIER
             +D+S W ++        + N       LP+   P   +W+  I+R
Subjt:  -----NDLSAWASSYITAFWTANSN------LLPRDHPPRETRWLPPIER

Q6BCY1 Gag-Pol (e value: 4.5e-82, % identity: 50)
Query:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHLA----------------------------RSLHNKIFLK
        MAAK+EIEKFNGTNFSLW +KMK +LR DN L      P ++TDD KWNEMD NA+ N+HLA                            +SLHNKIFLK
Subjt:  MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDD-KWNEMDKNAIANLHLA----------------------------RSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        R+LYTL MSEST MTEH+NTLNTLFSQL  LG KIEPNER       L DSYDQLVINL NN+L DYL+F+ +  AV+EEENR KNK DKL + QQAE L
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQGRAKSRNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD----------------------------DLCSADGHALKIVG
         VTRGR  E   +    +G++  ++SNPQGNVAST N+G  L CEA TT EG+K +AD                             L S + HALKIV 
Subjt:  AVTRGRSTERGSSGSQNQGRAKSRNSNPQGNVASTSNKGDVLCCEAATTVEGRKSLAD----------------------------DLCSADGHALKIVG

Query:  IGTIKLKFHDNTVRTIHKRFEVE-------------------------------------GKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW
        IGTIKLK HDN V TI +   VE                                     G+KV  NLYMLEGETLQ+GEASVAS S  E L M+W
Subjt:  IGTIKLKFHDNTVRTIHKRFEVE-------------------------------------GKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIW

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.5e-05, % identity: 21.86)
Query:  AKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDDKWNEMDKNA-------------------------IANL---HLARSLHNKIFLKRRL
        AK  I+ F+G  +++WK +++A+L + + L        +  DD W + ++ A                         + NL   +  +SL +++ L++RL
Subjt:  AKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDDKWNEMDKNA-------------------------IANL---HLARSLHNKIFLKRRL

Query:  YTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKN
         +L +S    +  H +  + L S+L + G KIE  ++       LP  YD  +I     +  + L    +   ++++E + KN
Subjt:  YTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-14, % identity: 29.2)
Query:  KYEIEKFNGTN-FSLWKMKMKAILRKD---NYLATSSARPVDVTDDKWNEMDKNA----------------------------IANLHLARSLHNKIFLK
        KYE+ KFNG N FS W+ +M+ +L +      L   S +P  +  + W ++D+ A                            + +L+++++L NK++LK
Subjt:  KYEIEKFNGTN-FSLWKMKMKAILRKD---NYLATSSARPVDVTDDKWNEMDKNA----------------------------IANLHLARSLHNKIFLK

Query:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL
        ++LY L MSE T    H+N  N L +QL +LG KIE  ++       LP SYD L   + +   T  +  + +  A++  E     K+ K   +Q   ++
Subjt:  RRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNER-------LPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVL

Query:  AVTRGRSTERGSSGSQNQG-RAKSRN
           RGRS +R S+     G R KS+N
Subjt:  AVTRGRSTERGSSGSQNQG-RAKSRN

Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 7.5e-05, % identity: 23.96)
Query:  LWRGVWSRRND-----KAFNPD--LDRTTNDLSAWASSYITAFWTANSNLLPRDHPPRETRWLPPIERVYKINIDASFSPIECNAGLGIIIRNFRGQVMA
        LWR +W  RN+     + FN    L R  +DL  W           +    P+ +     RW PP  +  K N DA+++      G+G ++RN +G+V  
Subjt:  LWRGVWSRRND-----KAFNPD--LDRTTNDLSAWASSYITAFWTANSNLLPRDHPPRETRWLPPIERVYKINIDASFSPIECNAGLGIIIRNFRGQVMA

Query:  SATKYLEHGQSVDNAEALATSEGLRLALEIGLLPVQLEMDSARIFNLFVHEVEDLSETGNIVQTVKMEVATVIHASYSFTKRDGNGVAHMLA
           + L   +SV  AE  A    +          V  E DS  +  +  ++ E        +Q ++  ++      + F  R+GN +A  +A
Subjt:  SATKYLEHGQSVDNAEALATSEGLRLALEIGLLPVQLEMDSARIFNLFVHEVEDLSETGNIVQTVKMEVATVIHASYSFTKRDGNGVAHMLA


Sequences
CDS sequence
ATGGCGGCAAAGTACGAGATTGAGAAATTCAACGGGACTAATTTCTCGTTGTGGAAGATGAAGATGAAGGCTATCTTGAGAAAAGATAATTACCTTGCAACCAGTAGTGC
GAGGCCAGTGGATGTCACAGATGATAAATGGAACGAGATGGATAAGAATGCTATTGCAAATCTTCATCTGGCCAGATCACTTCACAACAAGATTTTCCTTAAGAGGAGAT
TGTATACTCTTTGGATGTCAGAATCTACTCCAATGACAGAGCACATCAACACGTTGAATACTCTATTTTCTCAACTCACATCACTGGGTTGTAAAATAGAGCCAAATGAA
CGTCTTCCTGATTCGTATGATCAACTTGTCATCAACTTGACAAATAATGTTCTCACTGACTATCTAAACTTTGAAGCTATTGGAGTTGCTGTCATGGAAGAGGAAAATCG
GCACAAGAACAAAGTAGATAAGTTGGCGAGTTCGCAACAAGCAGAGGTTCTGGCGGTGACAAGAGGTAGATCAACGGAACGTGGCTCAAGTGGGAGCCAAAATCAAGGAA
GGGCAAAATCAAGAAATTCCAATCCTCAAGGAAATGTAGCAAGCACCTCAAATAAAGGTGATGTCTTGTGTTGTGAAGCAGCGACAACTGTTGAAGGCAGAAAGAGTCTA
GCTGACGATTTGTGTTCAGCTGACGGTCATGCCCTGAAAATTGTCGGTATTGGAACTATCAAGTTGAAGTTCCATGACAATACAGTTCGCACAATTCACAAGCGATTTGA
GGTGGAGGGAAAAAAGGTGGCTACAAACTTGTACATGTTGGAAGGAGAGACTTTACAAGATGGGGAAGCATCAGTTGCCTCAAGAAGTCCAAGTGAAAAGCTCTCGATGA
TCTGGGTTGTGATCTTTATGGAGGACAAGAGACAAGCTGTAGAAGATGATAGCACTGTAAATAAAAATTCAGAGACTACAACGGTACACGTGGAGAAAGAATCTGAAGAA
GACTCTTCTGAAGGTGAACCAACGCACGAGATCCAAGAACCAAAAGGATCACAAGTACCAGAGACTCGTAGATCAGATAGGCTGACAAAACCACCTGTTGGCAGTCTGAT
TACATTATGGAGAGGAGTTTGGAGTCGGAGGAATGATAAGGCTTTTAATCCAGATTTGGACAGGACAACCAATGATCTATCGGCTTGGGCCTCTTCTTACATCACAGCTT
TTTGGACAGCAAATTCTAATCTTCTACCGAGGGACCATCCTCCTAGAGAAACTCGTTGGCTTCCACCGATTGAAAGGGTATATAAGATTAACATTGATGCTTCATTCTCT
CCTATTGAATGTAATGCAGGTTTAGGGATTATAATTAGAAATTTTAGAGGCCAAGTGATGGCTTCTGCTACGAAATACCTAGAGCATGGTCAGTCGGTCGATAATGCAGA
GGCCTTAGCGACTTCGGAGGGCCTACGGTTGGCGTTGGAGATCGGTCTTCTTCCAGTGCAGCTGGAAATGGACTCTGCCAGAATCTTCAACCTTTTCGTTCATGAGGTGG
AGGATTTATCAGAAACGGGAAATATTGTTCAAACTGTCAAAATGGAGGTGGCTACAGTCATTCATGCATCCTACAGCTTCACGAAAAGAGATGGTAACGGAGTAGCGCAC
ATGCTTGCTTAG
mRNA sequence
ATGGCGGCAAAGTACGAGATTGAGAAATTCAACGGGACTAATTTCTCGTTGTGGAAGATGAAGATGAAGGCTATCTTGAGAAAAGATAATTACCTTGCAACCAGTAGTGC
GAGGCCAGTGGATGTCACAGATGATAAATGGAACGAGATGGATAAGAATGCTATTGCAAATCTTCATCTGGCCAGATCACTTCACAACAAGATTTTCCTTAAGAGGAGAT
TGTATACTCTTTGGATGTCAGAATCTACTCCAATGACAGAGCACATCAACACGTTGAATACTCTATTTTCTCAACTCACATCACTGGGTTGTAAAATAGAGCCAAATGAA
CGTCTTCCTGATTCGTATGATCAACTTGTCATCAACTTGACAAATAATGTTCTCACTGACTATCTAAACTTTGAAGCTATTGGAGTTGCTGTCATGGAAGAGGAAAATCG
GCACAAGAACAAAGTAGATAAGTTGGCGAGTTCGCAACAAGCAGAGGTTCTGGCGGTGACAAGAGGTAGATCAACGGAACGTGGCTCAAGTGGGAGCCAAAATCAAGGAA
GGGCAAAATCAAGAAATTCCAATCCTCAAGGAAATGTAGCAAGCACCTCAAATAAAGGTGATGTCTTGTGTTGTGAAGCAGCGACAACTGTTGAAGGCAGAAAGAGTCTA
GCTGACGATTTGTGTTCAGCTGACGGTCATGCCCTGAAAATTGTCGGTATTGGAACTATCAAGTTGAAGTTCCATGACAATACAGTTCGCACAATTCACAAGCGATTTGA
GGTGGAGGGAAAAAAGGTGGCTACAAACTTGTACATGTTGGAAGGAGAGACTTTACAAGATGGGGAAGCATCAGTTGCCTCAAGAAGTCCAAGTGAAAAGCTCTCGATGA
TCTGGGTTGTGATCTTTATGGAGGACAAGAGACAAGCTGTAGAAGATGATAGCACTGTAAATAAAAATTCAGAGACTACAACGGTACACGTGGAGAAAGAATCTGAAGAA
GACTCTTCTGAAGGTGAACCAACGCACGAGATCCAAGAACCAAAAGGATCACAAGTACCAGAGACTCGTAGATCAGATAGGCTGACAAAACCACCTGTTGGCAGTCTGAT
TACATTATGGAGAGGAGTTTGGAGTCGGAGGAATGATAAGGCTTTTAATCCAGATTTGGACAGGACAACCAATGATCTATCGGCTTGGGCCTCTTCTTACATCACAGCTT
TTTGGACAGCAAATTCTAATCTTCTACCGAGGGACCATCCTCCTAGAGAAACTCGTTGGCTTCCACCGATTGAAAGGGTATATAAGATTAACATTGATGCTTCATTCTCT
CCTATTGAATGTAATGCAGGTTTAGGGATTATAATTAGAAATTTTAGAGGCCAAGTGATGGCTTCTGCTACGAAATACCTAGAGCATGGTCAGTCGGTCGATAATGCAGA
GGCCTTAGCGACTTCGGAGGGCCTACGGTTGGCGTTGGAGATCGGTCTTCTTCCAGTGCAGCTGGAAATGGACTCTGCCAGAATCTTCAACCTTTTCGTTCATGAGGTGG
AGGATTTATCAGAAACGGGAAATATTGTTCAAACTGTCAAAATGGAGGTGGCTACAGTCATTCATGCATCCTACAGCTTCACGAAAAGAGATGGTAACGGAGTAGCGCAC
ATGCTTGCTTAG
Protein sequence
MAAKYEIEKFNGTNFSLWKMKMKAILRKDNYLATSSARPVDVTDDKWNEMDKNAIANLHLARSLHNKIFLKRRLYTLWMSESTPMTEHINTLNTLFSQLTSLGCKIEPNE
RLPDSYDQLVINLTNNVLTDYLNFEAIGVAVMEEENRHKNKVDKLASSQQAEVLAVTRGRSTERGSSGSQNQGRAKSRNSNPQGNVASTSNKGDVLCCEAATTVEGRKSL
ADDLCSADGHALKIVGIGTIKLKFHDNTVRTIHKRFEVEGKKVATNLYMLEGETLQDGEASVASRSPSEKLSMIWVVIFMEDKRQAVEDDSTVNKNSETTTVHVEKESEE
DSSEGEPTHEIQEPKGSQVPETRRSDRLTKPPVGSLITLWRGVWSRRNDKAFNPDLDRTTNDLSAWASSYITAFWTANSNLLPRDHPPRETRWLPPIERVYKINIDASFS
PIECNAGLGIIIRNFRGQVMASATKYLEHGQSVDNAEALATSEGLRLALEIGLLPVQLEMDSARIFNLFVHEVEDLSETGNIVQTVKMEVATVIHASYSFTKRDGNGVAH
MLA
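
The protein sequence listed here is what the CDS translates to. The following is a minimal Python sketch of that translation. It assumes the standard nuclear genetic code (NCBI translation table 1), and `translate` is a hypothetical helper name, not a CuGenDB function. Only the first ten codons and the last four codons of the CDS are used as inputs, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed in this record.
print(translate("ATGGCGGCAAAGTACGAGATTGAGAAATTC"))
print(translate("ATGCTTGCTTAG"))
```

The two fragments translate to MAAKYEIEKF and MLA, matching the start and end of the protein sequence shown here. Pasting the full CDS, line breaks included, into `translate` should give the full 553-residue protein under the same assumption.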