CuGenDBv2

Moc07g12030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g12030
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr7:9157754..9170797
RNA-Seq Expression: Moc07g12030
Synteny: Moc07g12030
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR000467 - G-patch domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
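
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so
    length = end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location string taken from this record.
print(parse_location("chr7:9157754..9170797"))
# ('chr7', 9157754, 9170797, 13044)
```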


Homology
GenBank top hits (e value, % identity, alignment)
XP_012436090.1 PREDICTED: uncharacterized protein LOC105762750 [Gossypium raimondii] | e value: 3.6e-121 | % identity: 76.29
Query:  SLDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIA----------------
        ++DPAK KAVLDMPEPTSAK+ KSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIA                
Subjt:  SLDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIA----------------

Query:  ---------AKKLRHYMLAHKIKLVVGANPIRSTGPSKDPIELSEEIPGDLEQIATLEDGQWTLYFDESSTAKEE-------------------------
                 A+KLRHYMLAHKIKLVVGANPIRSTGPSKD IELSEEIPG++EQIAT+EDGQWTLYF ESSTAKEE                         
Subjt:  ---------AKKLRHYMLAHKIKLVVGANPIRSTGPSKDPIELSEEIPGDLEQIATLEDGQWTLYFDESSTAKEE-------------------------

Query:  -----EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDVELTHVNRSTNRHADSLATLASKIKFED--DEAVTK
             EYEALAIGLSIAKEMKI K KVVGDSNL VRQT GTFALKEISLAPYQ LV KLC EIDVELTHVNR TNRHADSLATLASKIKFED  DEAV K
Subjt:  -----EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDVELTHVNRSTNRHADSLATLASKIKFED--DEAVTK

Query:  ISKRRTPAHGTLTISYMRMSLKKEIGDML
        ISKRRTPAHGTLTISYM+MSLKKEIGDML
Subjt:  ISKRRTPAHGTLTISYMRMSLKKEIGDML

XP_021833842.1 uncharacterized protein LOC110773621 [Prunus avium] | e value: 2.1e-44 | % identity: 33.5
Query:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN-----------------------SLDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA
        E YVDD++ KSKT EGH E L++VLER R   LKMN                        +DP KT+A+  +  P + KE KSF+G+ SY+RRF+PGLAA
Subjt:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN-----------------------SLDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA

Query:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI----------------------------------------------------------
          +    L KK   Y+W +  ++A+ ++++ +   P M API                                                          
Subjt:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI----------------------------------------------------------

Query:  AAKKLRHYMLAHKIKLVVGANPIRSTGPSKDPIELSEEIPGDLEQIATL--EDGQWTLYFDESSTA-----------------------------KEEEY
        AA++LRHY LAHK++L+V +  +    P ++   LSEE+ G+L +I  +  E+  WTLYFD SST+                                EY
Subjt:  AAKKLRHYMLAHKIKLVVGANPIRSTGPSKDPIELSEEIPGDLEQIATL--EDGQWTLYFDESSTA-----------------------------KEEEY

Query:  EALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEI-DVELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTPA
        EA  +G+S AKEM I++ K++GDSNL + Q  G+FA+KE +LAPY+    KL      V L H+  +TNR+AD+LATL SK+ F  ++    + KR  PA
Subjt:  EALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEI-DVELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTPA

XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia] | e value: 8.8e-51 | % identity: 91.67
Query:  MANLGDEGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQAKERVPPPPPPRVMDEDEYEGDGSDHWEDDQATVLAGPRGGDRHVGRGLDRGRGR
        MANLG EGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQA+ RVPPPP  RVMDEDEYEGDGSDHWEDDQATVLA PRGGD HVG GLDRGRGR
Subjt:  MANLGDEGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQAKERVPPPPPPRVMDEDEYEGDGSDHWEDDQATVLAGPRGGDRHVGRGLDRGRGR

Query:  GRGYHNFQRAPRANAQVLRS
        GRGYHNFQRAPRANAQV R+
Subjt:  GRGYHNFQRAPRANAQVLRS

XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia] | e value: 5.1e-06 | % identity: 39.51
Query:  LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP
        +D  K KA+ + P PT+  E +SF G AS+ RRF+   + L++PL EL+ K V + W      AF  +K+ L NAP++  P
Subjt:  LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP

XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia] | e value: 1.3e-49 | % identity: 68.25
Query:  MFRSMRHLPGSGLGRRHQGIVEPIHVQAIEPPFGLGYIPTEEDLQKLEDKKKKKKNKSLRRKKKRKTVESEIEEEYQLQLMFT--QDGATGEQGMVSMKI
        MFRSMR+LPGSGLGR HQGIVEPIHVQAIEPPFGLG    +                     KK  T    +   +   L  T  QDGA GEQGMVSM I
Subjt:  MFRSMRHLPGSGLGRRHQGIVEPIHVQAIEPPFGLGYIPTEEDLQKLEDKKKKKKNKSLRRKKKRKTVESEIEEEYQLQLMFT--QDGATGEQGMVSMKI

Query:  ANPVAKDPSTLIIPTDGAPRNWSSLPQLQSFSVRDEPVLLKSVESLFNVTSSEDGLKSVPSVSVIISEVE-SALEARESVIKELVVDEE
        A+PVAKDPSTLIIP DGAPRNW+SLPQL+S SV  EPVL+KSVESLFNVTSSEDGLKSVPSVSVIISEVE SAL ARESVIKELVV  E
Subjt:  ANPVAKDPSTLIIPTDGAPRNWSSLPQLQSFSVRDEPVLLKSVESLFNVTSSEDGLKSVPSVSVIISEVE-SALEARESVIKELVVDEE

XP_023908129.1 uncharacterized protein LOC112019817 [Quercus suber] | e value: 1.5e-42 | % identity: 31.59
Query:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA
        E Y DDI+ KSKT E +   L++V ER R+ +L+MN L                       DP K KA++ M  P S KE KSFLGK SY+ RF+PGLAA
Subjt:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA

Query:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP----------------------------------------------------------I
          AP   LLKK ++++W   H+ AF K+++ +   P + +P                                                           
Subjt:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP----------------------------------------------------------I

Query:  AAKKLRHYMLAHKIKLVVGANPIR-------------------------STGP----------------SKDPIELSEEIPGDLEQIATLEDGQ--WTLY
        AA+KLRHY+LA+ + L+  +NPIR                         S  P                 +D  ++++E+PG++ ++A LE+ +  W + 
Subjt:  AAKKLRHYMLAHKIKLVVGANPIR-------------------------STGP----------------SKDPIELSEEIPGDLEQIATLEDGQ--WTLY

Query:  FDESSTAKEE-----------------------------EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEID-V
        FD SS +  E                             EYEA   GLSIA+EM+IK+ KV GDSNL V Q  G F+LKE SLAPY+ +  +L    D +
Subjt:  FDESSTAKEE-----------------------------EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEID-V

Query:  ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTP
         + H  RS NRHAD++ TL SKI FE +     I KR  P
Subjt:  ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTP

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FL36 Reverse transcriptase | e value: 7.5e-48 | % identity: 34.47
Query:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA
        E YVDD++ K++T EGH + L+RV ER R+ +L MN L                       DP K KA+  M  PT+ KE KSFLGK SY+RRF+PGLAA
Subjt:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA

Query:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI------------------------------AAKKLRHYMLAHKIKLVVGANPIR----
        ++     LLKK V+Y+W +  +Q F +++  +AN P + API                              A KKLRHY +AH + L+  ++PIR    
Subjt:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI------------------------------AAKKLRHYMLAHKIKLVVGANPIR----

Query:  -------------------------------------STGPSKDPIELSEEIPGDLEQIATLE--DGQWTLYFDESST----------AKEE--------
                                             +  P +D   ++EE+PG++ ++A  E  D  WTL FD SST           +EE        
Subjt:  -------------------------------------STGPSKDPIELSEEIPGDLEQIATLE--DGQWTLYFDESST----------AKEE--------

Query:  -----------EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKL-CYEIDVELTHVNRSTNRHADSLATLASKIKFEDD
                   EYEA   GL++A+E++IK+ KV GDSNL V Q  G FAL E SLAPY+ +  +L  Y  ++ + H  RS NRHAD+L TL SK  F+ +
Subjt:  -----------EYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKL-CYEIDVELTHVNRSTNRHADSLATLASKIKFEDD

Query:  EAVTKISKRRTP
             I KR +P
Subjt:  EAVTKISKRRTP

A0A2N9G586 Ribonuclease H | e value: 4.9e-47 | % identity: 35.8
Query:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA
        E YVDDI+ KSK  E H E L+RV +R R  +LKMN L                       DPAK  A+  M  P S KE KSFLG+ SY+RRF+PGLAA
Subjt:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA

Query:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI----------------------------------------------------------
        ++     L+KK V + W    +QAF KI+  +   P + API                                                          
Subjt:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPI----------------------------------------------------------

Query:  AAKKLRHYMLAHKIKLVVGANPIRS----------------TGPSKDPIELSEEIPGDLEQ--IATLEDGQWTLYFDESSTAKEE--EYEALAIGLSIAK
        A+++LRHY LAHKI+L+  ++PIRS                    +D   +S E+PG + +  +  L D  WTL FD SST+     EYEA   GL+IA 
Subjt:  AAKKLRHYMLAHKIKLVVGANPIRS----------------TGPSKDPIELSEEIPGDLEQ--IATLEDGQWTLYFDESSTAKEE--EYEALAIGLSIAK

Query:  EMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDV-ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTPAHGTLTISYMRM
        EM IK  +V+GDSNL + QT G F+LKE SLA Y+ L  KL  + D  E++H  R  NR+AD+LATL S++ FE  +    I KR  P    L   +   
Subjt:  EMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDV-ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTPAHGTLTISYMRM

Query:  SLKKE
        +L  E
Subjt:  SLKKE

A0A2N9HJV8 RNase H domain-containing protein | e value: 2.9e-47 | % identity: 35.22
Query:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA
        E YVDDI+ KSK  E H E L++V ER R  +LKMN L                       DPAK  A+  M  PTS KE KSFLG+ SY+RRF+PGLAA
Subjt:  EVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSL-----------------------DPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAA

Query:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAK------------------------KLRHYMLAHKIKLVVGANPIRS---------
        +++    LLKK V + W    ++AF +I+  +   P + APIA K                        +LRHY LAHKI+L+  ++PIRS         
Subjt:  LSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAK------------------------KLRHYMLAHKIKLVVGANPIRS---------

Query:  --------------------------------TGPSKDPIELSEEIPGDL--EQIATLEDGQWTLYFDESSTA---------------------------
                                          P +D   +S+E+PG++    +  + D  WTL FD SST+                           
Subjt:  --------------------------------TGPSKDPIELSEEIPGDL--EQIATLEDGQWTLYFDESSTA---------------------------

Query:  --KEEEYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDV-ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKI
             EYEA   GL+IA EM IK  +V+GDSNL V Q  G F+LKE SLAPY+ L  +L  +    E+TH  RS NR+AD+LA L S++ FE   A   I
Subjt:  --KEEEYEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDV-ELTHVNRSTNRHADSLATLASKIKFEDDEAVTKI

Query:  SKRRTPAHGTLTISYMRMSLKKE
        +KR TP    L   Y      KE
Subjt:  SKRRTPAHGTLTISYMRMSLKKE

A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 | e value: 4.3e-51 | % identity: 91.67
Query:  MANLGDEGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQAKERVPPPPPPRVMDEDEYEGDGSDHWEDDQATVLAGPRGGDRHVGRGLDRGRGR
        MANLG EGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQA+ RVPPPP  RVMDEDEYEGDGSDHWEDDQATVLA PRGGD HVG GLDRGRGR
Subjt:  MANLGDEGNISDARRREALEGTVERMIRSMEALTERIGRLETQNQAKERVPPPPPPRVMDEDEYEGDGSDHWEDDQATVLAGPRGGDRHVGRGLDRGRGR

Query:  GRGYHNFQRAPRANAQVLRS
        GRGYHNFQRAPRANAQV R+
Subjt:  GRGYHNFQRAPRANAQVLRS

A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 | e value: 2.5e-06 | % identity: 39.51
Query:  LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP
        +D  K KA+ + P PT+  E +SF G AS+ RRF+   + L++PL EL+ K V + W      AF  +K+ L NAP++  P
Subjt:  LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAP

A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 | e value: 6.2e-50 | % identity: 68.25
Query:  MFRSMRHLPGSGLGRRHQGIVEPIHVQAIEPPFGLGYIPTEEDLQKLEDKKKKKKNKSLRRKKKRKTVESEIEEEYQLQLMFT--QDGATGEQGMVSMKI
        MFRSMR+LPGSGLGR HQGIVEPIHVQAIEPPFGLG    +                     KK  T    +   +   L  T  QDGA GEQGMVSM I
Subjt:  MFRSMRHLPGSGLGRRHQGIVEPIHVQAIEPPFGLGYIPTEEDLQKLEDKKKKKKNKSLRRKKKRKTVESEIEEEYQLQLMFT--QDGATGEQGMVSMKI

Query:  ANPVAKDPSTLIIPTDGAPRNWSSLPQLQSFSVRDEPVLLKSVESLFNVTSSEDGLKSVPSVSVIISEVE-SALEARESVIKELVVDEE
        A+PVAKDPSTLIIP DGAPRNW+SLPQL+S SV  EPVL+KSVESLFNVTSSEDGLKSVPSVSVIISEVE SAL ARESVIKELVV  E
Subjt:  ANPVAKDPSTLIIPTDGAPRNWSSLPQLQSFSVRDEPVLLKSVESLFNVTSSEDGLKSVPSVSVIISEVE-SALEARESVIKELVVDEE

SwissProt top hits (e value, % identity, alignment)
P03359 Gag-Pol polyprotein | e value: 5.8e-13 | % identity: 31.54
Query:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS
        YVDD+L  + T     E  Q++L+          ++K+QL                  L PA+   V+ +P PT+ ++ + FLG A + R ++PG A+L+
Subjt:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS

Query:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM
        APL  L K+ + + W E H++AF +IKE L +AP +  P   K    Y+
Subjt:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM

P0CT34 Transposon Tf2-1 polyprotein | e value: 1.1e-11 | % identity: 31.61
Query:  LGTADEPQEV-YVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN---------------------SLDPAKTK--AVLDMPEPTSAKEFKSFLGKASYL
        LG A E   V Y+DDIL  SK+   H + ++ VL++ + + L +N                        P +     VL   +P + KE + FLG  +YL
Subjt:  LGTADEPQEV-YVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN---------------------SLDPAKTK--AVLDMPEPTSAKEFKSFLGKASYL

Query:  RRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYMLAHKIKLVVGANPI
        R+F+P  + L+ PL  LLKK V +KW     QA   IK+ L + PV         LRH+  + KI L   A+ +
Subjt:  RRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYMLAHKIKLVVGANPI

P0CT41 Transposon Tf2-12 polyprotein | e value: 1.1e-11 | % identity: 31.61
Query:  LGTADEPQEV-YVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN---------------------SLDPAKTK--AVLDMPEPTSAKEFKSFLGKASYL
        LG A E   V Y+DDIL  SK+   H + ++ VL++ + + L +N                        P +     VL   +P + KE + FLG  +YL
Subjt:  LGTADEPQEV-YVDDILFKSKTGEGHPEALQRVLERSRKSQLKMN---------------------SLDPAKTK--AVLDMPEPTSAKEFKSFLGKASYL

Query:  RRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYMLAHKIKLVVGANPI
        R+F+P  + L+ PL  LLKK V +KW     QA   IK+ L + PV         LRH+  + KI L   A+ +
Subjt:  RRFLPGLAALSAPLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYMLAHKIKLVVGANPI

P21414 Gag-Pol polyprotein | e value: 1.3e-12 | % identity: 31.54
Query:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS
        YVDD+L  + T E   +  Q++L+          ++K+QL                  L PA+   V+ +P PT+ ++ + FLG A + R ++PG A+L+
Subjt:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS

Query:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM
        APL  L K+ + + W E H+QAF  IK+ L +AP +  P   K    Y+
Subjt:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM

Q9TTC1 Gag-Pol polyprotein | e value: 6.9e-14 | % identity: 32.21
Query:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS
        YVDD+L  + T     E  +R+L+          ++K+QL                  L PA+   V+ +P PT+ ++ + FLG A + R ++PG A+L+
Subjt:  YVDDILFKSKTGEGHPEALQRVLER---------SRKSQLKMNS--------------LDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALS

Query:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM
        APL  L ++KV + W E H++AF +IKE L +AP +  P   K    Y+
Subjt:  APLMELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYM

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCATTTGCATCTTGCATATAGTCCCGAAATAGGAAAACCCGGTGGGAAGTCTTGGTTGCTACGGTCCAGGGAGCCGAGTCAGACCATCCCCAAAACCCACCAC
CTGCTGCCAAGTACTCCTCTTAGGAGGGTATACCAAACTACTATGTCAGATGAACCCGAAATCCCGGTTCCGCCTGAACCAACCACTTTCGACATTCTCAAGGCT
CTGTTGGCCCAAAAGGAAAGCTTGAACCAGGCCATCAATAAGTCAGCTCCTTCTACTGCACCTCAAGCACCTCCGGTTCAGGCCACAGCCCTTCCTTCCCCTAAA
GCCAGAACTGAAATGGAAAAATGGTTGAAAGAACAAGGGGAGCTGCTAGCCAAGTTACTGGAAGAGTTGCAGAAAGTAAAGGGTCTAGAGACCATGGGTCAACAT
GAAGGGATTGACCATTTTTCCAGAGGAAAAATCTCCAGAAAAGACTTTCACTCCGTTAGGCATAACTCTTGCCGCGCCTTGGAAAAGCTTCAAAAGCATGGTGTA
CTCACGCCTCTTCCCCCATCTCCGCCACCAGATCCTCTTCCTCCTAGGTACAAAGCCAATGCCTACTGCAACATCAGACAAATTGCCAAAATACTGACCGGTGTT
GATATGAACCAAGGAGCTACCGTTGTTATGCCTCATGGACCGATTACAAGATCGAGAGCTAGGAAACTTCAACAAGCTTTGCTACTTCGTGTCCAAGCGTTCTTG
AACTCGACAAGAGAAATAAAGGAAGATGTGCAAGAAGTAATCCGTTCTAAAACTTTTCCTTGGGTTGATTCCATACTTGTTTTCGGGGTTTTTGGGATTGAGTTC
TATGGGTCTTGTTCGTTAGCAAGCATGGCTAATCTTGGAGATGAAGGCAATATTTCTGATGCCCGGAGGAGAGAGGCGCTAGAAGGAACTGTTGAACGAATGATT
CGTAGCATGGAGGCATTGACAGAACGGATAGGAAGGTTGGAGACTCAAAACCAAGCTAAGGAGAGAGTTCCACCACCTCCACCACCACGTGTTATGGATGAGGAT
GAGTATGAAGGTGATGGATCCGATCATTGGGAGGACGATCAGGCTACAGTTTTAGCGGGTCCAAGAGGGGGTGATAGACACGTTGGTCGTGGCTTAGATCGAGGG
AGAGGCCGGGGAAGAGGCTATCATAATTTCCAAAGAGCCCCACGAGCGAATGCTCAAGTTCTTAGATCCAACTCTCAAGGGTTTCTTAACAGGTGTTTAAGGCTG
AAACATGACATCCAAGACCTTATTGATTCAGAGAAAATTGAGCCACCAAGGGCAAACCGCCCCAATGTCACTACCAATCCCCTACCCACTCATGCTGTACCTCCT
CCTGCCAATATAAACATGATCGAGCCGACATTTTCGAGTTGGGATCCGTCTCTCCTTATCACGCCAGCCGGAGAAAAGGGAGAAACGGATGGTAGAGACATGATT
GGAATCCGGGAGACGAGATTGAGCCCCCACCTCGAGTTCACATGCTCCGGTGGGATGATTTCTACGACAATCTCCCAACCCCTGAACAACCCCTCCAATTACTCC
CGGAAAGCCCAGAAAGTGAAACATTTATCCTTTACCCTCGCTCGAATTCCTCAACCCTTTGATGAAAGCCCTAGTTGCTCGGACCCTACGGACCTAAAACCCCTC
CCCGCATCTTTGTGGATGGACTCCGACGAACTGGATCAATACGGCACCAATGATGATATGGGGATTTTGTCAACAATGAGGAGGCCCAACACATGGCATCTAAGA
GCCACCGTGATCCTGTCATGCACAAACACACGGAAAAGGGTGCCAATGGTTCTGGTTGACAATGGATCGGCCCTGAACGTGTGCCCATTCAGAACTGCCACTTGC
TTAAGATATCAGCAAAAAGATTTCGCAGTCTTAGCGCAGGCCATTAGAGCATATGACAACACAGGGGGGCCGTGGATTCATGCCATAGAAGCGGTCCCCTCTACT
TTGCATCATCAAGTCGACGTTCATACAGGGGAATCATTCAATTTTTCGATCTTTGGAGATCCAGAAGAGCCGGTGCTGGACCCTATCCCAGTGCGAGAAATTCAG
CACGATGAGAACTTGGAATTGGCAGGATTCCAGTTCGAGCAAGTGCACGTCATTGAGTTGGCAGAACCAGTCAAGGAATATTTAGCCCTGCCCGATTTTAGTGAC
AATGTCTTTGTCAGGGAAATGTTCCGATCAATGCGCCACCTCCCCGGATCAGGACTCGGCCGGCGTCACCAAGGAATCGTGGAGCCAATTCATGTCCAGGCAATT
GAGCCACCTTTTGGGTTGGGGTATATACCTACGGAAGAAGATCTCCAAAAATTGGAAGACAAGAAGAAGAAAAAGAAGAATAAGAGTCTCAGAAGAAAGAAGAAA
AGGAAGACTGTAGAATCTGAGATTGAGGAAGAATACCAGCTTCAACTTATGTTTACCCAGGACGGTGCGACAGGGGAACAAGGGATGGTGTCCATGAAAATTGCG
AACCCAGTGGCAAAAGACCCATCCACTCTCATCATCCCAACCGATGGAGCACCACGCAATTGGTCTTCTCTTCCTCAGCTTCAGTCTTTCAGTGTTCGAGATGAA
CCAGTCTTACTCAAGTCTGTTGAGTCCCTGTTTAATGTAACCTCTTCGGAGGATGGTCTGAAGTCTGTCCCGTCAGTGTCTGTTATCATTTCTGAAGTCGAGTCG
GCCCTGGAGGCCAGAGAGTCTGTAATCAAGGAACTTGTGGTAGATGAGGAAGAAACCTATGTTGAAACTCCTCCTTCACTCTTGGAAACGGTGGAGAGATCAGAG
GAAAGGAAGGCTCAACCCGTCATGGAAGAAACCCAACCCATTCATTTAGGGACCGCAGACGAACCCCAAGAGGTGTACGTCGACGATATACTTTTCAAATCCAAG
ACAGGAGAAGGACATCCGGAAGCTCTACAAAGGGTTCTTGAAAGATCAAGAAAATCTCAGTTGAAGATGAACTCCCTAGACCCAGCTAAAACAAAAGCAGTGCTC
GATATGCCAGAGCCAACCTCAGCAAAAGAGTTTAAGTCATTTTTAGGAAAAGCTTCCTACTTGAGAAGGTTCTTACCAGGATTAGCAGCCTTATCGGCCCCTCTT
ATGGAATTGCTAAAGAAAAAGGTAGAGTACAAATGGGAAGAGGTCCATCGACAAGCCTTTGCAAAAATCAAGGAGACCTTAGCGAACGCCCCAGTGATGATGGCT
CCAATAGCAGCCAAAAAGTTAAGACATTATATGCTGGCGCACAAGATAAAGTTGGTTGTGGGGGCTAATCCGATAAGATCTACTGGCCCATCAAAAGACCCTATC
GAGCTTTCTGAAGAAATTCCTGGTGATCTCGAACAGATTGCAACATTAGAAGATGGGCAATGGACCTTATACTTTGATGAATCATCCACAGCCAAGGAAGAAGAA
TATGAGGCCTTAGCGATAGGGTTGTCTATCGCTAAAGAAATGAAGATCAAGAAGTTCAAAGTGGTTGGTGACTCTAATTTGGCGGTTCGACAGACAGGTGGGACA
TTCGCACTGAAGGAGATTTCCCTAGCACCTTATCAGTTCCTTGTGCTGAAGCTGTGTTATGAAATTGATGTGGAATTAACCCATGTCAACAGGTCTACCAATCGA
CATGCTGATTCTTTGGCCACTCTAGCTTCAAAGATCAAGTTCGAAGATGATGAAGCTGTCACTAAGATCAGCAAGCGACGAACTCCAGCTCATGGAACTTTAACG
ATAAGCTATATGAGGATGTCATTAAAGAAGGAGATTGGAGACATGCTGTGA
mRNA sequence
ATGCATTTGCATCTTGCATATAGTCCCGAAATAGGAAAACCCGGTGGGAAGTCTTGGTTGCTACGGTCCAGGGAGCCGAGTCAGACCATCCCCAAAACCCACCAC
CTGCTGCCAAGTACTCCTCTTAGGAGGGTATACCAAACTACTATGTCAGATGAACCCGAAATCCCGGTTCCGCCTGAACCAACCACTTTCGACATTCTCAAGGCT
CTGTTGGCCCAAAAGGAAAGCTTGAACCAGGCCATCAATAAGTCAGCTCCTTCTACTGCACCTCAAGCACCTCCGGTTCAGGCCACAGCCCTTCCTTCCCCTAAA
GCCAGAACTGAAATGGAAAAATGGTTGAAAGAACAAGGGGAGCTGCTAGCCAAGTTACTGGAAGAGTTGCAGAAAGTAAAGGGTCTAGAGACCATGGGTCAACAT
GAAGGGATTGACCATTTTTCCAGAGGAAAAATCTCCAGAAAAGACTTTCACTCCGTTAGGCATAACTCTTGCCGCGCCTTGGAAAAGCTTCAAAAGCATGGTGTA
CTCACGCCTCTTCCCCCATCTCCGCCACCAGATCCTCTTCCTCCTAGGTACAAAGCCAATGCCTACTGCAACATCAGACAAATTGCCAAAATACTGACCGGTGTT
GATATGAACCAAGGAGCTACCGTTGTTATGCCTCATGGACCGATTACAAGATCGAGAGCTAGGAAACTTCAACAAGCTTTGCTACTTCGTGTCCAAGCGTTCTTG
AACTCGACAAGAGAAATAAAGGAAGATGTGCAAGAAGTAATCCGTTCTAAAACTTTTCCTTGGGTTGATTCCATACTTGTTTTCGGGGTTTTTGGGATTGAGTTC
TATGGGTCTTGTTCGTTAGCAAGCATGGCTAATCTTGGAGATGAAGGCAATATTTCTGATGCCCGGAGGAGAGAGGCGCTAGAAGGAACTGTTGAACGAATGATT
CGTAGCATGGAGGCATTGACAGAACGGATAGGAAGGTTGGAGACTCAAAACCAAGCTAAGGAGAGAGTTCCACCACCTCCACCACCACGTGTTATGGATGAGGAT
GAGTATGAAGGTGATGGATCCGATCATTGGGAGGACGATCAGGCTACAGTTTTAGCGGGTCCAAGAGGGGGTGATAGACACGTTGGTCGTGGCTTAGATCGAGGG
AGAGGCCGGGGAAGAGGCTATCATAATTTCCAAAGAGCCCCACGAGCGAATGCTCAAGTTCTTAGATCCAACTCTCAAGGGTTTCTTAACAGGTGTTTAAGGCTG
AAACATGACATCCAAGACCTTATTGATTCAGAGAAAATTGAGCCACCAAGGGCAAACCGCCCCAATGTCACTACCAATCCCCTACCCACTCATGCTGTACCTCCT
CCTGCCAATATAAACATGATCGAGCCGACATTTTCGAGTTGGGATCCGTCTCTCCTTATCACGCCAGCCGGAGAAAAGGGAGAAACGGATGGTAGAGACATGATT
GGAATCCGGGAGACGAGATTGAGCCCCCACCTCGAGTTCACATGCTCCGGTGGGATGATTTCTACGACAATCTCCCAACCCCTGAACAACCCCTCCAATTACTCC
CGGAAAGCCCAGAAAGTGAAACATTTATCCTTTACCCTCGCTCGAATTCCTCAACCCTTTGATGAAAGCCCTAGTTGCTCGGACCCTACGGACCTAAAACCCCTC
CCCGCATCTTTGTGGATGGACTCCGACGAACTGGATCAATACGGCACCAATGATGATATGGGGATTTTGTCAACAATGAGGAGGCCCAACACATGGCATCTAAGA
GCCACCGTGATCCTGTCATGCACAAACACACGGAAAAGGGTGCCAATGGTTCTGGTTGACAATGGATCGGCCCTGAACGTGTGCCCATTCAGAACTGCCACTTGC
TTAAGATATCAGCAAAAAGATTTCGCAGTCTTAGCGCAGGCCATTAGAGCATATGACAACACAGGGGGGCCGTGGATTCATGCCATAGAAGCGGTCCCCTCTACT
TTGCATCATCAAGTCGACGTTCATACAGGGGAATCATTCAATTTTTCGATCTTTGGAGATCCAGAAGAGCCGGTGCTGGACCCTATCCCAGTGCGAGAAATTCAG
CACGATGAGAACTTGGAATTGGCAGGATTCCAGTTCGAGCAAGTGCACGTCATTGAGTTGGCAGAACCAGTCAAGGAATATTTAGCCCTGCCCGATTTTAGTGAC
AATGTCTTTGTCAGGGAAATGTTCCGATCAATGCGCCACCTCCCCGGATCAGGACTCGGCCGGCGTCACCAAGGAATCGTGGAGCCAATTCATGTCCAGGCAATT
GAGCCACCTTTTGGGTTGGGGTATATACCTACGGAAGAAGATCTCCAAAAATTGGAAGACAAGAAGAAGAAAAAGAAGAATAAGAGTCTCAGAAGAAAGAAGAAA
AGGAAGACTGTAGAATCTGAGATTGAGGAAGAATACCAGCTTCAACTTATGTTTACCCAGGACGGTGCGACAGGGGAACAAGGGATGGTGTCCATGAAAATTGCG
AACCCAGTGGCAAAAGACCCATCCACTCTCATCATCCCAACCGATGGAGCACCACGCAATTGGTCTTCTCTTCCTCAGCTTCAGTCTTTCAGTGTTCGAGATGAA
CCAGTCTTACTCAAGTCTGTTGAGTCCCTGTTTAATGTAACCTCTTCGGAGGATGGTCTGAAGTCTGTCCCGTCAGTGTCTGTTATCATTTCTGAAGTCGAGTCG
GCCCTGGAGGCCAGAGAGTCTGTAATCAAGGAACTTGTGGTAGATGAGGAAGAAACCTATGTTGAAACTCCTCCTTCACTCTTGGAAACGGTGGAGAGATCAGAG
GAAAGGAAGGCTCAACCCGTCATGGAAGAAACCCAACCCATTCATTTAGGGACCGCAGACGAACCCCAAGAGGTGTACGTCGACGATATACTTTTCAAATCCAAG
ACAGGAGAAGGACATCCGGAAGCTCTACAAAGGGTTCTTGAAAGATCAAGAAAATCTCAGTTGAAGATGAACTCCCTAGACCCAGCTAAAACAAAAGCAGTGCTC
GATATGCCAGAGCCAACCTCAGCAAAAGAGTTTAAGTCATTTTTAGGAAAAGCTTCCTACTTGAGAAGGTTCTTACCAGGATTAGCAGCCTTATCGGCCCCTCTT
ATGGAATTGCTAAAGAAAAAGGTAGAGTACAAATGGGAAGAGGTCCATCGACAAGCCTTTGCAAAAATCAAGGAGACCTTAGCGAACGCCCCAGTGATGATGGCT
CCAATAGCAGCCAAAAAGTTAAGACATTATATGCTGGCGCACAAGATAAAGTTGGTTGTGGGGGCTAATCCGATAAGATCTACTGGCCCATCAAAAGACCCTATC
GAGCTTTCTGAAGAAATTCCTGGTGATCTCGAACAGATTGCAACATTAGAAGATGGGCAATGGACCTTATACTTTGATGAATCATCCACAGCCAAGGAAGAAGAA
TATGAGGCCTTAGCGATAGGGTTGTCTATCGCTAAAGAAATGAAGATCAAGAAGTTCAAAGTGGTTGGTGACTCTAATTTGGCGGTTCGACAGACAGGTGGGACA
TTCGCACTGAAGGAGATTTCCCTAGCACCTTATCAGTTCCTTGTGCTGAAGCTGTGTTATGAAATTGATGTGGAATTAACCCATGTCAACAGGTCTACCAATCGA
CATGCTGATTCTTTGGCCACTCTAGCTTCAAAGATCAAGTTCGAAGATGATGAAGCTGTCACTAAGATCAGCAAGCGACGAACTCCAGCTCATGGAACTTTAACG
ATAAGCTATATGAGGATGTCATTAAAGAAGGAGATTGGAGACATGCTGTGA
Protein sequence
MHLHLAYSPEIGKPGGKSWLLRSREPSQTIPKTHHLLPSTPLRRVYQTTMSDEPEIPVPPEPTTFDILKALLAQKESLNQAINKSAPSTAPQAPPVQATALPSPK
ARTEMEKWLKEQGELLAKLLEELQKVKGLETMGQHEGIDHFSRGKISRKDFHSVRHNSCRALEKLQKHGVLTPLPPSPPPDPLPPRYKANAYCNIRQIAKILTGV
DMNQGATVVMPHGPITRSRARKLQQALLLRVQAFLNSTREIKEDVQEVIRSKTFPWVDSILVFGVFGIEFYGSCSLASMANLGDEGNISDARRREALEGTVERMI
RSMEALTERIGRLETQNQAKERVPPPPPPRVMDEDEYEGDGSDHWEDDQATVLAGPRGGDRHVGRGLDRGRGRGRGYHNFQRAPRANAQVLRSNSQGFLNRCLRL
KHDIQDLIDSEKIEPPRANRPNVTTNPLPTHAVPPPANINMIEPTFSSWDPSLLITPAGEKGETDGRDMIGIRETRLSPHLEFTCSGGMISTTISQPLNNPSNYS
RKAQKVKHLSFTLARIPQPFDESPSCSDPTDLKPLPASLWMDSDELDQYGTNDDMGILSTMRRPNTWHLRATVILSCTNTRKRVPMVLVDNGSALNVCPFRTATC
LRYQQKDFAVLAQAIRAYDNTGGPWIHAIEAVPSTLHHQVDVHTGESFNFSIFGDPEEPVLDPIPVREIQHDENLELAGFQFEQVHVIELAEPVKEYLALPDFSD
NVFVREMFRSMRHLPGSGLGRRHQGIVEPIHVQAIEPPFGLGYIPTEEDLQKLEDKKKKKKNKSLRRKKKRKTVESEIEEEYQLQLMFTQDGATGEQGMVSMKIA
NPVAKDPSTLIIPTDGAPRNWSSLPQLQSFSVRDEPVLLKSVESLFNVTSSEDGLKSVPSVSVIISEVESALEARESVIKELVVDEEETYVETPPSLLETVERSE
ERKAQPVMEETQPIHLGTADEPQEVYVDDILFKSKTGEGHPEALQRVLERSRKSQLKMNSLDPAKTKAVLDMPEPTSAKEFKSFLGKASYLRRFLPGLAALSAPL
MELLKKKVEYKWEEVHRQAFAKIKETLANAPVMMAPIAAKKLRHYMLAHKIKLVVGANPIRSTGPSKDPIELSEEIPGDLEQIATLEDGQWTLYFDESSTAKEEE
YEALAIGLSIAKEMKIKKFKVVGDSNLAVRQTGGTFALKEISLAPYQFLVLKLCYEIDVELTHVNRSTNRHADSLATLASKIKFEDDEAVTKISKRRTPAHGTLT
ISYMRMSLKKEIGDML
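
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the `translate` helper and the `CODON_TABLE` name are hypothetical and not part of CuGenDB. The example uses only the first 21 nucleotides of the CDS, which correspond to the first seven residues of the protein sequence.

```python
from itertools import product

# Standard genetic code in the conventional compact layout:
# codons enumerated with bases in T, C, A, G order, third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    outside A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence in this record.
print(translate("ATGCATTTGCATCTTGCATAT"))  # MHLHLAY
```

Passing the full CDS block (line breaks included) to `translate` should allow a residue-by-residue comparison with the protein sequence in this record.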