; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007613 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007613
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:1961100..1968086
RNA-Seq ExpressionLag0007613
SyntenyLag0007613
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0050896 - response to stimulus (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNX96445.1 copia LTR rider [Trifolium pratense]5.1e-4368.6Show/hide
Query:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKL
        A  G ARH T AG PQQNG AER+NRTILE+VRCML+++ L ++FW EAVST  YLINR PSTAL+ KTP+E+WSGHPP L  L VFGC AYAH RQDK+
Subjt:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKL

Query:  QPRAKKCVFIGYSQGVKAYKL
        +PRA KC+F+GY +GVKAY+L
Subjt:  QPRAKKCVFIGYSQGVKAYKL

RVW99173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-4354.17Show/hide
Query:  KNRLDYIHSDLWGPSRIATHGG------------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTV
        +N+LDYIHSDLWGPSR+ + GG                              A H T    PQQNG AER NRTILE++RCMLS+S L ++FW EA  TV
Subjt:  KNRLDYIHSDLWGPSRIATHGG------------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTV

Query:  VYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        V+LINRSPS+AL FKTPQE W+G     +HL VFGC AY HT+ DKL+PRA KC+F+GY +GVK YKL
Subjt:  VYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

RZB42800.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]1.4e-4857.32Show/hide
Query:  LDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLI
        LDY+H+D WGP++  +H G                             ARH T AG PQQNG AER+NRTILE+VRCML ++ LP+IFW EA   VVYLI
Subjt:  LDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLI

Query:  NRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        N+ PST LNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+PRA KC+F+GY +GVK YKL
Subjt:  NRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

RZB65625.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]1.2e-4756.29Show/hide
Query:  KNRLDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVV
        K  LDY+H+DLWG ++  +H G                             ARH T AG PQQNG AER+NRTIL++VRCM  +  LP+IFW EA  T V
Subjt:  KNRLDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVV

Query:  YLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        YLIN+ PSTALNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+PRA KC+F+GY +GVK YKL
Subjt:  YLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

RZB80697.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]6.6e-4371.43Show/hide
Query:  HGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQP
        +G ARH T AG PQQNG AER+N TILE+VRCML ++ LP+IF  EA  TVVYLIN+ PSTALNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+P
Subjt:  HGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQP

Query:  RAKKCVFIGYSQGVKAYKL
        RA KC+F+GY +GVK YKL
Subjt:  RAKKCVFIGYSQGVKAYKL

TrEMBL top hitse value%identityAlignment
A0A2K3N065 Copia LTR rider2.5e-4368.6Show/hide
Query:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKL
        A  G ARH T AG PQQNG AER+NRTILE+VRCML+++ L ++FW EAVST  YLINR PSTAL+ KTP+E+WSGHPP L  L VFGC AYAH RQDK+
Subjt:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKL

Query:  QPRAKKCVFIGYSQGVKAYKL
        +PRA KC+F+GY +GVKAY+L
Subjt:  QPRAKKCVFIGYSQGVKAYKL

A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-4454.17Show/hide
Query:  KNRLDYIHSDLWGPSRIATHGG------------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTV
        +N+LDYIHSDLWGPSR+ + GG                              A H T    PQQNG AER NRTILE++RCMLS+S L ++FW EA  TV
Subjt:  KNRLDYIHSDLWGPSRIATHGG------------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTV

Query:  VYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        V+LINRSPS+AL FKTPQE W+G     +HL VFGC AY HT+ DKL+PRA KC+F+GY +GVK YKL
Subjt:  VYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

A0A445F227 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-4957.32Show/hide
Query:  LDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLI
        LDY+H+D WGP++  +H G                             ARH T AG PQQNG AER+NRTILE+VRCML ++ LP+IFW EA   VVYLI
Subjt:  LDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLI

Query:  NRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        N+ PST LNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+PRA KC+F+GY +GVK YKL
Subjt:  NRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

A0A445GWE1 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-4856.29Show/hide
Query:  KNRLDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVV
        K  LDY+H+DLWG ++  +H G                             ARH T AG PQQNG AER+NRTIL++VRCM  +  LP+IFW EA  T V
Subjt:  KNRLDYIHSDLWGPSRIATHGG-----------------------------ARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVV

Query:  YLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL
        YLIN+ PSTALNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+PRA KC+F+GY +GVK YKL
Subjt:  YLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL

A0A445I3R1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.2e-4371.43Show/hide
Query:  HGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQP
        +G ARH T AG PQQNG AER+N TILE+VRCML ++ LP+IF  EA  TVVYLIN+ PSTALNFKTP+EIWSGHPP LK L VFGC AYAH +QDKL+P
Subjt:  HGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQP

Query:  RAKKCVFIGYSQGVKAYKL
        RA KC+F+GY +GVK YKL
Subjt:  RAKKCVFIGYSQGVKAYKL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-2350.44Show/hide
Query:  GGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTAL--NFKTPQEIWSGHPPCLKHLHVFGCAAYAH--TRQDK
        G + HLT    PQ NG +ER  RTI EK R M+S +KL + FWGEAV T  YLINR PS AL  + KTP E+W    P LKHL VFG   Y H   +Q K
Subjt:  GGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTAL--NFKTPQEIWSGHPPCLKHLHVFGCAAYAH--TRQDK

Query:  LQPRAKKCVFIGY
           ++ K +F+GY
Subjt:  LQPRAKKCVFIGY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2849.19Show/hide
Query:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHT---RQ
        ++HG     T  G PQ NG AER NRTI+EKVR ML  +KLP+ FWGEAV T  YLINRSPS  L F+ P+ +W+       HL VFGC A+AH    ++
Subjt:  ATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHT---RQ

Query:  DKLQPRAKKCVFIGYSQGVKAYKL
         KL  ++  C+FIGY      Y+L
Subjt:  DKLQPRAKKCVFIGYSQGVKAYKL

P92512 Uncharacterized mitochondrial protein AtMg007104.5e-1851.22Show/hide
Query:  NRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKK
        NRTI+EKVR ML    LP+ F  +A +T V++IN+ PSTA+NF  P E+W    P   +L  FGC AY H  + KL+PRAKK
Subjt:  NRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-1739.5Show/hide
Query:  GARHLTAAGN-PQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTR---QDKL
        G  HLT+  + P+ NG +ER +R I+E    +LS++ +P+ +W  A +  VYLINR P+  L  ++P +   G  P    L VFGCA Y   R   Q KL
Subjt:  GARHLTAAGN-PQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTR---QDKL

Query:  QPRAKKCVFIGYSQGVKAY
          ++++CVF+GYS    AY
Subjt:  QPRAKKCVFIGYSQGVKAY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1834.72Show/hide
Query:  KNRLDYIHSDLWGP-----SRIATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHP
        + R+  ++SD  G        ++ HG +   +    P+ NG +ER +R I+E    +LS++ +P+ +W  A S  VYLINR P+  L  ++P +   G P
Subjt:  KNRLDYIHSDLWGP-----SRIATHGGARHLTAAGNPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHP

Query:  PCLKHLHVFGCAAYAHTR---QDKLQPRAKKCVFIGYSQGVKAY
        P  + L VFGCA Y   R   + KL+ ++K+C F+GYS    AY
Subjt:  PCLKHLHVFGCAAYAHTR---QDKLQPRAKKCVFIGYSQGVKAY

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-1951.22Show/hide
Query:  NRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKK
        NRTI+EKVR ML    LP+ F  +A +T V++IN+ PSTA+NF  P E+W    P   +L  FGC AY H  + KL+PRAKK
Subjt:  NRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGCCCTGGATATGGTATGAAGCACTCCGAATGACCAAGTACGGGGTCTCGCTGAGATGGAAAATGTTGTGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGG
GAGGCCCTCCAGCCACTTCTGTAATTTTCGACCACACAGACGCACAAGGAGCTCACGAGGACAATCGGGCAGAGGTAAGGCCACAAGACCGACCCAGGGGAAGCCGGACC
AAAGGGTCGGGCCAACATGGCCCGACCCATATGGTCGGCCTCGGAAAAAGGCCGAGGTCGAGCCCGGTGACCTCTTTTCGGTCCCTAATGCCCCGGGTCGCCTCAGTTCC
GCCTGGTTCGTCCCGAAACACCTCCGAATTCCTAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCTTACACCACGCCGGTGTGCAGCGGTTTTTGTTGGC
CTTGCAGGTCACGTCTTCCCCGATTTCTACAAATTTACTGTTGGCGTCACGTGAAGGTTTAAATATTGATTATAGGTCCCGTCCTGGAGGTGGTTGGACCGAACTCCCTG
TATGGCTGACTTGGGGTGAAAAAAACCGTCTTGATTACATCCATTCCGATCTTTGGGGACCCTCAAGGATTGCTACTCATGGTGGTGCAAGGCATCTCACCGCAGCTGGA
AATCCTCAGCAGAATGGTTTTGCTGAACGATATAACAGAACTATACTCGAAAAAGTGAGGTGTATGCTATCAAATTCTAAACTTCCAAGAATTTTTTGGGGAGAAGCTGT
GAGCACAGTTGTGTACTTGATAAACCGCAGCCCATCCACTGCTTTGAACTTCAAAACACCGCAGGAGATTTGGTCTGGTCATCCTCCTTGCCTGAAACACCTTCATGTCT
TTGGATGTGCAGCTTATGCACATACAAGACAAGACAAACTACAACCCCGAGCTAAGAAATGTGTTTTCATAGGCTACTCTCAAGGTGTTAAAGCTTACAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGCCCTGGATATGGTATGAAGCACTCCGAATGACCAAGTACGGGGTCTCGCTGAGATGGAAAATGTTGTGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGG
GAGGCCCTCCAGCCACTTCTGTAATTTTCGACCACACAGACGCACAAGGAGCTCACGAGGACAATCGGGCAGAGGTAAGGCCACAAGACCGACCCAGGGGAAGCCGGACC
AAAGGGTCGGGCCAACATGGCCCGACCCATATGGTCGGCCTCGGAAAAAGGCCGAGGTCGAGCCCGGTGACCTCTTTTCGGTCCCTAATGCCCCGGGTCGCCTCAGTTCC
GCCTGGTTCGTCCCGAAACACCTCCGAATTCCTAAAAACCCTAGGAGGACAAACAGGCATCGGAGGCGGTGTGGCTTACACCACGCCGGTGTGCAGCGGTTTTTGTTGGC
CTTGCAGGTCACGTCTTCCCCGATTTCTACAAATTTACTGTTGGCGTCACGTGAAGGTTTAAATATTGATTATAGGTCCCGTCCTGGAGGTGGTTGGACCGAACTCCCTG
TATGGCTGACTTGGGGTGAAAAAAACCGTCTTGATTACATCCATTCCGATCTTTGGGGACCCTCAAGGATTGCTACTCATGGTGGTGCAAGGCATCTCACCGCAGCTGGA
AATCCTCAGCAGAATGGTTTTGCTGAACGATATAACAGAACTATACTCGAAAAAGTGAGGTGTATGCTATCAAATTCTAAACTTCCAAGAATTTTTTGGGGAGAAGCTGT
GAGCACAGTTGTGTACTTGATAAACCGCAGCCCATCCACTGCTTTGAACTTCAAAACACCGCAGGAGATTTGGTCTGGTCATCCTCCTTGCCTGAAACACCTTCATGTCT
TTGGATGTGCAGCTTATGCACATACAAGACAAGACAAACTACAACCCCGAGCTAAGAAATGTGTTTTCATAGGCTACTCTCAAGGTGTTAAAGCTTACAAGCTCTAG
Protein sequenceShow/hide protein sequence
MEKPWIWYEALRMTKYGVSLRWKMLCSASSRGRGRPGRPSSHFCNFRPHRRTRSSRGQSGRGKATRPTQGKPDQRVGPTWPDPYGRPRKKAEVEPGDLFSVPNAPGRLSS
AWFVPKHLRIPKNPRRTNRHRRRCGLHHAGVQRFLLALQVTSSPISTNLLLASREGLNIDYRSRPGGGWTELPVWLTWGEKNRLDYIHSDLWGPSRIATHGGARHLTAAG
NPQQNGFAERYNRTILEKVRCMLSNSKLPRIFWGEAVSTVVYLINRSPSTALNFKTPQEIWSGHPPCLKHLHVFGCAAYAHTRQDKLQPRAKKCVFIGYSQGVKAYKL