CuGenDBv2

Lag0039111 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039111
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr2:36220437..36224827
RNA-Seq Expression: Lag0039111
Synteny: Lag0039111
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
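
The genome location in this record has the form `chromosome:start..end`. As a minimal sketch, the string can be split and the locus span computed as below. The helper name `parse_location` is hypothetical, and the code assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr2:36220437..36224827")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 4,391 bp on chr2.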


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-38 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

KAA0031924.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-38 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-38 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-38 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

TYK04171.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-38 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
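
The reported 89.36 % identity for the GenBank hits can be checked against the alignment itself. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that identity is computed over the ungapped alignment length; neither is stated in this record.

```python
# Query and midline lines copied from the first GenBank alignment.
query = ("MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQ"
         "NGVSERRNGTLLDMVRSMMSYAQLPSSFWGF")
midline = ("M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQ"
           "NGVSERRN TLLDMVRSMMSYAQLPSSFWG+")

# Assumption: letters on the midline are identities, '+' are positives.
identical = sum(ch.isalpha() for ch in midline)
positives = identical + midline.count("+")

# Assumption: the denominator is the aligned query length (no gaps here).
percent_identity = 100 * identical / len(query)
print(identical, positives, len(query), round(percent_identity, 2))
```

With these assumptions, 84 identical positions out of 94 aligned residues give 89.36 %, which agrees with the value listed for each hit.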

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A5A7SL69 Gag/pol protein | e-value: 8.3e-39 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

A0A5A7TZD0 Gag/pol protein | e-value: 8.3e-39 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

A0A5A7UYE8 Gag/pol protein | e-value: 8.3e-39 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

A0A5D3BUN8 Gag/pol protein | e-value: 8.3e-39 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

A0A5D3BWT8 Gag/pol protein | e-value: 8.3e-39 | identity: 89.36%
Query:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF
        M HKSEALEKFKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRN TLLDMVRSMMSYAQLPSSFWG+
Subjt:  MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGF

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein | e-value: 1.6e-10 | identity: 38.46%
Query:  HKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWG
        +KS+    F++F A+ E      +  L  D G EYL    + + ++ GI   L+ P TPQ NGVSER   T+ +  R+M+S A+L  SFWG
Subjt:  HKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 1.8e-19 | identity: 48.89%
Query:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWG
        K +  + F++F A VE   G+ +K LRSD GGEY  + F++Y   HGI+ + + PGTPQ NGV+ER N T+++ VRSM+  A+LP SFWG
Subjt:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWG

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e-value: 3.7e-04 | identity: 26.53%
Query:  LEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFSLPDISRLL
        L  F    A ++N     +  ++ DRG EY ++    +    GI +  +     + +GV+ER N TLL+  R+++  + LP+  W   FS  + S ++
Subjt:  LEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFSLPDISRLL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 3.2e-11 | identity: 34.75%
Query:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFS------
        KS+  E F  FK  +EN     I T  SD GGE++     +Y  +HGI    S P TP+ NG+SER++  +++   +++S+A +P ++W + F+      
Subjt:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFS------

Query:  --LPDISRLLSSPFPHLF
          LP     L SPF  LF
Subjt:  --LPDISRLLSSPFPHLF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 1.5e-13 | identity: 38.98%
Query:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFS------
        KS+  + F  FK+ VEN     I TL SD GGE++  R  DY+ +HGI    S P TP+ NG+SER++  +++M  +++S+A +P ++W + FS      
Subjt:  KSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFS------

Query:  --LPDISRLLSSPFPHLF
          LP     L SPF  LF
Subjt:  --LPDISRLLSSPFPHLF

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGCCATAAGTCTGAGGCTCTTGAAAAGTTCAAGGAGTTTAAGGCTGAGGTAGAAAACTTATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAGTA
TTTAGATCAGCGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAGCAAAATGGTGTATCAGAAAGGAGAAACGGAACCT
TGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCTAGCTCGTTTTGGGGCTTTACTTTTTCTCTCCCTGATATCTCTCGTCTTCTCTCTTCCCCCTTTCCT
CATCTCTTCACTTTCGTCTTCTCTCGTCTTCACGAATTGGAAGCTTACGGACGGCGACGGAAAGCTCATCGACGGCGACTTGAAGCAGTACATCTCCGGCGCGGTTTCCA
TTCTCTCGTCTTCCAAACGCATGGGGCCGATAAAGGAGACTCACTAAGAGTGGCATACTTGCACCTTTTCGAACAGTTGAATGGGGGAGCTGTTGTGGGAGAAATCAGTA
GAAAGACTATGCAAACAGCGGTCGAAGCCTTTGGTGGTGAAGGATATTCAGATGGTATAAATGTTCCTGCTATTGATTTTTCAATTAGCGAGTATCGATTGAACACGATG
ACGATATCCTGTGGAAGCCGGAACATCAGAGAGACTAACGAAGAAATTGCAGATAGGGGAGTGAAGTTTCTTAAGTGTTTCTGTTCTACCCAAGGGGGGAAGCCATTAGG
TTTGGGATTTACCATAACAGATGATGAAATGAATATTCTTGAAAAGAAGTTCGAAGATGAACTTGGGAAAGAGCATTATTTAAGGCCTCATGCTCTGGTAATTTCCTCAA
TGCTCTGTTTCTCGTTCTGTTCTTCATTTACTCACAAACTTACAACATTAACTTCTCCATTGTCAATGCAAATTACTGCACACCAAGTGTTCTATAATCCTCTAGATGTT
GTTGGTTATTAG
mRNA sequence
ATGGGCCATAAGTCTGAGGCTCTTGAAAAGTTCAAGGAGTTTAAGGCTGAGGTAGAAAACTTATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAGTA
TTTAGATCAGCGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAGCAAAATGGTGTATCAGAAAGGAGAAACGGAACCT
TGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCTAGCTCGTTTTGGGGCTTTACTTTTTCTCTCCCTGATATCTCTCGTCTTCTCTCTTCCCCCTTTCCT
CATCTCTTCACTTTCGTCTTCTCTCGTCTTCACGAATTGGAAGCTTACGGACGGCGACGGAAAGCTCATCGACGGCGACTTGAAGCAGTACATCTCCGGCGCGGTTTCCA
TTCTCTCGTCTTCCAAACGCATGGGGCCGATAAAGGAGACTCACTAAGAGTGGCATACTTGCACCTTTTCGAACAGTTGAATGGGGGAGCTGTTGTGGGAGAAATCAGTA
GAAAGACTATGCAAACAGCGGTCGAAGCCTTTGGTGGTGAAGGATATTCAGATGGTATAAATGTTCCTGCTATTGATTTTTCAATTAGCGAGTATCGATTGAACACGATG
ACGATATCCTGTGGAAGCCGGAACATCAGAGAGACTAACGAAGAAATTGCAGATAGGGGAGTGAAGTTTCTTAAGTGTTTCTGTTCTACCCAAGGGGGGAAGCCATTAGG
TTTGGGATTTACCATAACAGATGATGAAATGAATATTCTTGAAAAGAAGTTCGAAGATGAACTTGGGAAAGAGCATTATTTAAGGCCTCATGCTCTGGTAATTTCCTCAA
TGCTCTGTTTCTCGTTCTGTTCTTCATTTACTCACAAACTTACAACATTAACTTCTCCATTGTCAATGCAAATTACTGCACACCAAGTGTTCTATAATCCTCTAGATGTT
GTTGGTTATTAG
Protein sequence
MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNGTLLDMVRSMMSYAQLPSSFWGFTFSLPDISRLLSSPFP
HLFTFVFSRLHELEAYGRRRKAHRRRLEAVHLRRGFHSLVFQTHGADKGDSLRVAYLHLFEQLNGGAVVGEISRKTMQTAVEAFGGEGYSDGINVPAIDFSISEYRLNTM
TISCGSRNIRETNEEIADRGVKFLKCFCSTQGGKPLGLGFTITDDEMNILEKKFEDELGKEHYLRPHALVISSMLCFSFCSSFTHKLTTLTSPLSMQITAHQVFYNPLDV
VGY
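
The relationship between the CDS and protein listings can be spot-checked by translating the two ends of the CDS. This is a sketch only: it assumes the standard genetic code (NCBI translation table 1), `translate` is a hypothetical helper, and just the first 108 nt and the last 12 nt of the CDS are copied in, with the tail assumed to start on a codon boundary.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nucleotides):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    usable = len(nucleotides) - len(nucleotides) % 3  # drop a trailing partial codon
    return "".join(CODON_TABLE[nucleotides[i:i + 3]] for i in range(0, usable, 3))

# First 108 nt of the CDS (36 codons) and the final 12 nt, copied from the record.
cds_head = ("ATGGGCCATAAGTCTGAGGCTCTTGAAAAGTTCAAGGAGTTTAAGGCTGAGGTAGAAAAC"
            "TTATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAG")
cds_tail = "GTTGGTTATTAG"

head_protein = translate(cds_head)
tail_protein = translate(cds_tail)
print(head_protein)
print(tail_protein)
```

The head translates to the first 36 residues of the protein listing (`MGHKSEALEKFKEFKAEVENLLGKTIKTLRSDRGGE`), and the tail gives `VGY` followed by a stop codon, matching the end of the protein.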