; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011665 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011665
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr1:30163484..30167491
RNA-Seq ExpressionLag0011665
SyntenyLag0011665
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063766.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

KAF2369249.1 hypothetical protein BSL88_16760, partial [Acinetobacter baylyi]2.5e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVDPPEGVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT  +G
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

TYK05518.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

WP_163093664.1 hypothetical protein, partial [Acinetobacter baylyi]2.5e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVDPPEGVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT  +G
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

WP_216072462.1 hypothetical protein, partial [Acinetobacter baylyi]2.5e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVDPPEGVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT  +G
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.0e-1888.46Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT ++G
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

A0A5A7UH21 Gag/pol protein1.0e-1888.46Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD P+GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

A0A5A7VBG2 Gag/pol protein4.5e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

A0A5D3C2T5 Gag/pol protein4.5e-1990.38Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

A0A5D3CU90 Gag/pol protein1.0e-1888.46Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        MYFN VWELVD P+GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT  KG
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.5e-0744.9Show/hide
Query:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        N  ++LV+ P+G +P+ CKW++K K+D   K+  +KARLV KG+  +KG
Subjt:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

P92520 Uncharacterized mitochondrial protein AtMg008201.5e-0646.94Show/hide
Query:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        N+ W LV PP     +GCKW++K K  + G +   KARLVAKG+  E+G
Subjt:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

Q8L748 Nuclear pore complex protein NUP1077.2e-0660.98Show/hide
Query:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS
        ++ S + H  +E+K MRQKAQ L  EAASWSLLW LYGKG+
Subjt:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-0646Show/hide
Query:  NQVWELVDPPEG-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        N  W+LV PP   V  +GC+WI+ +K ++ G +  +KARLVAKGY    G
Subjt:  NQVWELVDPPEG-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-0646Show/hide
Query:  NQVWELV-DPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        N  W+LV  PP  V  +GC+WI+ +K ++ G +  +KARLVAKGY    G
Subjt:  NQVWELV-DPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG

Arabidopsis top hitse value%identityAlignment
AT3G14120.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: transport; LOCATED IN: nuclear pore; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear pore protein 84/107 (InterPro:IPR007252); Has 5399 Blast hits to 5001 proteins in 612 species: Archae - 19; Bacteria - 730; Metazoa - 2186; Fungi - 823; Plants - 382; Viruses - 37; Other Eukaryotes - 1222 (source: NCBI BLink).5.1e-0760.98Show/hide
Query:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS
        ++ S + H  +E+K MRQKAQ L  EAASWSLLW LYGKG+
Subjt:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS

AT3G14120.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: transport; LOCATED IN: nuclear pore; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear pore protein 84/107 (InterPro:IPR007252); Has 271 Blast hits to 268 proteins in 107 species: Archae - 0; Bacteria - 0; Metazoa - 138; Fungi - 69; Plants - 52; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink).5.1e-0760.98Show/hide
Query:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS
        ++ S + H  +E+K MRQKAQ L  EAASWSLLW LYGKG+
Subjt:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS

AT3G14120.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: transport; LOCATED IN: nuclear pore; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear pore protein 84/107 (InterPro:IPR007252).5.1e-0760.98Show/hide
Query:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS
        ++ S + H  +E+K MRQKAQ L  EAASWSLLW LYGKG+
Subjt:  KFYSILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-1040Show/hide
Query:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKGWTTTTNLTFIDAWLFRFVDACIKKTSIKRRSATFSSLQKRSLH
        M     WE+   P   KPIGCKW+YK K ++ G ++ +KARLVAKGYT ++G      + FI+     F   C K TS+K   A  S++   +LH
Subjt:  MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKGWTTTTNLTFIDAWLFRFVDACIKKTSIKRRSATFSSLQKRSLH

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.0e-0746.94Show/hide
Query:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG
        N+ W LV PP     +GCKW++K K  + G +   KARLVAKG+  E+G
Subjt:  NQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATTGGGTGTAAATGGATCTATAAGAGGAAAAGAGATGCTGCTGGGAAAGTACAGAC
TTTCAAAGCCAGACTCGTAGCAAAGGGTTATACCAACGAGAAGGGGTGGACCACTACTACAAATTTGACATTTATTGACGCTTGGCTTTTTAGATTTGTTGACGCTTGTA
TCAAGAAAACGTCAATAAAGCGTCGTTCAGCCACGTTCAGCTCTCTCCAGAAACGTTCTCTCCACACTGGTTCGTCTTTGCGCTGTTTTCGTTTTCGCGCAGCCACATAC
CGTTCTCGTCAGCCCGCCAGCCGTTTCATTTTCGCATCGTTTTCGTCTTCGTGCAGCCACACGCCGTTCTCGTTAGCCCACCGGTTACACGCCGTTCGCACAGCTAGGTT
CGTTCAGCCCGTCTTCTTCGAAAAGACCTGCCATTCAACTCCTTTTGAACGTTTTAGATTCACTGCTTTTGATTTCTCTGCTTTGATGGGTGTCATTGGTTCATTGCTTA
TGGCATTTGGGTTCGTATATTTGGCTATGGAATATAGCTTCCTCACAGCCTTTGGGATGACCAGTTTACTATGTTTATCTGCTTTTGACCAGTTGCTTGGAAAATTCTAT
TCCATCCTATTGCACCACTTTATAGAAGAGAAATGGATGAGGCAGAAGGCTCAATTTCTTGAAGATGAAGCTGCCTCTTGGTCCCTCTTGTGGTGCCTTTATGGAAAAGG
AAGTATGATAGGAATGTACCTAAGGAGCGGTTCGATTCTTTCTAACATGGAGTTTGACATGGAGTTAGAGAATGTTAGCAAGAAGATGAGAGAGTTGTGGAATATCTTCA
GCGAAGCAACTACGAAGTTGGAACGAAGCGTGGGAAAAATAAAGCAGAATTTCGAAGAATTGACAGAAGAGTTTTCAGCAATGAAAAGAGAGTTTAGATTGGCCAAAGAA
AAGGGGCAGAAATTGAGAAATCGATACAAGGCCAGAAAAAAAAAACAAAGTTGTCCAAGACTACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATTGGGTGTAAATGGATCTATAAGAGGAAAAGAGATGCTGCTGGGAAAGTACAGAC
TTTCAAAGCCAGACTCGTAGCAAAGGGTTATACCAACGAGAAGGGGTGGACCACTACTACAAATTTGACATTTATTGACGCTTGGCTTTTTAGATTTGTTGACGCTTGTA
TCAAGAAAACGTCAATAAAGCGTCGTTCAGCCACGTTCAGCTCTCTCCAGAAACGTTCTCTCCACACTGGTTCGTCTTTGCGCTGTTTTCGTTTTCGCGCAGCCACATAC
CGTTCTCGTCAGCCCGCCAGCCGTTTCATTTTCGCATCGTTTTCGTCTTCGTGCAGCCACACGCCGTTCTCGTTAGCCCACCGGTTACACGCCGTTCGCACAGCTAGGTT
CGTTCAGCCCGTCTTCTTCGAAAAGACCTGCCATTCAACTCCTTTTGAACGTTTTAGATTCACTGCTTTTGATTTCTCTGCTTTGATGGGTGTCATTGGTTCATTGCTTA
TGGCATTTGGGTTCGTATATTTGGCTATGGAATATAGCTTCCTCACAGCCTTTGGGATGACCAGTTTACTATGTTTATCTGCTTTTGACCAGTTGCTTGGAAAATTCTAT
TCCATCCTATTGCACCACTTTATAGAAGAGAAATGGATGAGGCAGAAGGCTCAATTTCTTGAAGATGAAGCTGCCTCTTGGTCCCTCTTGTGGTGCCTTTATGGAAAAGG
AAGTATGATAGGAATGTACCTAAGGAGCGGTTCGATTCTTTCTAACATGGAGTTTGACATGGAGTTAGAGAATGTTAGCAAGAAGATGAGAGAGTTGTGGAATATCTTCA
GCGAAGCAACTACGAAGTTGGAACGAAGCGTGGGAAAAATAAAGCAGAATTTCGAAGAATTGACAGAAGAGTTTTCAGCAATGAAAAGAGAGTTTAGATTGGCCAAAGAA
AAGGGGCAGAAATTGAGAAATCGATACAAGGCCAGAAAAAAAAAACAAAGTTGTCCAAGACTACCATGA
Protein sequenceShow/hide protein sequence
MYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTNEKGWTTTTNLTFIDAWLFRFVDACIKKTSIKRRSATFSSLQKRSLHTGSSLRCFRFRAATY
RSRQPASRFIFASFSSSCSHTPFSLAHRLHAVRTARFVQPVFFEKTCHSTPFERFRFTAFDFSALMGVIGSLLMAFGFVYLAMEYSFLTAFGMTSLLCLSAFDQLLGKFY
SILLHHFIEEKWMRQKAQFLEDEAASWSLLWCLYGKGSMIGMYLRSGSILSNMEFDMELENVSKKMRELWNIFSEATTKLERSVGKIKQNFEELTEEFSAMKREFRLAKE
KGQKLRNRYKARKKKQSCPRLP