; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000499 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000499
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:9000711..9005135
RNA-Seq ExpressionLag0000499
SyntenyLag0000499
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008289 - lipid binding (molecular function)
InterPro domainsIPR002913 - START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032986.1 putative copia-type polyprotein [Cucumis melo var. makuwa]3.5e-3786.6Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADI TKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

KAA0046907.1 T26F17.17 [Cucumis melo var. makuwa]7.1e-3887.63Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

KAA0047979.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.9e-3785.57Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TV ILQED TVIH+DNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

TYK12247.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-3888.54Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGV-LRKTCLREDV
        +TVGILQED TVIH+DNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV L+KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGV-LRKTCLREDV

TYK21979.1 putative copia-type polyprotein [Cucumis melo var. makuwa]3.5e-3786.6Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADI TKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

TrEMBL top hitse value%identityAlignment
A0A5A7SSL5 Putative copia-type polyprotein1.7e-3786.6Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADI TKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

A0A5A7TYB6 T26F17.173.5e-3887.63Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

A0A5A7U3F3 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-3785.57Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TV ILQED TVIH+DNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

A0A5D3CJX7 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3888.54Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGV-LRKTCLREDV
        +TVGILQED TVIH+DNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADIFTKPLKV+VF+ LRTLLGV L+KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGV-LRKTCLREDV

A0A5D3DF53 Putative copia-type polyprotein1.7e-3786.6Show/hide
Query:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV
        +TVGILQED TVIHVDNKSTIALAKNPVF+DRSKHIDTRFHFIRDCISRK++QVEYVKTEDQIADI TKPLKV+VF+ LRTLLGV   +KTCLREDV
Subjt:  ETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVL--RKTCLREDV

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.3e-1343.02Show/hide
Query:  TVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR
        ++ I  E+   I+ DN+  I++A NP  + R+KHID ++HF R+ +    I +EY+ TE+Q+ADIFTKPL    F +LR  LG+L+
Subjt:  TVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR

Q39123 Homeobox-leucine zipper protein ATHB-81.0e-1073.81Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLE
        NGPSMPP  +FVR E+LPSGYLIR CEGG SI+ IVD  DLE
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.5e-1041.3Show/hide
Query:  ICSSET-VGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR
        ICS  T +GI      VI+ DN     L  NPVF+ R KHI   +HFIR+ +    ++V +V T DQ+AD  TKPL    F    + +GV R
Subjt:  ICSSET-VGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-0940.22Show/hide
Query:  ICSSET-VGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR
        ICS  T +GI      VI+ DN     L  NPVF+ R KHI   +HFIR+ +    ++V +V T DQ+AD  TKPL    F      +GV++
Subjt:  ICSSET-VGILQEDATVIHVDNKSTIALAKNPVFNDRSKHIDTRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLR

Q9ZU11 Homeobox-leucine zipper protein ATHB-151.6e-1179.07Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA
        NGPSMP V+NFVR EML SGYLIR C+GG SII IVD MDLEA
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA

Arabidopsis top hitse value%identityAlignment
AT1G52150.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein1.1e-1279.07Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA
        NGPSMP V+NFVR EML SGYLIR C+GG SII IVD MDLEA
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA

AT1G52150.2 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein1.1e-1279.07Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA
        NGPSMP V+NFVR EML SGYLIR C+GG SII IVD MDLEA
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA

AT1G52150.3 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein1.1e-1279.07Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA
        NGPSMP V+NFVR EML SGYLIR C+GG SII IVD MDLEA
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA

AT2G34710.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein8.8e-1064.29Show/hide
Query:  GPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA
        GP+ PP  NFVR EM PSG+LIR C+GG SI+ IVD +DL+A
Subjt:  GPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLEA

AT4G32880.1 homeobox gene 87.2e-1273.81Show/hide
Query:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLE
        NGPSMPP  +FVR E+LPSGYLIR CEGG SI+ IVD  DLE
Subjt:  NGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCGATCTGGGTCGTTTTTCGGTGTTTCTCGAATTGAAGCCCCGGACAGCAAGCATAGGTTCTATTTTGGCCTGGAGAATCTCTTAGGGAGTTCTATAAACGTACA
AATCTCTACTGGCAAATGGCAACACATGAGCATACACAATATACTGGAAAGGTATGTGAAAGATCTCTTAACAATACTCATAATGGTCCAAGGTCATAATGGTCCAAGTA
TGCCTCCAGTAAAAAACTTTGTAAGAGTAGAAATGCTACCAAGTGGGTATTTGATAAGACTTTGTGAAGGTGGCAGTTCCATAATCGATATCGTTGATCGCATGGACTTA
GAAGCCAAAACCAAAGCAAAGGAGAAGCTCCTTGCCAAGGAAGCTGCTCAAAGGGTGCTAAGTGATCGTTTTGAGTTTAGATGGTTTCGTTTAGTTTTTGTCTGTATAAC
TTTGAATATGCAAGTACCCAAAAATGTTGTCGTTAGTAATGCCCTCTATATTATGTTGAATTTTGGAGACATTACAATTGCCGTCAACAAAGACCAAATTTGTAGTAGTG
AGACCGTTGGAATTTTGCAAGAGGATGCAACTGTGATCCATGTGGATAATAAGTCAACAATTGCTCTAGCCAAGAATCCAGTGTTTAATGATCGCAGTAAGCACATTGAT
ACAAGATTTCACTTTATTAGAGATTGCATTTCAAGAAAGAAGATTCAAGTAGAATATGTGAAGACAGAAGATCAAATTGCAGATATTTTCACAAAGCCACTTAAGGTCGA
TGTGTTTAGCAAGTTGAGAACTTTGCTCGGAGTTTTGAGAAAAACATGTTTAAGGGAGGATGTTGTAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCGATCTGGGTCGTTTTTCGGTGTTTCTCGAATTGAAGCCCCGGACAGCAAGCATAGGTTCTATTTTGGCCTGGAGAATCTCTTAGGGAGTTCTATAAACGTACA
AATCTCTACTGGCAAATGGCAACACATGAGCATACACAATATACTGGAAAGGTATGTGAAAGATCTCTTAACAATACTCATAATGGTCCAAGGTCATAATGGTCCAAGTA
TGCCTCCAGTAAAAAACTTTGTAAGAGTAGAAATGCTACCAAGTGGGTATTTGATAAGACTTTGTGAAGGTGGCAGTTCCATAATCGATATCGTTGATCGCATGGACTTA
GAAGCCAAAACCAAAGCAAAGGAGAAGCTCCTTGCCAAGGAAGCTGCTCAAAGGGTGCTAAGTGATCGTTTTGAGTTTAGATGGTTTCGTTTAGTTTTTGTCTGTATAAC
TTTGAATATGCAAGTACCCAAAAATGTTGTCGTTAGTAATGCCCTCTATATTATGTTGAATTTTGGAGACATTACAATTGCCGTCAACAAAGACCAAATTTGTAGTAGTG
AGACCGTTGGAATTTTGCAAGAGGATGCAACTGTGATCCATGTGGATAATAAGTCAACAATTGCTCTAGCCAAGAATCCAGTGTTTAATGATCGCAGTAAGCACATTGAT
ACAAGATTTCACTTTATTAGAGATTGCATTTCAAGAAAGAAGATTCAAGTAGAATATGTGAAGACAGAAGATCAAATTGCAGATATTTTCACAAAGCCACTTAAGGTCGA
TGTGTTTAGCAAGTTGAGAACTTTGCTCGGAGTTTTGAGAAAAACATGTTTAAGGGAGGATGTTGTAAGTTAA
Protein sequenceShow/hide protein sequence
MLRSGSFFGVSRIEAPDSKHRFYFGLENLLGSSINVQISTGKWQHMSIHNILERYVKDLLTILIMVQGHNGPSMPPVKNFVRVEMLPSGYLIRLCEGGSSIIDIVDRMDL
EAKTKAKEKLLAKEAAQRVLSDRFEFRWFRLVFVCITLNMQVPKNVVVSNALYIMLNFGDITIAVNKDQICSSETVGILQEDATVIHVDNKSTIALAKNPVFNDRSKHID
TRFHFIRDCISRKKIQVEYVKTEDQIADIFTKPLKVDVFSKLRTLLGVLRKTCLREDVVS