CuGenDBv2

Lag0038395 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038395
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:16611291..16617670
RNA-Seq Expression: Lag0038395
Synteny: Lag0038395
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
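
As a small illustration of how the genome location field can be used, the sketch below splits the location string into chromosome, start and end, and computes the length of the genomic span. The function names are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr2:16611291..16617670")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span covers 6380 positions on chr2.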


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0063422.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e-value: 1.4e-33; identity: 47.92%
Query:  TRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLT
        T +   F+HQA+    W++ M AE+  M+  NTW+IVPLP   H +GCKW+ K+KYH DG++ +YKARLV KGY QQE +DF++TFSPVAK+VTV+VLL+
Subjt:  TRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLT

Query:  MVVSYQWPLVQLDVNNAFCCNSEVVKIIDRIHLKILCKYTQSSN
        +  S+ W L Q+DVNNAF  +    ++   + +    K  Q+S+
Subjt:  MVVSYQWPLVQLDVNNAFCCNSEVVKIIDRIHLKILCKYTQSSN

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]; e-value: 4.5e-35; identity: 61.16%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W + M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+  DGSI RYK RLVAKGYTQQEGLD+ ETFS V K+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPL+QLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

TYK16758.1 Copia protein [Cucumis melo var. makuwa]; e-value: 4.3e-38; identity: 65.29%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W++ M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+ VDGSI RYKARLVAKGYTQQEGLD+ ETFSPVAK+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPLVQLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

TYK18103.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]; e-value: 4.5e-35; identity: 61.16%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W + M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+  DGSI RYK RLVAKGYTQQEGLD+ ETFS V K+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPL+QLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]; e-value: 3.2e-41; identity: 69.83%
Query:  FRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMV
        +   FYHQA+P+ HW++ M AEL AM+  +TW++VPLP + H+IGCKWI K+K+  DGSI RYKARLVAKGYTQQEGLD+IETFSPVAKLVTV+VLLT+ 
Subjt:  FRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMV

Query:  VSYQWPLVQLDVNNAF
        VS+ W LVQLDVNNAF
Subjt:  VSYQWPLVQLDVNNAF
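
The percent-identity figures in these listings can be related to the alignment blocks with a short sketch. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and uses only the final segment of the XP_022154919.1 alignment, so its result is not expected to equal the identity reported for the full alignment. The function name is hypothetical.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment segment.

    ASSUMES a BLAST-style midline: letters are identities, '+' marks a
    conservative substitution, and spaces are mismatches.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(midline)


# Last segment of the XP_022154919.1 alignment shown above.
query = "VSYQWPLVQLDVNNAF"
midline = "VS+ W LVQLDVNNAF"

identical, positives, length = identity_stats(query, midline)
percent_identity = 100.0 * identical / length
print(identical, positives, length, percent_identity)
```

Applying the same counting to every segment of a hit and dividing the total identities by the total aligned length would give a whole-alignment value comparable to the listed percentages.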

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7V587 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 6.9e-34; identity: 47.92%
Query:  TRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLT
        T +   F+HQA+    W++ M AE+  M+  NTW+IVPLP   H +GCKW+ K+KYH DG++ +YKARLV KGY QQE +DF++TFSPVAK+VTV+VLL+
Subjt:  TRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLT

Query:  MVVSYQWPLVQLDVNNAFCCNSEVVKIIDRIHLKILCKYTQSSN
        +  S+ W L Q+DVNNAF  +    ++   + +    K  Q+S+
Subjt:  MVVSYQWPLVQLDVNNAFCCNSEVVKIIDRIHLKILCKYTQSSN

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 8; e-value: 2.2e-35; identity: 61.16%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W + M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+  DGSI RYK RLVAKGYTQQEGLD+ ETFS V K+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPL+QLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

A0A5D3CZP1 Copia protein; e-value: 2.1e-38; identity: 65.29%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W++ M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+ VDGSI RYKARLVAKGYTQQEGLD+ ETFSPVAK+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPLVQLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 8; e-value: 2.2e-35; identity: 61.16%
Query:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV
        +  T+    FYH+A+    W + M AELEAM+   TW+IVPLP  K++IGC+W+ KIK+  DGSI RYK RLVAKGYTQQEGLD+ ETFS V K+VTV+ 
Subjt:  EARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRV

Query:  LLTMVVSYQWPLVQLDVNNAF
        LLT+ VS +WPL+QLDVNNAF
Subjt:  LLTMVVSYQWPLVQLDVNNAF

A0A6J1DNP7 uncharacterized protein LOC111022065; e-value: 1.5e-41; identity: 69.83%
Query:  FRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMV
        +   FYHQA+P+ HW++ M AEL AM+  +TW++VPLP + H+IGCKWI K+K+  DGSI RYKARLVAKGYTQQEGLD+IETFSPVAKLVTV+VLLT+ 
Subjt:  FRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMV

Query:  VSYQWPLVQLDVNNAF
        VS+ W LVQLDVNNAF
Subjt:  VSYQWPLVQLDVNNAF

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein; e-value: 6.1e-19; identity: 38.02%
Query:  WKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWPLVQLDVNN
        W++ +  EL A  + NTWTI   P +K+ +  +W+  +KY+  G+  RYKARLVA+G+TQ+  +D+ ETF+PVA++ + R +L++V+ Y   + Q+DV  
Subjt:  WKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWPLVQLDVNN

Query:  AFCCNSEVVKIIDRIHLKILC
        AF   +   +I  R+   I C
Subjt:  AFCCNSEVVKIIDRIHLKILC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 7.4e-17; identity: 42.86%
Query:  MCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWPLVQLDVNNAF
        M  E+E++    T+ +V LP  K  + CKW+ K+K   D  + RYKARLV KG+ Q++G+DF E FSPV K+ ++R +L++  S    + QLDV  AF
Subjt:  MCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWPLVQLDVNNAF

P92520 Uncharacterized mitochondrial protein AtMg00820; e-value: 4.7e-19; identity: 46.15%
Query:  ALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTM
        AL    W   M  EL+A+    TW +VP P++++ +GCKW+ K K H DG++ R KARLVAKG+ Q+EG+ F+ET+SPV +  T+R +L +
Subjt:  ALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e-value: 3.9e-26; identity: 52.73%
Query:  QALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTI-GCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWP
        QAL  + W++ M +E+ A    +TW +VP P    TI GC+WI   KY+ DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  ++R++L + V   WP
Subjt:  QALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTI-GCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQWP

Query:  LVQLDVNNAF
        + QLDVNNAF
Subjt:  LVQLDVNNAF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e-value: 6.7e-26; identity: 49.19%
Query:  HSEARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTI-GCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVT
        +SE RT        QA+  D W+  M +E+ A    +TW +VP P    TI GC+WI   K++ DGS+ RYKARLVAKGY Q+ GLD+ ETFSPV K  +
Subjt:  HSEARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTI-GCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVT

Query:  VRVLLTMVVSYQWPLVQLDVNNAF
        +R++L + V   WP+ QLDVNNAF
Subjt:  VRVLLTMVVSYQWPLVQLDVNNAF

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e-value: 2.7e-30; identity: 55.86%
Query:  YHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQW
        Y++A  +  W   M  E+ AM+  +TW I  LP +K  IGCKW+ KIKY+ DG+I RYKARLVAKGYTQQEG+DFIETFSPV KL +V+++L +   Y +
Subjt:  YHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTMVVSYQW

Query:  PLVQLDVNNAF
         L QLD++NAF
Subjt:  PLVQLDVNNAF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e-value: 3.3e-20; identity: 46.15%
Query:  ALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTM
        AL    W   M  EL+A+    TW +VP P++++ +GCKW+ K K H DG++ R KARLVAKG+ Q+EG+ F+ET+SPV +  T+R +L +
Subjt:  ALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIETFSPVAKLVTVRVLLTM


Sequences
CDS sequence
ATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTCGGCCCGGTTGCGCGGGCCGAGTCCGTTCGATGCCGTTTGGTCCCCACCGCGTTT
GACCGCCTCGGTTTCACCTGGTTTGACCTAAAACGCCTCCAAATCCCTAAAAACCCTAGGAGGAGGAGCAGGTATTTATATCCCTCTTCGCCACTGAAGAGGGGA
TCCCGAATTCTATCCCAAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCATACTTTTCCATGCCCTACCGTTTTGTTTGCTAACTTAAGCATCGGAGCC
GGTGTGGCGAGCACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGGATGG
AGAGAGAAGCAACCGAATGGAGAAGTTGAGGACGCGGGGCACTATGGCCATTTTTGGCCAAGCGAAAACCCGGAGATGGGTTTTTGGCCCATGAGTGAAGACCCA
TCGAAAAACCAGTTGTTTGCGACTGAGGGTGTTGTACGGACGAGGATCATGTGGTGCTGCTCGTGCCGACGCACAAACGATGTGGCTCGATTTGAAAGATCGTTT
TCAGAAATGCAATGGCTCGAGGATCTTTCATTTAAGGCGAGAATTGTCCACTCTGAAGCAAGAACAAGATTCCGTAACGATTTCTATCATCAAGCCTTGCCTTAT
GACCATTGGAAAGATGTTATGTGTGCTGAATTGGAAGCTATGGATGTTATTAACACTTGGACAATTGTTCCATTGCCTCTTGATAAGCATACTATTGGATGTAAA
TGGATCATTAAGATCAAGTATCATGTTGATGGATCGATTGGGAGATATAAAGCTCGTTTAGTCGCAAAAGGCTATACTCAGCAAGAAGGCCTCGACTTCATTGAA
ACTTTTTCTCCAGTTGCTAAATTAGTAACCGTGCGTGTCTTACTTACTATGGTTGTGTCTTATCAGTGGCCGCTTGTTCAACTAGACGTTAACAACGCCTTTTGT
TGCAATTCTGAAGTGGTTAAAATTATAGATAGAATTCACCTCAAAATACTGTGCAAGTATACACAATCTAGCAATAAGTGA
mRNA sequence
ATGGCCCGACCCATATGGTCGGCCTCGGCAAAAGGCCGAGGCCGACCATTCGGCCCGGTTGCGCGGGCCGAGTCCGTTCGATGCCGTTTGGTCCCCACCGCGTTT
GACCGCCTCGGTTTCACCTGGTTTGACCTAAAACGCCTCCAAATCCCTAAAAACCCTAGGAGGAGGAGCAGGTATTTATATCCCTCTTCGCCACTGAAGAGGGGA
TCCCGAATTCTATCCCAAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCATACTTTTCCATGCCCTACCGTTTTGTTTGCTAACTTAAGCATCGGAGCC
GGTGTGGCGAGCACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGGATGG
AGAGAGAAGCAACCGAATGGAGAAGTTGAGGACGCGGGGCACTATGGCCATTTTTGGCCAAGCGAAAACCCGGAGATGGGTTTTTGGCCCATGAGTGAAGACCCA
TCGAAAAACCAGTTGTTTGCGACTGAGGGTGTTGTACGGACGAGGATCATGTGGTGCTGCTCGTGCCGACGCACAAACGATGTGGCTCGATTTGAAAGATCGTTT
TCAGAAATGCAATGGCTCGAGGATCTTTCATTTAAGGCGAGAATTGTCCACTCTGAAGCAAGAACAAGATTCCGTAACGATTTCTATCATCAAGCCTTGCCTTAT
GACCATTGGAAAGATGTTATGTGTGCTGAATTGGAAGCTATGGATGTTATTAACACTTGGACAATTGTTCCATTGCCTCTTGATAAGCATACTATTGGATGTAAA
TGGATCATTAAGATCAAGTATCATGTTGATGGATCGATTGGGAGATATAAAGCTCGTTTAGTCGCAAAAGGCTATACTCAGCAAGAAGGCCTCGACTTCATTGAA
ACTTTTTCTCCAGTTGCTAAATTAGTAACCGTGCGTGTCTTACTTACTATGGTTGTGTCTTATCAGTGGCCGCTTGTTCAACTAGACGTTAACAACGCCTTTTGT
TGCAATTCTGAAGTGGTTAAAATTATAGATAGAATTCACCTCAAAATACTGTGCAAGTATACACAATCTAGCAATAAGTGA
Protein sequence
MARPIWSASAKGRGRPFGPVARAESVRCRLVPTAFDRLGFTWFDLKRLQIPKNPRRRSRYLYPSSPLKRGSRILSQNSTLYSLLSPLAHTFPCPTVLFANLSIGA
GVASTTPVCRFTVLQATSSPSTTNLPLVAREGQGWREKQPNGEVEDAGHYGHFWPSENPEMGFWPMSEDPSKNQLFATEGVVRTRIMWCCSCRRTNDVARFERSF
SEMQWLEDLSFKARIVHSEARTRFRNDFYHQALPYDHWKDVMCAELEAMDVINTWTIVPLPLDKHTIGCKWIIKIKYHVDGSIGRYKARLVAKGYTQQEGLDFIE
TFSPVAKLVTVRVLLTMVVSYQWPLVQLDVNNAFCCNSEVVKIIDRIHLKILCKYTQSSNK
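
The relationship between the CDS and the protein sequence can be checked with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the `translate` helper is hypothetical. Only the first and last few codons of the CDS above are translated here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
cds_start = "ATGGCCCGACCCATATGG"
cds_end = "AGCAATAAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the beginning (MARPIW) and end (SNK) of the protein sequence listed above; passing the full CDS to `translate` would allow the whole protein to be compared in the same way.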