CuGenDBv2

Lag0032245 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032245
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr11:28374275..28379776
RNA-Seq Expression: Lag0032245
Synteny: Lag0032245
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
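
The genome location above can be split into its parts to get the length of the gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1


chrom, start, end = parse_location("chr11:28374275..28379776")
print(chrom, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span is longer than the CDS listed further down, which would be consistent with the gene model containing introns.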


Homology
GenBank top hits (e value, % identity, alignment)
XP_022132680.1 uncharacterized protein LOC111005480 [Momordica charantia]; e value 5.6e-24; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +FT+G G + VALLVYV++I +T ASS  +  LKTHL   FKLKDLG+L YFLGLE+A SS GI LSQR YAL L ED GLLASKP++LPMDP  KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

XP_022155859.1 uncharacterized protein LOC111022877 isoform X1 [Momordica charantia]; e value 4.8e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

XP_022155861.1 uncharacterized protein LOC111022877 isoform X2 [Momordica charantia]; e value 4.8e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

XP_022155863.1 uncharacterized protein LOC111022877 isoform X4 [Momordica charantia]; e value 4.8e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

XP_022899321.1 uncharacterized protein LOC111412620 [Olea europaea var. sylvestris]; e value 1.5e-24; identity 63.11%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKL-TF
        +FTKG G++ +ALLVYV++I IT +S   I  LK  LH  FKLKDLGNL YFL LEIA S KGIFLSQR+Y LQL+ED G LASKP++LPMDP  KL ++
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKL-TF

Query:  DGE
        +G+
Subjt:  DGE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BT54 uncharacterized protein LOC111005480; e value 2.7e-24; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +FT+G G + VALLVYV++I +T ASS  +  LKTHL   FKLKDLG+L YFLGLE+A SS GI LSQR YAL L ED GLLASKP++LPMDP  KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

A0A6J1DNK9 uncharacterized protein LOC111022877 isoform X1; e value 2.3e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

A0A6J1DP23 uncharacterized protein LOC111022877 isoform X4; e value 2.3e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

A0A6J1DQI6 uncharacterized protein LOC111022877 isoform X3; e value 2.3e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

A0A6J1DSZ9 uncharacterized protein LOC111022877 isoform X2; e value 2.3e-23; identity 65.31%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F +G G+S VALLV V++I +T ASSS I SLK HL++ FKLKDLG L YFLGLE+  SS  IFLSQR YALQL+ED G LA+KP  LPMDPN KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 1.8e-04; identity 32.26%
Query:  SSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIA--SSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        ++ + LL+YV+++ I       I  LK  L  +F +KDLG     LG++I    +S+ ++LSQ KY  +++E   +  +KP S P+  + KL+
Subjt:  SSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIA--SSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

P92519 Uncharacterized mitochondrial protein AtMg00810; e value 2.5e-11; identity 44.3%
Query:  LLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPM
        LL+YV++I +T +S++ +  L   L  TF +KDLG + YFLG++I +   G+FLSQ KYA Q++ + G+L  KP S P+
Subjt:  LLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.4e-09; identity 36.73%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F    G S+V +LVYV++I IT    + + +   +L   F +KD   L YFLG+E      G+ LSQR+Y L L+    ++ +KP + PM P+ KL+
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 6.9e-09; identity 36.73%
Query:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT
        +F    G S++ +LVYV++I IT   +  +K     L   F +K+  +L YFLG+E     +G+ LSQR+Y L L+    +L +KP + PM  + KLT
Subjt:  MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLT

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value 3.8e-18; identity 50.54%
Query:  FTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPN
        F K   +  + +LVYV++I I   + + +  LK+ L   FKL+DLG L YFLGLEIA S+ GI + QRKYAL L+++ GLL  KPSS+PMDP+
Subjt:  FTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPN

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value 1.8e-12; identity 44.3%
Query:  LLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPM
        LL+YV++I +T +S++ +  L   L  TF +KDLG + YFLG++I +   G+FLSQ KYA Q++ + G+L  KP S P+
Subjt:  LLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPM
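
The % identity values reported for the hits above can be recomputed from the middle line of each alignment block. The following is a minimal sketch, assuming the standard BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap) and assuming the denominator is the aligned length of the query line. The function name `percent_identity` and the toy strings are illustrative only and are not taken from the hits above.

```python
def percent_identity(query_line, midline):
    """Percent identity of one BLAST-style alignment block.

    query_line: the aligned query residues (gaps included as '-').
    midline:    the match line printed between query and subject.
    Assumption: only alphabetic characters in the midline are identities.
    """
    if not query_line:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query_line)


# Toy block: 3 identities (F, T, G) over 6 aligned columns.
toy_query = "MFTKGV"
toy_midline = "+FT+G "
print(round(percent_identity(toy_query, toy_midline), 2))
```

Using the query line for the length, rather than the midline, guards against trailing spaces in the midline being lost when the page is copied as text.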


Sequences
CDS sequence
ATGTTTACAAAAGGAGTTGGTTCTTCCTTGGTTGCCCTCCTTGTATATGTTAATAATATAAGAATCACAAGAGCTTCATCCTCTAATATTAAAAGTTTGAAAACACATCT
TCATGACACTTTTAAGCTCAAGGATTTGGGAAATTTATGTTACTTCTTGGGACTTGAAATTGCTAGTTCCTCCAAGGGTATTTTTCTTTCTCAAAGAAAATATGCCCTTC
AATTAGTTGAAGATGTTGGTCTTTTGGCTTCTAAGCCTTCATCTCTGCCTATGGATCCAAATAACAAGCTTACTTTTGATGGAGAAAATACAGCAAACAAGAAAGTTTCC
TTTGCTAGAAAATCTCCCAAAAACGTGAACAACACCTCAAGGAAACACCGGTATCCTCATAGGACTTTAATCCCTGCACTAACGATGTTACAACTAGTGGTTAACTATTT
TGCGGTCCGGTCTTATGCAAACTCATTGCATAGGATACCCCCACTCACATGTCCACTACATGAACGTGTTGGATCATTGCGTTTGTATCACAATACAAAGCGGGCTGCAA
TGTCGGGAGCCAATCACGTTAAATTGGTGTTAATTCGGGCCACTTATGGAGTTTTGGAGCCATCTTGGCGTCTTGGATGGCGAGAAGGTGAAAAGGCCCAAATAGCCACT
AGGTTGAGGCAAGACAATCTCTACGGAGTGACGACCCTTAATAGGCCTAGGATGCAGGATAAAGATTATGCTGCTGGGCGACTGGAGGAAGCAAATTTTGTGCTGCAGCA
AAACTGGGAACAGAACTGCCACATCACAGCTCGTTAG
mRNA sequence
ATGTTTACAAAAGGAGTTGGTTCTTCCTTGGTTGCCCTCCTTGTATATGTTAATAATATAAGAATCACAAGAGCTTCATCCTCTAATATTAAAAGTTTGAAAACACATCT
TCATGACACTTTTAAGCTCAAGGATTTGGGAAATTTATGTTACTTCTTGGGACTTGAAATTGCTAGTTCCTCCAAGGGTATTTTTCTTTCTCAAAGAAAATATGCCCTTC
AATTAGTTGAAGATGTTGGTCTTTTGGCTTCTAAGCCTTCATCTCTGCCTATGGATCCAAATAACAAGCTTACTTTTGATGGAGAAAATACAGCAAACAAGAAAGTTTCC
TTTGCTAGAAAATCTCCCAAAAACGTGAACAACACCTCAAGGAAACACCGGTATCCTCATAGGACTTTAATCCCTGCACTAACGATGTTACAACTAGTGGTTAACTATTT
TGCGGTCCGGTCTTATGCAAACTCATTGCATAGGATACCCCCACTCACATGTCCACTACATGAACGTGTTGGATCATTGCGTTTGTATCACAATACAAAGCGGGCTGCAA
TGTCGGGAGCCAATCACGTTAAATTGGTGTTAATTCGGGCCACTTATGGAGTTTTGGAGCCATCTTGGCGTCTTGGATGGCGAGAAGGTGAAAAGGCCCAAATAGCCACT
AGGTTGAGGCAAGACAATCTCTACGGAGTGACGACCCTTAATAGGCCTAGGATGCAGGATAAAGATTATGCTGCTGGGCGACTGGAGGAAGCAAATTTTGTGCTGCAGCA
AAACTGGGAACAGAACTGCCACATCACAGCTCGTTAG
Protein sequence
MFTKGVGSSLVALLVYVNNIRITRASSSNIKSLKTHLHDTFKLKDLGNLCYFLGLEIASSSKGIFLSQRKYALQLVEDVGLLASKPSSLPMDPNNKLTFDGENTANKKVS
FARKSPKNVNNTSRKHRYPHRTLIPALTMLQLVVNYFAVRSYANSLHRIPPLTCPLHERVGSLRLYHNTKRAAMSGANHVKLVLIRATYGVLEPSWRLGWREGEKAQIAT
RLRQDNLYGVTTLNRPRMQDKDYAAGRLEEANFVLQQNWEQNCHITAR
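
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used. The function name `translate` is hypothetical, and the two test fragments are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTTTACAAAAGGAGTTGGTTCTTCCTTG"  # first 10 codons of the CDS
cds_end = "CACATCACAGCTCGTTAG"               # last 6 codons, including the stop
print(translate(cds_start))  # MFTKGVGSSL
print(translate(cds_end))    # HITAR
```

To check the whole record, join the CDS lines into one string, pass it to `translate`, and compare the result with the joined protein lines.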