; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010655 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010655
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr1:3224551..3225387
RNA-Seq ExpressionLag0010655
SyntenyLag0010655
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]3.0e-3540Show/hide
Query:  HPKANPPIPPSVPHVVLSLEALQALIDN----------------AIQNYLQ------HPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT
        H  A+P        V L   ALQALIDN                A+Q+  Q         PP+F+G SE    VEEW  ELEAL+ YLG  DQ +V GA 
Subjt:  HPKANPPIPPSVPHVVLSLEALQALIDN----------------AIQNYLQ------HPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT

Query:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL
         +LR    NWW++V   ED ++ P++WT  K L+ D +   FP    +++E EFL L Q ++ V QYE++F E S F   LI TE R+I RF+ GL K +
Subjt:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL

Query:  RALVRQGQPTTYAAALASVIAWDSEV-PRMEQFQEVGTSSGVKRK
        +  +   +PTTYA A+   +  D +V  + +  Q+VG SSGVKRK
Subjt:  RALVRQGQPTTYAAALASVIAWDSEV-PRMEQFQEVGTSSGVKRK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.3e-3849.19Show/hide
Query:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQG
        PP+FDG SE    VEEW  ELEAL+ YLG +DQ +V GA  +LR    NWW+ V   ED ++ P+ W RFK L+ D +   +P    + +EAEFL LVQG
Subjt:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQG

Query:  SMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRK
        ++SV QYER+F ELS F   LI TE  +I RF+ GLRK +R  V   +PTTYA A+   +  D +V  +     EVG+SSGVKRK
Subjt:  SMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]6.1e-3639.75Show/hide
Query:  PKANPPIPPSV-PHVVLSLEALQALIDNAI---QNYLQHP-------------------DPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT
        P+A P   P V P V L  EALQ L+ NA       +Q P                    PP F+G+SE     EEW  ELEAL+ YLG  D  +V GA 
Subjt:  PKANPPIPPSV-PHVVLSLEALQALIDNAI---QNYLQHP-------------------DPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT

Query:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL
         +LR    NWW  V   ED ++ P++W RFK L+ + +   FP     ++  EFL L QGS++V QYER+F ELS F    + TE  +I +FI+GLR+E+
Subjt:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL

Query:  RALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK
        + L+   +PTTYAAA+   +  D  +   +  Q +G++SGVKRK
Subjt:  RALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]2.3e-3545.18Show/hide
Query:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVED-REAEFLLLVQ
        PP+F G SE     EEW  ELEAL+ YLG +DQ +V GA  +LR    NWW+ V   ED ++ P+ W RFK L+ D +      + VED +E EFL LVQ
Subjt:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVED-REAEFLLLVQ

Query:  GSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRKHEELDPAFGSQ
        G+++V QYER+F ELS F   LI TE  +I RF+ GL K +R  V   +P TYA A+   +  D +V  R++   EVG+S GVKRK   + P +  Q
Subjt:  GSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRKHEELDPAFGSQ

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]3.3e-3740.82Show/hide
Query:  PKANPPIPPSV-PHVVLSLEALQALIDNA-----------------------IQNYLQHPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGA
        P+A P   P V P V L  EALQ L+DNA                       I+++ +   PP F+G+SE     EEW  ELEAL+ YLG  D  +V GA
Subjt:  PKANPPIPPSV-PHVVLSLEALQALIDNA-----------------------IQNYLQHPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGA

Query:  TSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKE
          +LR    NWW  V   ED ++ P++W RFK L+ + +   FP     ++ AEFL L QGS++V QYER+F ELS F    I TE  +I +FI+GLR E
Subjt:  TSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKE

Query:  LRALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK
        ++ L+   +PTTYAAA+   +  D  +   +  Q +G+SSGVKRK
Subjt:  LRALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196031.5e-3540Show/hide
Query:  HPKANPPIPPSVPHVVLSLEALQALIDN----------------AIQNYLQ------HPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT
        H  A+P        V L   ALQALIDN                A+Q+  Q         PP+F+G SE    VEEW  ELEAL+ YLG  DQ +V GA 
Subjt:  HPKANPPIPPSVPHVVLSLEALQALIDN----------------AIQNYLQ------HPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT

Query:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL
         +LR    NWW++V   ED ++ P++WT  K L+ D +   FP    +++E EFL L Q ++ V QYE++F E S F   LI TE R+I RF+ GL K +
Subjt:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL

Query:  RALVRQGQPTTYAAALASVIAWDSEV-PRMEQFQEVGTSSGVKRK
        +  +   +PTTYA A+   +  D +V  + +  Q+VG SSGVKRK
Subjt:  RALVRQGQPTTYAAALASVIAWDSEV-PRMEQFQEVGTSSGVKRK

A0A6J1DQB9 Reverse transcriptase3.0e-3639.75Show/hide
Query:  PKANPPIPPSV-PHVVLSLEALQALIDNAI---QNYLQHP-------------------DPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT
        P+A P   P V P V L  EALQ L+ NA       +Q P                    PP F+G+SE     EEW  ELEAL+ YLG  D  +V GA 
Subjt:  PKANPPIPPSV-PHVVLSLEALQALIDNAI---QNYLQHP-------------------DPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGAT

Query:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL
         +LR    NWW  V   ED ++ P++W RFK L+ + +   FP     ++  EFL L QGS++V QYER+F ELS F    + TE  +I +FI+GLR+E+
Subjt:  SLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKEL

Query:  RALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK
        + L+   +PTTYAAA+   +  D  +   +  Q +G++SGVKRK
Subjt:  RALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK

A0A6J1DTA8 uncharacterized protein LOC1110241141.6e-3740.82Show/hide
Query:  PKANPPIPPSV-PHVVLSLEALQALIDNA-----------------------IQNYLQHPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGA
        P+A P   P V P V L  EALQ L+DNA                       I+++ +   PP F+G+SE     EEW  ELEAL+ YLG  D  +V GA
Subjt:  PKANPPIPPSV-PHVVLSLEALQALIDNA-----------------------IQNYLQHPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGA

Query:  TSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKE
          +LR    NWW  V   ED ++ P++W RFK L+ + +   FP     ++ AEFL L QGS++V QYER+F ELS F    I TE  +I +FI+GLR E
Subjt:  TSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKE

Query:  LRALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK
        ++ L+   +PTTYAAA+   +  D  +   +  Q +G+SSGVKRK
Subjt:  LRALVRQGQPTTYAAALASVIAWDSEVPRMEQFQEVGTSSGVKRK

A0A6J1DUM2 uncharacterized protein LOC1110232476.4e-3949.19Show/hide
Query:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQG
        PP+FDG SE    VEEW  ELEAL+ YLG +DQ +V GA  +LR    NWW+ V   ED ++ P+ W RFK L+ D +   +P    + +EAEFL LVQG
Subjt:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQG

Query:  SMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRK
        ++SV QYER+F ELS F   LI TE  +I RF+ GLRK +R  V   +PTTYA A+   +  D +V  +     EVG+SSGVKRK
Subjt:  SMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRK

A0A6J1DVA0 uncharacterized protein LOC1110234241.1e-3545.18Show/hide
Query:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVED-REAEFLLLVQ
        PP+F G SE     EEW  ELEAL+ YLG +DQ +V GA  +LR    NWW+ V   ED ++ P+ W RFK L+ D +      + VED +E EFL LVQ
Subjt:  PPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNWWNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVED-REAEFLLLVQ

Query:  GSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRKHEELDPAFGSQ
        G+++V QYER+F ELS F   LI TE  +I RF+ GL K +R  V   +P TYA A+   +  D +V  R++   EVG+S GVKRK   + P +  Q
Subjt:  GSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVIAWDSEVP-RMEQFQEVGTSSGVKRKHEELDPAFGSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTAGTGGGAGTCGAGATAGTCATCGTTCTGACTCTTTGGGTGTCCCTGAAGTTGAATTGCACCCTAAGGCGAATCCCCCTATTCCTCCTTCGGTGCCTCATGT
AGTTTTGTCATTGGAAGCGTTGCAAGCTTTGATTGACAATGCAATCCAAAACTATCTGCAGCACCCTGATCCTCCATCATTCGATGGCCTGTCTGAATGTTTAAGAGATG
TCGAAGAGTGGACCTCAGAGCTGGAAGCCCTATTCCATTACCTTGGAGCTGATGACCAACAACGTGTCTTAGGAGCTACCTCTTTGCTCAGAGTTCACACACGCAACTGG
TGGAATTTAGTGGGCCAGCATGAGGATCGCTCCGACAATCCCTTGTCATGGACAAGATTCAAGGGTCTTATGCGAGACCAGTTTGTCGAACGCTTCCCTAGTGATGAAGT
TGAAGATCGGGAAGCGGAGTTTCTATTGCTCGTGCAGGGGAGCATGTCTGTAGTGCAATACGAAAGAAGATTTGCAGAGTTGTCTCATTTCGTTCCTGTGCTAATTTCCA
CTGAGCCAAGAAGAATTGGAAGGTTCATCAATGGCTTACGTAAGGAATTAAGAGCGTTGGTCAGGCAAGGACAACCAACTACTTACGCAGCAGCTCTTGCAAGCGTAATA
GCGTGGGATAGCGAAGTTCCCAGGATGGAGCAGTTCCAGGAGGTGGGCACTTCATCTGGTGTCAAGCGAAAGCATGAGGAGTTGGATCCTGCGTTTGGTTCGCAAGACCA
ATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTAGTGGGAGTCGAGATAGTCATCGTTCTGACTCTTTGGGTGTCCCTGAAGTTGAATTGCACCCTAAGGCGAATCCCCCTATTCCTCCTTCGGTGCCTCATGT
AGTTTTGTCATTGGAAGCGTTGCAAGCTTTGATTGACAATGCAATCCAAAACTATCTGCAGCACCCTGATCCTCCATCATTCGATGGCCTGTCTGAATGTTTAAGAGATG
TCGAAGAGTGGACCTCAGAGCTGGAAGCCCTATTCCATTACCTTGGAGCTGATGACCAACAACGTGTCTTAGGAGCTACCTCTTTGCTCAGAGTTCACACACGCAACTGG
TGGAATTTAGTGGGCCAGCATGAGGATCGCTCCGACAATCCCTTGTCATGGACAAGATTCAAGGGTCTTATGCGAGACCAGTTTGTCGAACGCTTCCCTAGTGATGAAGT
TGAAGATCGGGAAGCGGAGTTTCTATTGCTCGTGCAGGGGAGCATGTCTGTAGTGCAATACGAAAGAAGATTTGCAGAGTTGTCTCATTTCGTTCCTGTGCTAATTTCCA
CTGAGCCAAGAAGAATTGGAAGGTTCATCAATGGCTTACGTAAGGAATTAAGAGCGTTGGTCAGGCAAGGACAACCAACTACTTACGCAGCAGCTCTTGCAAGCGTAATA
GCGTGGGATAGCGAAGTTCCCAGGATGGAGCAGTTCCAGGAGGTGGGCACTTCATCTGGTGTCAAGCGAAAGCATGAGGAGTTGGATCCTGCGTTTGGTTCGCAAGACCA
ATAG
Protein sequenceShow/hide protein sequence
MSSSGSRDSHRSDSLGVPEVELHPKANPPIPPSVPHVVLSLEALQALIDNAIQNYLQHPDPPSFDGLSECLRDVEEWTSELEALFHYLGADDQQRVLGATSLLRVHTRNW
WNLVGQHEDRSDNPLSWTRFKGLMRDQFVERFPSDEVEDREAEFLLLVQGSMSVVQYERRFAELSHFVPVLISTEPRRIGRFINGLRKELRALVRQGQPTTYAAALASVI
AWDSEVPRMEQFQEVGTSSGVKRKHEELDPAFGSQDQ