CuGenDBv2

Lag0039010 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039010
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr2:33636145..33637092
RNA-Seq Expression: Lag0039010
Synteny: Lag0039010
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
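The genome location above follows a `chrom:start..end` pattern. As a minimal sketch, the span it covers can be computed as below; the function names are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr2:33636145..33637092")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the locus spans 948 bp.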


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia] (e value: 3.6e-38; % identity: 39.43)
Query:  APLVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A + L   ALQA++ N+      Q +    A A   E QFI+ F +  PP+F+G S+ +  V EW   LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APLVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAA
        VV   E+    PI+W   K L+ D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RPTT+A A
Subjt:  VVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAA

Query:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKSEVEALIASQKARGSP
        +  A ++D+D I +    Q+VG SSG K+K  V  + +SQ ++ SP
Subjt:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKSEVEALIASQKARGSP

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia] (e value: 9.5e-39; % identity: 39.23)
Query:  SEANPPIPPPAPLVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQR
        ++ N P   P  +VL AEALQ +L NA        GA GA    P+ G    EEVQFI+ F +  PP F+G S+   A  EW   LEA++ +LG +   +
Subjt:  SEANPPIPPPAPLVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQR

Query:  VQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRA
        V+GA FML+G A  WW  V   E+    P++W  FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS      + TE+++I++F++GLR 
Subjt:  VQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRA

Query:  EIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        EI+GL+ L  PTT+AAA+  A ++D+ +      Q +G+SSG K+K    +  +SQ +RG
Subjt:  EIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 2.5e-39; % identity: 46.91)
Query:  EVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQE
        E +FIK F +  PP+FDG S+ + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W  FK L+ D++        +E
Subjt:  EVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RPTT+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] (e value: 9.5e-39; % identity: 38.49)
Query:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA
        PP+P  AP         + L AEALQ +L NA        GA GA            +EVQFI+ F    PP F+G S+   A  EW   LEA++ +LG 
Subjt:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA

Query:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV
        +   +V+GA FML+G A  WW  V   E+    P++W  FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++I++F+
Subjt:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        +GLR EI+GL+ L  PTT+AAA+  A ++D+ +      Q +G++SG K+K    +  ASQ +RG
Subjt:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia] (e value: 2.1e-38; % identity: 37.74)
Query:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA
        PP+P  AP         + L AEALQ +L NA        GA GA            +EVQFI+ F +  PP F+G S+   A  EW   LEA++ +LG 
Subjt:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA

Query:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV
        +   +V+GA FML+G A  WW  V   E+    P++W  FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++F+
Subjt:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        +GLR EI+GL+ +  PTT+AAA+  A ++D+ +      Q +G+SSG K+K  +    +SQ +RG
Subjt:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603 (e value: 1.7e-38; % identity: 39.43)
Query:  APLVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN
        A + L   ALQA++ N+      Q +    A A   E QFI+ F +  PP+F+G S+ +  V EW   LEA++ +LG + Q +V+GA FML+G A  WW+
Subjt:  APLVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWN

Query:  VVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAA
        VV   E+    PI+W   K L+ D++  +    E+E EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RPTT+A A
Subjt:  VVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAA

Query:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKSEVEALIASQKARGSP
        +  A ++D+D I +    Q+VG SSG K+K  V  + +SQ ++ SP
Subjt:  LASARMLDRD-IPRTDQSQEVGTSSGAKKKSEVEALIASQKARGSP

A0A6J1DNV8 uncharacterized protein LOC111022925 (e value: 4.6e-39; % identity: 39.23)
Query:  SEANPPIPPPAPLVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQR
        ++ N P   P  +VL AEALQ +L NA        GA GA    P+ G    EEVQFI+ F +  PP F+G S+   A  EW   LEA++ +LG +   +
Subjt:  SEANPPIPPPAPLVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQR

Query:  VQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRA
        V+GA FML+G A  WW  V   E+    P++W  FK L+ +++       E+ AEF+ L Q +L V QY R+F ELS      + TE+++I++F++GLR 
Subjt:  VQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRA

Query:  EIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        EI+GL+ L  PTT+AAA+  A ++D+ +      Q +G+SSG K+K    +  +SQ +RG
Subjt:  EIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

A0A6J1DQB9 Reverse transcriptase (e value: 4.6e-39; % identity: 38.49)
Query:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA
        PP+P  AP         + L AEALQ +L NA        GA GA            +EVQFI+ F    PP F+G S+   A  EW   LEA++ +LG 
Subjt:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA

Query:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV
        +   +V+GA FML+G A  WW  V   E+    P++W  FK L+ +++    A  E+  EF+ L QG+L+V QY R+F ELS      V TE+++I++F+
Subjt:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        +GLR EI+GL+ L  PTT+AAA+  A ++D+ +      Q +G++SG K+K    +  ASQ +RG
Subjt:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

A0A6J1DTA8 uncharacterized protein LOC111024114 (e value: 1.0e-38; % identity: 37.74)
Query:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA
        PP+P  AP         + L AEALQ +L NA        GA GA            +EVQFI+ F +  PP F+G S+   A  EW   LEA++ +LG 
Subjt:  PPIPPPAP---------LVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA

Query:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV
        +   +V+GA FML+G A  WW  V   E+    P++W  FK L+ +++       E+ AEF+ L QG+L+V QY R+F ELS      + TE+++I++F+
Subjt:  NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG
        +GLR EI+GL+ +  PTT+AAA+  A ++D+ +      Q +G+SSG K+K  +    +SQ +RG
Subjt:  NGLRAEIRGLVRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARG

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 1.2e-39; % identity: 46.91)
Query:  EVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQE
        E +FIK F +  PP+FDG S+ + AV EW   LEA++ +LG   Q +V+GA FML+G A  WW+ V   E+    PI W  FK L+ D++        +E
Subjt:  EVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGANAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQE

Query:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK
        AEF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RPTT+A A+  A ++D+D+  +     EVG+SSG K+K
Subjt:  AEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGLVRLSRPTTFAAALASARMLDRDIP-RTDQSQEVGTSSGAKKK

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAGTTCAAGTAGCAGTCAAGGTAGTGGACGTTCTGGTCCTCCAGACGTTCTCGAGGTCAATTCGCATTCTGAGGCGAATCCTCCCATTCCTCCCCCAGCGCCTCTTGT
GCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCATTCCTGAACAACCTGCAGCACGTCGGTGCAAATGGAGCCCCTGCTCATGGCGAAGAGGTGCAGTTTATCA
AGAGTTTCATGAAGGTGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGACCGCCGCATTGGAAGCGATATTTCAATTTCTTGGAGCT
AATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGTCACGCTCGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCCTG
GTTGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAACAGT
ACGTCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTTTG
GTCAGGCTTAGTCGACCGACCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCAGTCCCAAGAGGTTGGCACGTCGTCTGG
TGCTAAGAAGAAGAGCGAAGTGGAAGCGCTTATAGCTAGTCAGAAGGCTAGAGGATCTCCGTCAGGATCTAGTGCGTGCACTGAGGAGTTCTTGCCCTGTGTCACCGATG
AGGAACTCAAGGCAGAATACCCAGACCTTTACGATGTCGATGATTCCGATGATGAAGATAGTTCCTAA
mRNA sequence
ATGAGTTCAAGTAGCAGTCAAGGTAGTGGACGTTCTGGTCCTCCAGACGTTCTCGAGGTCAATTCGCATTCTGAGGCGAATCCTCCCATTCCTCCCCCAGCGCCTCTTGT
GCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCATTCCTGAACAACCTGCAGCACGTCGGTGCAAATGGAGCCCCTGCTCATGGCGAAGAGGTGCAGTTTATCA
AGAGTTTCATGAAGGTGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGACCGCCGCATTGGAAGCGATATTTCAATTTCTTGGAGCT
AATGCCCAACAGCGGGTCCAAGGAGCTGCCTTTATGCTCAAAGGTCACGCTCGCACTTGGTGGAACGTTGTGGGTCAAACCGAGAACCGCCCAGAGAATCCCATTTCCTG
GTTGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGATGTTGAGCAAGAAGCAGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGAACAGT
ACGTCAGAAGGTTTGAAGAGTTATCCTGCCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAGGTTTG
GTCAGGCTTAGTCGACCGACCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGACGGATCAGTCCCAAGAGGTTGGCACGTCGTCTGG
TGCTAAGAAGAAGAGCGAAGTGGAAGCGCTTATAGCTAGTCAGAAGGCTAGAGGATCTCCGTCAGGATCTAGTGCGTGCACTGAGGAGTTCTTGCCCTGTGTCACCGATG
AGGAACTCAAGGCAGAATACCCAGACCTTTACGATGTCGATGATTCCGATGATGAAGATAGTTCCTAA
Protein sequence
MSSSSSQGSGRSGPPDVLEVNSHSEANPPIPPPAPLVLAAEALQAMLGNAFLNNLQHVGANGAPAHGEEVQFIKSFMKVKPPSFDGHSDSSEAVVEWTAALEAIFQFLGA
NAQQRVQGAAFMLKGHARTWWNVVGQTENRPENPISWLGFKGLVRDHFGCRFADVEQEAEFVSLVQGTLSVEQYVRRFEELSCRVPGLVATEEIRINRFVNGLRAEIRGL
VRLSRPTTFAAALASARMLDRDIPRTDQSQEVGTSSGAKKKSEVEALIASQKARGSPSGSSACTEEFLPCVTDEELKAEYPDLYDVDDSDDEDSS
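The CDS and protein sequences above can be checked against each other by translation. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not say which table was used); `translate` and `CODON_TABLE` are illustrative names, and the two inputs are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


start_of_cds = "ATGAGTTCAAGTAGCAGTCAAGGTAGTGGA"  # first 30 nt of the CDS
end_of_cds = "GATGAAGATAGTTCCTAA"                # last 18 nt of the CDS
print(translate(start_of_cds))  # MSSSSSQGSG
print(translate(end_of_cds))    # DEDSS
```

Both results agree with the start (MSSSSSQGSG) and end (DEDSS) of the listed protein sequence.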