; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008543 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008543
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:24907500..24908916
RNA-Seq ExpressionLag0008543
SyntenyLag0008543
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]3.2e-3540.52Show/hide
Query:  RSDSPGIPKVESHPEANPPIPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYL
        R  +P +P++     A   +P   P V L  EALQ L+DNA      Q     +A  + ++V+F R F +  PP FNG+S      EEW  ELEAL+ YL
Subjt:  RSDSPGIPKVESHPEANPPIPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYL

Query:  GADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRR
        G     +V GA  +LRG A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + TE  +
Subjt:  GADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRR

Query:  IGRFINGLRRELRALVRQGQLTTYAGAFASAI
        I +FI+ LRRE++ L+   + TTYA A   A+
Subjt:  IGRFINGLRRELRALVRQGQLTTYAGAFASAI

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]1.9e-3538.49Show/hide
Query:  PKVESHPEANPPIPPPMP-HVVLSLEALQALIDNAI---QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADA
        P       A+  +PP +P  VVL  EALQ L+DNA       +Q P   Q S   ++V+F R F +  PP FNG+S      EEW  ELEAL+ YLG   
Subjt:  PKVESHPEANPPIPPPMP-HVVLSLEALQALIDNAI---QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADA

Query:  RQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRF
          +V GA  +L+G A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L Q S+ V QYER+F ELS F  + + TE  +I +F
Subjt:  RQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRF

Query:  INGLRRELRALVRQGQLTTYAGAFASAIACGIAKFLGRSSSRRLELSEFVVREAKISKSSKVPSENELKDGKQEKLMP
        I+GLRRE++ L+   + TTYA A   A+   + K L    S+++  S   V+    S SS  PS     +G+++   P
Subjt:  INGLRRELRALVRQGQLTTYAGAFASAIACGIAKFLGRSSSRRLELSEFVVREAKISKSSKVPSENELKDGKQEKLMP

XP_022156172.1 uncharacterized protein LOC111023126 [Momordica charantia]2.5e-3540.08Show/hide
Query:  PGIP--KVESHPEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLG
        P +P   V   P+A P  +P   P V L  EALQ L+DNA      Q     +A  + ++V+F R F +  P  FNG+S      EEW  ELEAL  YLG
Subjt:  PGIP--KVESHPEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLG

Query:  ADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRI
             +V GA  +LRG   NWW  V   ED  N P++WTRFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + T+  +I
Subjt:  ADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRI

Query:  GRFINGLRRELRALVRQGQLTTYAGAFASAIACGIAK
         +FI+GLRRE++ L+   + TTYA A   A+   + +
Subjt:  GRFINGLRRELRALVRQGQLTTYAGAFASAIACGIAK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]4.2e-3542.27Show/hide
Query:  PEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGAT
        P+A P  +P   P V L  EALQ L+DNA      Q     +A     +V+F R F +  PP FNG+S      EEW  ELEAL+ YLG     +V GA 
Subjt:  PEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGAT

Query:  SLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRREL
         +LRG A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + TE  +I +FI+GLR E+
Subjt:  SLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRREL

Query:  RALVRQGQLTTYAGAFASAI
        + L+   + TTYA A   A+
Subjt:  RALVRQGQLTTYAGAFASAI

XP_022158645.1 uncharacterized protein LOC111025102 [Momordica charantia]3.2e-3542.31Show/hide
Query:  PHVVLSLEALQALIDNAIQ-NYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGATSLLRGHARNWWN
        P V L  EALQ L+DNA +    Q     +A  + ++V+F R F +  PP FNG+S      EEW  ELEAL+ YLG     +V GA  +L G A NWW 
Subjt:  PHVVLSLEALQALIDNAIQ-NYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGATSLLRGHARNWWN

Query:  LVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRRELRALVRQGQLTTY
         V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V Q+ER+F ELS F  + + TE  +I +FI+GLRRE++ L+   + TTY
Subjt:  LVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRRELRALVRQGQLTTY

Query:  AGAFASAI
        A A   A+
Subjt:  AGAFASAI

TrEMBL top hitse value%identityAlignment
A0A6J1DNV8 uncharacterized protein LOC1110229259.1e-3638.49Show/hide
Query:  PKVESHPEANPPIPPPMP-HVVLSLEALQALIDNAI---QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADA
        P       A+  +PP +P  VVL  EALQ L+DNA       +Q P   Q S   ++V+F R F +  PP FNG+S      EEW  ELEAL+ YLG   
Subjt:  PKVESHPEANPPIPPPMP-HVVLSLEALQALIDNAI---QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADA

Query:  RQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRF
          +V GA  +L+G A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L Q S+ V QYER+F ELS F  + + TE  +I +F
Subjt:  RQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRF

Query:  INGLRRELRALVRQGQLTTYAGAFASAIACGIAKFLGRSSSRRLELSEFVVREAKISKSSKVPSENELKDGKQEKLMP
        I+GLRRE++ L+   + TTYA A   A+   + K L    S+++  S   V+    S SS  PS     +G+++   P
Subjt:  INGLRRELRALVRQGQLTTYAGAFASAIACGIAKFLGRSSSRRLELSEFVVREAKISKSSKVPSENELKDGKQEKLMP

A0A6J1DRB3 uncharacterized protein LOC1110231261.2e-3540.08Show/hide
Query:  PGIP--KVESHPEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLG
        P +P   V   P+A P  +P   P V L  EALQ L+DNA      Q     +A  + ++V+F R F +  P  FNG+S      EEW  ELEAL  YLG
Subjt:  PGIP--KVESHPEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLG

Query:  ADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRI
             +V GA  +LRG   NWW  V   ED  N P++WTRFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + T+  +I
Subjt:  ADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRI

Query:  GRFINGLRRELRALVRQGQLTTYAGAFASAIACGIAK
         +FI+GLRRE++ L+   + TTYA A   A+   + +
Subjt:  GRFINGLRRELRALVRQGQLTTYAGAFASAIACGIAK

A0A6J1DRF5 uncharacterized protein LOC1110224741.6e-3540.52Show/hide
Query:  RSDSPGIPKVESHPEANPPIPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYL
        R  +P +P++     A   +P   P V L  EALQ L+DNA      Q     +A  + ++V+F R F +  PP FNG+S      EEW  ELEAL+ YL
Subjt:  RSDSPGIPKVESHPEANPPIPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYL

Query:  GADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRR
        G     +V GA  +LRG A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + TE  +
Subjt:  GADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRR

Query:  IGRFINGLRRELRALVRQGQLTTYAGAFASAI
        I +FI+ LRRE++ L+   + TTYA A   A+
Subjt:  IGRFINGLRRELRALVRQGQLTTYAGAFASAI

A0A6J1DTA8 uncharacterized protein LOC1110241142.0e-3542.27Show/hide
Query:  PEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGAT
        P+A P  +P   P V L  EALQ L+DNA      Q     +A     +V+F R F +  PP FNG+S      EEW  ELEAL+ YLG     +V GA 
Subjt:  PEANPP-IPPPMPHVVLSLEALQALIDNAI-QNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGAT

Query:  SLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRREL
         +LRG A NWW  V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V QYER+F ELS F  + + TE  +I +FI+GLR E+
Subjt:  SLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRREL

Query:  RALVRQGQLTTYAGAFASAI
        + L+   + TTYA A   A+
Subjt:  RALVRQGQLTTYAGAFASAI

A0A6J1DWE6 uncharacterized protein LOC1110251021.6e-3542.31Show/hide
Query:  PHVVLSLEALQALIDNAIQ-NYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGATSLLRGHARNWWN
        P V L  EALQ L+DNA +    Q     +A  + ++V+F R F +  PP FNG+S      EEW  ELEAL+ YLG     +V GA  +L G A NWW 
Subjt:  PHVVLSLEALQALIDNAIQ-NYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLGADARQRVLGATSLLRGHARNWWN

Query:  LVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRRELRALVRQGQLTTY
         V   ED  N P++W RFK L+ + +   FP     +   EFL L QGS+ V Q+ER+F ELS F  + + TE  +I +FI+GLRRE++ L+   + TTY
Subjt:  LVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRRELRALVRQGQLTTY

Query:  AGAFASAI
        A A   A+
Subjt:  AGAFASAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTAGTGGGAGTCGAGATAGCCGTCGTTCTGACTCTCCGGGTATCCCCAAAGTCGAATCGCACCCTGAGGCAAATCCCCCTATTCCTCCTCCGATGCCTCATGT
AGTGTTGTCATTGGAAGCGTTGCAAGCTTTGATTGACAATGCAATCCAAAACTATCTACAGCACCCTGATGTGAATCAAGCTTCTACACGAAATAAGGATGTCCGATTCT
TTCGAAGCTTCAATAAAGCCAACCCTCCATCATTCAATGGCCTATCTGGAAGTTCGAGAGTTGTCGAAGAGTGGACCTCAGAGCTGGAAGCCCTATTCCAATACCTTGGA
GCTGATGCCCGACAACGTGTCTTAGGAGCTACCTCATTGCTCAGAGGTCACGCACGCAATTGGTGGAATTTAGTGGGCCAGCAAGAGGATCGCCCCAACAATCCCTTGTC
ATGGACAAGATTCAAGGGTCTTATGCGAGACCAGTTTGTCCAACGCTTCCCTAATGATGAAGTTGAAGATCTGGAAGTGGAGTTTCTATTTCTCGTGCAGGGGAGCATGC
CCGTAGTGCAATACGAAAGAAGATTTGCAGAGTTGTCTCATTTCGTTCCGGAGCTAGTTGCCACTGAGCCAAGAAGAATTGGAAGGTTCATCAATGGCTTGCGTAGGGAA
TTAAGAGCGTTGGTCAGGCAAGGACAACTAACTACTTACGCAGGAGCTTTTGCAAGCGCAATAGCCTGTGGGATAGCGAAGTTCCTAGGACGGAGCAGTTCTAGGAGACT
TGAGCTTAGCGAGTTTGTTGTAAGGGAAGCCAAGATCTCGAAGAGTTCAAAGGTTCCCTCCGAAAACGAGCTAAAAGATGGCAAGCAAGAGAAGTTGATGCCTTGGATTT
CTGATGAAGAACTAAAGGCAGAATACCCAGAGCTTTACAACGGCGAATCTGAAGAAGAGGAAAGTCGCTATAGGCGGGAAGTCGGTTTCACCTCTCGTTTCCAGTTAGAT
CCTTCGTTTCCTAGGAAGTTAATGGAAGCTAGTAAGTTGGTTTCCAAGCTCTATCAAGCATCGCCTAGATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTAGTGGGAGTCGAGATAGCCGTCGTTCTGACTCTCCGGGTATCCCCAAAGTCGAATCGCACCCTGAGGCAAATCCCCCTATTCCTCCTCCGATGCCTCATGT
AGTGTTGTCATTGGAAGCGTTGCAAGCTTTGATTGACAATGCAATCCAAAACTATCTACAGCACCCTGATGTGAATCAAGCTTCTACACGAAATAAGGATGTCCGATTCT
TTCGAAGCTTCAATAAAGCCAACCCTCCATCATTCAATGGCCTATCTGGAAGTTCGAGAGTTGTCGAAGAGTGGACCTCAGAGCTGGAAGCCCTATTCCAATACCTTGGA
GCTGATGCCCGACAACGTGTCTTAGGAGCTACCTCATTGCTCAGAGGTCACGCACGCAATTGGTGGAATTTAGTGGGCCAGCAAGAGGATCGCCCCAACAATCCCTTGTC
ATGGACAAGATTCAAGGGTCTTATGCGAGACCAGTTTGTCCAACGCTTCCCTAATGATGAAGTTGAAGATCTGGAAGTGGAGTTTCTATTTCTCGTGCAGGGGAGCATGC
CCGTAGTGCAATACGAAAGAAGATTTGCAGAGTTGTCTCATTTCGTTCCGGAGCTAGTTGCCACTGAGCCAAGAAGAATTGGAAGGTTCATCAATGGCTTGCGTAGGGAA
TTAAGAGCGTTGGTCAGGCAAGGACAACTAACTACTTACGCAGGAGCTTTTGCAAGCGCAATAGCCTGTGGGATAGCGAAGTTCCTAGGACGGAGCAGTTCTAGGAGACT
TGAGCTTAGCGAGTTTGTTGTAAGGGAAGCCAAGATCTCGAAGAGTTCAAAGGTTCCCTCCGAAAACGAGCTAAAAGATGGCAAGCAAGAGAAGTTGATGCCTTGGATTT
CTGATGAAGAACTAAAGGCAGAATACCCAGAGCTTTACAACGGCGAATCTGAAGAAGAGGAAAGTCGCTATAGGCGGGAAGTCGGTTTCACCTCTCGTTTCCAGTTAGAT
CCTTCGTTTCCTAGGAAGTTAATGGAAGCTAGTAAGTTGGTTTCCAAGCTCTATCAAGCATCGCCTAGATTCTAA
Protein sequenceShow/hide protein sequence
MSSSGSRDSRRSDSPGIPKVESHPEANPPIPPPMPHVVLSLEALQALIDNAIQNYLQHPDVNQASTRNKDVRFFRSFNKANPPSFNGLSGSSRVVEEWTSELEALFQYLG
ADARQRVLGATSLLRGHARNWWNLVGQQEDRPNNPLSWTRFKGLMRDQFVQRFPNDEVEDLEVEFLFLVQGSMPVVQYERRFAELSHFVPELVATEPRRIGRFINGLRRE
LRALVRQGQLTTYAGAFASAIACGIAKFLGRSSSRRLELSEFVVREAKISKSSKVPSENELKDGKQEKLMPWISDEELKAEYPELYNGESEEEESRYRREVGFTSRFQLD
PSFPRKLMEASKLVSKLYQASPRF