; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031881 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031881
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:17713261..17719784
RNA-Seq ExpressionLag0031881
SyntenyLag0031881
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]2.9e-4047.22Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A AWL++ PP+S++ W++LAEKFL KYFPPT+NA+ R E+++F+Q   E     WERFK+++RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC
         K++N+A +IL+ I +NN QW     P SR    V EV+ L  L+ QM  M N+L N+  G  VQ A   Q +   C  C
Subjt:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC

XP_030497851.1 uncharacterized protein LOC115713509 [Cannabis sativa]1.3e-4048.33Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A AWL++ PP+S++ W++LAEKFL KYFPPT+NA+ R E+++F+Q   E    TWERFK+L+RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC
         K++N+A +IL+ I +NN QW     P SR    V EV+ L  L+ QM  M N+L N+  G  VQ  V  Q +   C  C
Subjt:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]4.4e-4142.79Show/hide
Query:  NPILVADEKDRAIRYY-------------------------EGNAEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWE
        NPI +AD++ RAIR Y                            A AWL++ PP+S++ W++LAEKFL KYFPPT+NA+ R E+++F+Q   E     WE
Subjt:  NPILVADEKDRAIRYY-------------------------EGNAEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWE

Query:  RFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLLKKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLM
        RFK+L+RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN ++L K++N+A +IL+ I +NN QW     P SR    V EV+ L  L+ QM  M N+L 
Subjt:  RFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLLKKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLM

Query:  NLTSGNQVQVAVVS--QASTPICTTCNQS
        N+  G  VQ A  S  + S+ +C   NQ+
Subjt:  NLTSGNQVQVAVVS--QASTPICTTCNQS

XP_030509229.1 uncharacterized protein LOC115723907 [Cannabis sativa]8.4e-4046.67Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A AWL++ PPNS++TW++LAEKFL KYFPPT+NA+ R E+++F+Q   E     WERFK+ +RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN  +L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWGAEETPSRSSVE-VPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC
         K++N+A +IL+ I +NN QW     P+   V  V EV+ L  L+ QM  M N+L N+  G  V+     Q +   C  C
Subjt:  KKTFNKANKILDAIVANNSQWGAEETPSRSSVE-VPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]3.7e-4047.78Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A AWL++ PP+S++ W++LAEKFL KYFPPT+NA+ R E+++F+Q   E     WERFK+L+RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC
         K++N+A +IL+ I +NN QW     P SR    V EV+ L  L+ QM  M N+L N+  G  VQ A   Q +   C  C
Subjt:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTC

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333941.6e-3642.33Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A++WL++    +I +W+ L EKFL+KYFPPT+NA+ R E++ F+Q   + +   WERFK+++RKCPHHGL  C+ +E FYNGL  A+K VV+ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQ-------VAVVSQASTPICTTCNQ
         KT+N+A +IL+ I +NN QW      P R +  V EV+ L +++ Q+  + N+L NL  G           VAV++Q +   C  C +
Subjt:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQ-------VAVVSQASTPICTTCNQ

A0A6J1EQ90 uncharacterized protein LOC1114364111.3e-3541.27Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A++WL++  P +I +W+ LAE FL+KYFPPT+NA+ + E++TF+Q   E +    ERFK+++RKCPHHGL  C+ +E FYNGL   +K VV+ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQV-------AVVSQASTPICTTCNQ
         KT+N+A +IL+ I +NN QW      P R +  V EV+ L +++ Q+  + N+L NL  G    +       A ++Q +   C  C +
Subjt:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQV-------AVVSQASTPICTTCNQ

A0A6J1G7Q6 uncharacterized protein LOC1114515981.8e-3542.62Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A++WL+      I +W+ LAEKFL KYFPPT++A+ R E++ F++   E +   WERFK+ +RKCPHHGL  C+ +E FYNGL  A+K VV+ASAN  +L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQW-GAEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTCNQS
         KT+N+A +IL+ I +NN QW      P + + EV EV+ L +++ Q+  M N+L NL  G    +   +  +T +  T  +S
Subjt:  KKTFNKANKILDAIVANNSQW-GAEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTCNQS

A0A6J1H7E4 uncharacterized protein LOC1114611682.2e-3843.39Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A++WL++  P +I +W+ LAEKFL+KYFPPT+NA+ R E++ F+Q   E +   WERFK+++RKCPHHGL  C+ +E FYNGL  A+K VV+ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQV-------AVVSQASTPICTTCNQ
         KT+N+A +IL+ I +NN QW      P + +  V EV+ L +++ Q+  + N+L NL  G    +       AV++Q +T  C  C +
Subjt:  KKTFNKANKILDAIVANNSQWG-AEETPSRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQV-------AVVSQASTPICTTCNQ

U5CUI2 Retrotrans_gag domain-containing protein2.0e-3946.2Show/hide
Query:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL
        A +WL++ PP+S++ W++LAEKFL KYFPPT+NA+ R E+++F+Q   E     WERFK+L+RKCPHHG+  C+ +E FYNGL  AS+ V++ASAN ++L
Subjt:  AEAWLDSFPPNSISTWDELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLL

Query:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGN--QVQVAVVSQASTPICTTCNQ
         K++N+A +IL+ I +NN QW     P SR    V EV+ +  L+ QM  M N+L NL+ GN   +Q A   Q+    C  C +
Subjt:  KKTFNKANKILDAIVANNSQWGAEETP-SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGN--QVQVAVVSQASTPICTTCNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCAAGCGTCTTCAAATTGGTGCAATTCCATGATTCTTTTGTTTCAGGAGTTGTACAGGGAGTTAAAGTCTCGGTCCAATTCTTCGATTTTTGATGCAAGGCGGTA
TCATGCCGCCATGCCCGCTGAGCATATGATATTTGGAGGTGATGTCCTGAGCTGTTTTGCTGAGACTACTCAAGCTAGAGAAATTCTAGCTAGAAAGATTTCAAAGGAAG
TGCAAGTCTTGCATTGGGATCCAACACAAGGTCACAAAGGAGGATGCCATGAAGTGGTTCCAGGTACTTTGATGGATGAAGTGGCTAATACTATAGTAGTGGAGCTTGGT
GAAGGTCAACTTGTTGTATACTTCACCATATGTTGCAGGGCATTTGCGGGCGCAACTGGAGGATTCTATCTGCACCATGTCTATGTGGGCAACATAGATAGTGTTCTTCA
GTCGTTTGCAGTGCCTGAGAAATTTCTATGGAGCCACCTGTTGGAGACTCTCAGTGCGCAGCAAAGAGGAGCACAACCTAACAATGTGCAGCAACAAACTGCGGCAGCAC
ATAATCCTATTCTTGTTGCAGATGAAAAGGATAGAGCCATCAGGTATTATGAGGGCAATGCAGAAGCATGGCTAGATTCTTTTCCACCAAATAGTATCTCCACATGGGAT
GAGCTTGCTGAAAAATTTCTTTTGAAGTATTTTCCGCCTACAAAGAATGCACAAATTCGAGGAGAAATGATCACTTTTAAGCAAGGACCACAAGAAAGAGTAGATACAAC
ATGGGAGCGCTTTAAAAAGTTGATAAGGAAATGTCCACACCATGGCTTGATTGCTTGTCTTCTTGTAGAGCATTTTTACAACGGATTGACTCAAGCTTCCAAAACAGTAG
TCAATGCATCAGCTAATTGTTCGTTGCTTAAAAAGACTTTCAATAAGGCAAACAAAATTCTGGACGCAATTGTTGCAAACAACAGTCAATGGGGAGCAGAAGAAACACCT
TCTAGAAGTAGTGTCGAGGTCCCAGAAGTTAATCATCTAGAGACACTCTCGGAGCAAATGTTAATTATGAACAATATGCTTATGAATCTCACTTCAGGGAATCAAGTTCA
AGTTGCAGTCGTGAGCCAAGCTTCAACCCCCATCTGCACAACTTGCAATCAAAGCATTCGTTTGAAGATTGTCCTTAAAGTCCAGCTTCAATATATTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTCAAGCGTCTTCAAATTGGTGCAATTCCATGATTCTTTTGTTTCAGGAGTTGTACAGGGAGTTAAAGTCTCGGTCCAATTCTTCGATTTTTGATGCAAGGCGGTA
TCATGCCGCCATGCCCGCTGAGCATATGATATTTGGAGGTGATGTCCTGAGCTGTTTTGCTGAGACTACTCAAGCTAGAGAAATTCTAGCTAGAAAGATTTCAAAGGAAG
TGCAAGTCTTGCATTGGGATCCAACACAAGGTCACAAAGGAGGATGCCATGAAGTGGTTCCAGGTACTTTGATGGATGAAGTGGCTAATACTATAGTAGTGGAGCTTGGT
GAAGGTCAACTTGTTGTATACTTCACCATATGTTGCAGGGCATTTGCGGGCGCAACTGGAGGATTCTATCTGCACCATGTCTATGTGGGCAACATAGATAGTGTTCTTCA
GTCGTTTGCAGTGCCTGAGAAATTTCTATGGAGCCACCTGTTGGAGACTCTCAGTGCGCAGCAAAGAGGAGCACAACCTAACAATGTGCAGCAACAAACTGCGGCAGCAC
ATAATCCTATTCTTGTTGCAGATGAAAAGGATAGAGCCATCAGGTATTATGAGGGCAATGCAGAAGCATGGCTAGATTCTTTTCCACCAAATAGTATCTCCACATGGGAT
GAGCTTGCTGAAAAATTTCTTTTGAAGTATTTTCCGCCTACAAAGAATGCACAAATTCGAGGAGAAATGATCACTTTTAAGCAAGGACCACAAGAAAGAGTAGATACAAC
ATGGGAGCGCTTTAAAAAGTTGATAAGGAAATGTCCACACCATGGCTTGATTGCTTGTCTTCTTGTAGAGCATTTTTACAACGGATTGACTCAAGCTTCCAAAACAGTAG
TCAATGCATCAGCTAATTGTTCGTTGCTTAAAAAGACTTTCAATAAGGCAAACAAAATTCTGGACGCAATTGTTGCAAACAACAGTCAATGGGGAGCAGAAGAAACACCT
TCTAGAAGTAGTGTCGAGGTCCCAGAAGTTAATCATCTAGAGACACTCTCGGAGCAAATGTTAATTATGAACAATATGCTTATGAATCTCACTTCAGGGAATCAAGTTCA
AGTTGCAGTCGTGAGCCAAGCTTCAACCCCCATCTGCACAACTTGCAATCAAAGCATTCGTTTGAAGATTGTCCTTAAAGTCCAGCTTCAATATATTTTGTAG
Protein sequenceShow/hide protein sequence
MFQASSNWCNSMILLFQELYRELKSRSNSSIFDARRYHAAMPAEHMIFGGDVLSCFAETTQAREILARKISKEVQVLHWDPTQGHKGGCHEVVPGTLMDEVANTIVVELG
EGQLVVYFTICCRAFAGATGGFYLHHVYVGNIDSVLQSFAVPEKFLWSHLLETLSAQQRGAQPNNVQQQTAAAHNPILVADEKDRAIRYYEGNAEAWLDSFPPNSISTWD
ELAEKFLLKYFPPTKNAQIRGEMITFKQGPQERVDTTWERFKKLIRKCPHHGLIACLLVEHFYNGLTQASKTVVNASANCSLLKKTFNKANKILDAIVANNSQWGAEETP
SRSSVEVPEVNHLETLSEQMLIMNNMLMNLTSGNQVQVAVVSQASTPICTTCNQSIRLKIVLKVQLQYIL