; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025952 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025952
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:25201534..25202728
RNA-Seq ExpressionLag0025952
SyntenyLag0025952
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]3.7e-2048.15Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPLHCQTTARA
        MK +VP KFK+P  K YDG  D + HL  Y  W D + +++AIRC  FSFTLTG  + WF++LKR+SISSFKELARAF++QF G     +P+    T + 
Subjt:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPLHCQTTARA

Query:  QKYMNAKE
        +   + K+
Subjt:  QKYMNAKE

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]8.7e-2254.74Show/hide
Query:  EEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL
        EE+MK +VP KFK+PT K +DG  D V HL AY+ WMD + VS+A++C  FS TL+G A+ WF +LKR SISSFK LA+AF++QF+G ++  +P+
Subjt:  EEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]7.7e-1832.06Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL--------
        MK +   KFK+P    YDG  D + HL AY+ W D +D+ +AIRC  FSFTLTG A+ WF +LKR SISSFKELA AF++QF+G +   +P+        
Subjt:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL--------

Query:  ----------------------------------------------------HCQTTARAQKYMNAKELLKSKKSKQERKRHSSVDQDRKKDKRQRT---
                                                              +  +RAQKYM+A EL+   +  +  + + S  ++R+ +KR R+   
Subjt:  ----------------------------------------------------HCQTTARAQKYMNAKELLKSKKSKQERKRHSSVDQDRKKDKRQRT---

Query:  -NDGGRGRV
         +D G GR+
Subjt:  -NDGGRGRV

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.1e-2437.73Show/hide
Query:  ILRSPKPSALKGRNHTISTPSYSHIGIDLRDLIEEKRKRTKVVEAEARAIEAKAKV-EARVAKVEDRLAEA----KAKKDNLPWKTELLNTLKKLGNPQG
        ++R PK    KG   + +  S + +G  LR +    R+RT++ +      + K+       +  +DR +E     K K  + P  +E  ++ K+ G    
Subjt:  ILRSPKPSALKGRNHTISTPSYSHIGIDLRDLIEEKRKRTKVVEAEARAIEAKAKV-EARVAKVEDRLAEA----KAKKDNLPWKTELLNTLKKLGNPQG

Query:  DLQKSKGSREQDLEELIDQVNPTFTEEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFK
                   DLEEL+DQ +  FTEE+M+ +VP KFK+PT K +D   D V HL AY+ WMD + VS+A+RC  FS TL G A+ WF +LKR SISSFK
Subjt:  DLQKSKGSREQDLEELIDQVNPTFTEEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFK

Query:  ELARAFLSQFMGAQASHQPL
         LARAF++QF+G +   +P+
Subjt:  ELARAFLSQFMGAQASHQPL

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]2.4e-2742.39Show/hide
Query:  KRKRTKVVEAEARAIEAKAKVEARVAKVEDRLAEAKAKKDNLPWKTE--LLNTLKKLGNPQGDLQKSKGSREQ--DLEELIDQVNPTFTEEVMKAEVPQK
        +R+RT++ +++            ++ K    LA   +K D+    +E   LN  K +  P+   +K    +E+  DLEEL+ Q +  FTEE+M+ +VP K
Subjt:  KRKRTKVVEAEARAIEAKAKVEARVAKVEDRLAEAKAKKDNLPWKTE--LLNTLKKLGNPQGDLQKSKGSREQ--DLEELIDQVNPTFTEEVMKAEVPQK

Query:  FKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL
        FK+PT KP+DG  + V HL AY+ WMD + VSDAIRC  FS TL G A+ WF +LKR SISSFK LARAF++QF+G +   +P+
Subjt:  FKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166191.8e-2048.15Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPLHCQTTARA
        MK +VP KFK+P  K YDG  D + HL  Y  W D + +++AIRC  FSFTLTG  + WF++LKR+SISSFKELARAF++QF G     +P+    T + 
Subjt:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPLHCQTTARA

Query:  QKYMNAKE
        +   + K+
Subjt:  QKYMNAKE

A0A6J1D7D2 uncharacterized protein LOC1110183074.2e-2254.74Show/hide
Query:  EEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL
        EE+MK +VP KFK+PT K +DG  D V HL AY+ WMD + VS+A++C  FS TL+G A+ WF +LKR SISSFK LA+AF++QF+G ++  +P+
Subjt:  EEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL

A0A6J1DWY0 uncharacterized protein LOC1110252935.3e-2537.73Show/hide
Query:  ILRSPKPSALKGRNHTISTPSYSHIGIDLRDLIEEKRKRTKVVEAEARAIEAKAKV-EARVAKVEDRLAEA----KAKKDNLPWKTELLNTLKKLGNPQG
        ++R PK    KG   + +  S + +G  LR +    R+RT++ +      + K+       +  +DR +E     K K  + P  +E  ++ K+ G    
Subjt:  ILRSPKPSALKGRNHTISTPSYSHIGIDLRDLIEEKRKRTKVVEAEARAIEAKAKV-EARVAKVEDRLAEA----KAKKDNLPWKTELLNTLKKLGNPQG

Query:  DLQKSKGSREQDLEELIDQVNPTFTEEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFK
                   DLEEL+DQ +  FTEE+M+ +VP KFK+PT K +D   D V HL AY+ WMD + VS+A+RC  FS TL G A+ WF +LKR SISSFK
Subjt:  DLQKSKGSREQDLEELIDQVNPTFTEEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFK

Query:  ELARAFLSQFMGAQASHQPL
         LARAF++QF+G +   +P+
Subjt:  ELARAFLSQFMGAQASHQPL

A0A6J1DZ49 uncharacterized protein LOC1110248513.7e-1832.06Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL--------
        MK +   KFK+P    YDG  D + HL AY+ W D +D+ +AIRC  FSFTLTG A+ WF +LKR SISSFKELA AF++QF+G +   +P+        
Subjt:  MKAEVPQKFKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL--------

Query:  ----------------------------------------------------HCQTTARAQKYMNAKELLKSKKSKQERKRHSSVDQDRKKDKRQRT---
                                                              +  +RAQKYM+A EL+   +  +  + + S  ++R+ +KR R+   
Subjt:  ----------------------------------------------------HCQTTARAQKYMNAKELLKSKKSKQERKRHSSVDQDRKKDKRQRT---

Query:  -NDGGRGRV
         +D G GR+
Subjt:  -NDGGRGRV

A0A6J1E1E7 uncharacterized protein LOC1110255481.2e-2742.39Show/hide
Query:  KRKRTKVVEAEARAIEAKAKVEARVAKVEDRLAEAKAKKDNLPWKTE--LLNTLKKLGNPQGDLQKSKGSREQ--DLEELIDQVNPTFTEEVMKAEVPQK
        +R+RT++ +++            ++ K    LA   +K D+    +E   LN  K +  P+   +K    +E+  DLEEL+ Q +  FTEE+M+ +VP K
Subjt:  KRKRTKVVEAEARAIEAKAKVEARVAKVEDRLAEAKAKKDNLPWKTE--LLNTLKKLGNPQGDLQKSKGSREQ--DLEELIDQVNPTFTEEVMKAEVPQK

Query:  FKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL
        FK+PT KP+DG  + V HL AY+ WMD + VSDAIRC  FS TL G A+ WF +LKR SISSFK LARAF++QF+G +   +P+
Subjt:  FKVPTFKPYDGKKDHVQHLGAYKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAGAGTGTGTCCAAGATACTTCGTATCTTAAACAAACCTAGTCCTAGCACCAGAGTTCATGAGGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCAAT
GGAGTACACTGTAGAGTCAGAGACAAGATCGAAGGGAAAGAAAACTGATAACATGACTAGCAAGGTCAGGGGGCTCAAACCTACAGGGCGTACCATTCTGAGGAGCCCGA
AACCAAGCGCACTTAAAGGACGCAACCACACAATTTCAACACCAAGTTACAGTCATATCGGAATAGACTTGAGGGATCTGATTGAGGAGAAGCGCAAAAGAACCAAGGTT
GTCGAGGCCGAGGCCAGAGCTATCGAAGCTAAAGCCAAGGTCGAGGCCAGAGTAGCCAAGGTCGAGGATAGATTGGCCGAGGCCAAGGCCAAGAAGGACAATCTCCCTTG
GAAGACTGAGCTTCTGAACACACTAAAGAAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTCAAAGGGCTCTAGAGAACAAGACTTGGAAGAATTGATTGACCAAGTCA
ACCCAACGTTCACGGAAGAAGTCATGAAAGCCGAGGTGCCCCAGAAGTTCAAAGTACCTACATTCAAACCATATGATGGCAAGAAAGACCACGTACAACATCTAGGCGCA
TACAAGAACTGGATGGACTTCCACGACGTCTCAGATGCAATCAGGTGTCACGCCTTCTCTTTCACTCTGACAGGACCAGCCAAGCGATGGTTTGAAAGGTTGAAAAGGAG
ATCCATCAGCTCTTTCAAGGAATTAGCCCGAGCATTCCTCTCACAGTTCATGGGAGCCCAAGCCTCACATCAACCTCTTCACTGTCAAACAACAGCCAGGGCACAGAAGT
ACATGAACGCAAAGGAGCTACTGAAATCAAAGAAGTCAAAACAAGAGCGCAAGAGACATTCTTCAGTCGATCAGGACAGAAAGAAAGATAAGAGGCAGCGAACAAACGAT
GGTGGTCGAGGGCGAGTCGACCATGACCATAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCAGAGTGTGTCCAAGATACTTCGTATCTTAAACAAACCTAGTCCTAGCACCAGAGTTCATGAGGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCAAT
GGAGTACACTGTAGAGTCAGAGACAAGATCGAAGGGAAAGAAAACTGATAACATGACTAGCAAGGTCAGGGGGCTCAAACCTACAGGGCGTACCATTCTGAGGAGCCCGA
AACCAAGCGCACTTAAAGGACGCAACCACACAATTTCAACACCAAGTTACAGTCATATCGGAATAGACTTGAGGGATCTGATTGAGGAGAAGCGCAAAAGAACCAAGGTT
GTCGAGGCCGAGGCCAGAGCTATCGAAGCTAAAGCCAAGGTCGAGGCCAGAGTAGCCAAGGTCGAGGATAGATTGGCCGAGGCCAAGGCCAAGAAGGACAATCTCCCTTG
GAAGACTGAGCTTCTGAACACACTAAAGAAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTCAAAGGGCTCTAGAGAACAAGACTTGGAAGAATTGATTGACCAAGTCA
ACCCAACGTTCACGGAAGAAGTCATGAAAGCCGAGGTGCCCCAGAAGTTCAAAGTACCTACATTCAAACCATATGATGGCAAGAAAGACCACGTACAACATCTAGGCGCA
TACAAGAACTGGATGGACTTCCACGACGTCTCAGATGCAATCAGGTGTCACGCCTTCTCTTTCACTCTGACAGGACCAGCCAAGCGATGGTTTGAAAGGTTGAAAAGGAG
ATCCATCAGCTCTTTCAAGGAATTAGCCCGAGCATTCCTCTCACAGTTCATGGGAGCCCAAGCCTCACATCAACCTCTTCACTGTCAAACAACAGCCAGGGCACAGAAGT
ACATGAACGCAAAGGAGCTACTGAAATCAAAGAAGTCAAAACAAGAGCGCAAGAGACATTCTTCAGTCGATCAGGACAGAAAGAAAGATAAGAGGCAGCGAACAAACGAT
GGTGGTCGAGGGCGAGTCGACCATGACCATAGCTGA
Protein sequenceShow/hide protein sequence
MDQSVSKILRILNKPSPSTRVHEERLVRDPRKGKEPMEYTVESETRSKGKKTDNMTSKVRGLKPTGRTILRSPKPSALKGRNHTISTPSYSHIGIDLRDLIEEKRKRTKV
VEAEARAIEAKAKVEARVAKVEDRLAEAKAKKDNLPWKTELLNTLKKLGNPQGDLQKSKGSREQDLEELIDQVNPTFTEEVMKAEVPQKFKVPTFKPYDGKKDHVQHLGA
YKNWMDFHDVSDAIRCHAFSFTLTGPAKRWFERLKRRSISSFKELARAFLSQFMGAQASHQPLHCQTTARAQKYMNAKELLKSKKSKQERKRHSSVDQDRKKDKRQRTND
GGRGRVDHDHS