; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039710 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039710
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr2:48717470..48718301
RNA-Seq ExpressionLag0039710
SyntenyLag0039710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8475506.1 hypothetical protein CXB51_032293 [Gossypium anomalum]5.0e-1833.04Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPSRSSVKIQEVNHLETLSE------------RMLTMNAANAA
        G+P C+ +E FYNGLN  ++ +V+A+AN ++L  ++NEA EI++ I +NN Q       SR  V   EV+ ++TL+             +  T N+AN  
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPSRSSVKIQEVNHLETLSE------------RMLTMNAANAA

Query:  IA---------------------NASATP-----------------------------------INNKLENLMRELMTKNDVVIQSQIASLRSLESQIGQ
        +A                     N  + P                                    +N LENL++  M KND +IQSQ  +L++LE+QIGQ
Subjt:  IA---------------------NASATP-----------------------------------INNKLENLMRELMTKNDVVIQSQIASLRSLESQIGQ

Query:  LAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
        LA E+RNRP GTLPSNT+N +  G    KA
Subjt:  LAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]3.5e-1941.42Show/hide
Query:  VEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGE-EETPSRSSVKIQEVNHLETLSERM--LTMNAANAAIANAS--------ATP
        +E FYNGLN  +K +V+ASAN  +L  T+NEA EIL+ I +NN Q  +    P R +  + EV+ L +++ ++  +T    N A+   S        A  
Subjt:  VEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGE-EETPSRSSVKIQEVNHLETLSERM--LTMNAANAAIANAS--------ATP

Query:  INNKLE---NLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
        IN       +L++E M KND  IQSQ ASLR+LE Q+GQLA E+RNRP+  LP++T+  K+EG+ + +A
Subjt:  INNKLE---NLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

XP_030490806.1 uncharacterized protein LOC115707099 [Cannabis sativa]2.9e-1835.44Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTMNAA--------------
        G+P C+ +E F NGLN  S+ +++ASAN  +L  ++NE  EIL+ I +NN Q      P SR    + EV+ L  L+ ++  +  A              
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTMNAA--------------

Query:  ---NAAIANASATPINNK--------------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEG
               A++S      K                          LE+LMR+ M KNDVVIQSQ ASLR+LE Q+GQLA +++NRP GTLPS+T+N +++ 
Subjt:  ---NAAIANASATPINNK--------------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEG

Query:  LSKDKA
            KA
Subjt:  LSKDKA

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]1.7e-1832.76Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTM-----------------
        G+P C+ +E FYNGLN  S+ +++ASAN  +L  ++NEA EIL+ I +NN Q      P SR    + EV+ L  L+ +M +M                 
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTM-----------------

Query:  -----------------------NAANAAIANASATPI-----------------------------NNKLENLMRELMTKNDVVIQSQIASLRSLESQI
                               N A+     AS++                                + LE+LMR+ M KND VIQSQ ASL++LE Q+
Subjt:  -----------------------NAANAAIANASATPI-----------------------------NNKLENLMRELMTKNDVVIQSQIASLRSLESQI

Query:  GQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
        GQLA +++NRP GTLPS+T+N +++G    KA
Subjt:  GQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]2.8e-2137.31Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTMNAANAAI------ANAS
        G+P C+ +E FYNGLN  S+ +++ASAN  +L  ++NEA EIL+ I +NN Q      P SR    + EV+ +  L+ +M +M   +  +      A++S
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETP-SRSSVKIQEVNHLETLSERMLTMNAANAAI------ANAS

Query:  ATPINNK------------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
          P   +                        LE+LMR+ M KND VIQSQ ASLR+LE Q+G LA E++ RP G+LPS+T+N +++G  + K+
Subjt:  ATPINNK------------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

TrEMBL top hitse value%identityAlignment
A0A5B6VNY6 Gag-asp_proteas domain-containing protein2.8e-1435.4Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQ-CGEEETPSRSSVKIQEVNHLETLSERMLTMNAA----NAAIANASAT
        G+P C+ +E FYNGLN  ++ +V+ASAN +LL  ++NEA  I+D I + N Q         R   ++ EV+ L +L   + ++++          N  AT
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQ-CGEEETPSRSSVKIQEVNHLETLSERMLTMNAA----NAAIANASAT

Query:  PINNKLENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEG
            + + +    M KND +IQ Q+A+L++LE+++GQLA E+  RP G  PS+ KN +  G
Subjt:  PINNKLENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEG

A0A5B6VWJ0 Retroelement pol polyprotein-like1.1e-1027.59Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPS-RSSVKIQEVNHLETLSERM---------LTMNAANAAIA
        G+P C+ +E FYNGL   ++ +V+ASAN  LL  ++NEA EI++ I +NN Q       S R    I EV+ + +L+ ++         LT N +N+  A
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPS-RSSVKIQEVNHLETLSERM---------LTMNAANAAIA

Query:  --------------------------------------------------NAS---------------------------------------ATPINNKL
                                                          N+S                                           +N L
Subjt:  --------------------------------------------------NAS---------------------------------------ATPINNKL

Query:  ENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
        E+L++  M KND +IQSQ A+L++LE+Q+GQLA E+RNR  G LPS+T+N +  G    KA
Subjt:  ENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

A0A5B6WTE7 Retrovirus-related Pol polyprotein from transposon opus1.1e-1538.51Show/hide
Query:  FYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPS-RSSVKIQEVNHLETLSERMLTMNAANAAIANASATPINNKLENLMRELM
        FYNGLN  ++ +V+AS N  LL  ++NEA EI++ I +NN Q     T S R  V++ +V  L  LS  + ++++           PI  +L +  +  M
Subjt:  FYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPS-RSSVKIQEVNHLETLSERMLTMNAANAAIANASATPINNKLENLMRELM

Query:  TKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGL
         KND +IQSQ  +L++LE+Q+GQ+A E+ NR  G LPS+T+N +  G+
Subjt:  TKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGL

A0A6J1G7Q6 uncharacterized protein LOC1114515988.9e-1326.91Show/hide
Query:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGE-EETPSRSSVKIQEVNHLETLSERMLTM-----------------
        GLP C+ +E FYNGLN  +K +V+ASAN  +L  T+NEA EIL+ I +NN Q  +    P + + ++ EV+ L +++ ++ +M                 
Subjt:  GLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGE-EETPSRSSVKIQEVNHLETLSERMLTM-----------------

Query:  ------------------------------NAAN-------AAIANASATPINNK---------------------------------------------
                                      N A+       A+  N    P +N                                              
Subjt:  ------------------------------NAAN-------AAIANASATPINNK---------------------------------------------

Query:  ------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGL
                          LE+L++E M +ND VIQSQ  SLR+LE Q+GQLA E+RNRP+G LP++T+  K+EG+
Subjt:  ------------------LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGL

A0A6J1H7K8 uncharacterized protein LOC1114611672.4e-1062.9Show/hide
Query:  LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA
        LE+L++E M KNDVVIQSQ ASLR+LE Q+GQLA E+RNRP+G LPS+T+  K+EG+ + +A
Subjt:  LENLMRELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACACCGCGTGGCTTGCCTGCTTGTCTTCTTGTTGAGCATTTTTACAATGGATTGAATCAAACTTCCAAAGCAATAGTCAATGCATCAGCTAATGCTTTGTTGCT
TAAAAATACTTTCAATGAGGCAAATGAGATTTTAGACGCAATTGTTGCAAATAACAGTCAATGTGGAGAAGAAGAAACACCATCTAGAAGCAGTGTCAAGATCCAAGAAG
TCAATCATCTCGAGACACTATCGGAGCGGATGCTAACTATGAATGCCGCAAATGCAGCAATAGCCAATGCAAGTGCAACCCCTATCAACAACAAACTGGAGAATCTTATG
AGGGAACTAATGACCAAGAATGACGTAGTAATTCAAAGTCAAATAGCTTCCTTGAGGAGTTTAGAATCTCAAATTGGGCAGTTAGCCAAGGAAATGAGGAACAGACCAAT
AGGGACCCTACCAAGCAATACAAAGAACTCGAAAAAAGAAGGCCTAAGCAAGGATAAGGCCTCCATTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCACACCGCGTGGCTTGCCTGCTTGTCTTCTTGTTGAGCATTTTTACAATGGATTGAATCAAACTTCCAAAGCAATAGTCAATGCATCAGCTAATGCTTTGTTGCT
TAAAAATACTTTCAATGAGGCAAATGAGATTTTAGACGCAATTGTTGCAAATAACAGTCAATGTGGAGAAGAAGAAACACCATCTAGAAGCAGTGTCAAGATCCAAGAAG
TCAATCATCTCGAGACACTATCGGAGCGGATGCTAACTATGAATGCCGCAAATGCAGCAATAGCCAATGCAAGTGCAACCCCTATCAACAACAAACTGGAGAATCTTATG
AGGGAACTAATGACCAAGAATGACGTAGTAATTCAAAGTCAAATAGCTTCCTTGAGGAGTTTAGAATCTCAAATTGGGCAGTTAGCCAAGGAAATGAGGAACAGACCAAT
AGGGACCCTACCAAGCAATACAAAGAACTCGAAAAAAGAAGGCCTAAGCAAGGATAAGGCCTCCATTTCCTAA
Protein sequenceShow/hide protein sequence
MSTPRGLPACLLVEHFYNGLNQTSKAIVNASANALLLKNTFNEANEILDAIVANNSQCGEEETPSRSSVKIQEVNHLETLSERMLTMNAANAAIANASATPINNKLENLM
RELMTKNDVVIQSQIASLRSLESQIGQLAKEMRNRPIGTLPSNTKNSKKEGLSKDKASIS