; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025475 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025475
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr10:13560569..13561559
RNA-Seq ExpressionLag0025475
SyntenyLag0025475
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]4.6e-3338.27Show/hide
Query:  SLVSTSVVTSPYPPHSATAYPFVPPFQASAPFFPSPQ-----PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKT
        S  + S  T P PP      P   P  +S+   P+PQ     P  N  PS+  PL +KL   NY++WK QLLN ++A  +E  ++G+   PP+FLD  + 
Subjt:  SLVSTSVVTSPYPPHSATAYPFVPPFQASAPFFPSPQ-----PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKT

Query:  Q---------------------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDH
        Q                           +G+I+G ++A +IWE L  +Y ++S A +  LR+ LQ I+K+GLT    + K + + N  ++IGEP++Y DH
Subjt:  Q---------------------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDH

Query:  LGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLE
        L Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++Y+ARLE
Subjt:  LGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLE

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]7.6e-3644.61Show/hide
Query:  PQPATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAK-------------------------TQ--IGEIIGCSTAFE
        P P     PS+  P  IKL   NYL+WKNQLLN I+A  +E  I+G+ P PP+F D A+                         TQ  +G+I+G ++AFE
Subjt:  PQPATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAK-------------------------TQ--IGEIIGCSTAFE

Query:  IWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEA
        IWE L  +Y SSS A+I  LR++LQ +RKDGLT  + + K K+I N  +A+GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLL++YE 
Subjt:  IWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEA

Query:  RLEN
        RLE+
Subjt:  RLEN

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.0e-3245.15Show/hide
Query:  PSPQ--PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLIN-GTPAPPKFLDTAKTQ---------------------------IGEIIGCS
        P+PQ    T P PSL+  L+IKL  +N LL K+QLLN I+A  +E  I+    +PPK+LD A  Q                           +G+I+  S
Subjt:  PSPQ--PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLIN-GTPAPPKFLDTAKTQ---------------------------IGEIIGCS

Query:  TAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLI
        TA +IW  L   YES S A +M L SQLQ+I+K  + +S+ L+++K + ++F+ IGEPLSYRD L  ILEGL  EY+ FVTSI NR+DRPSL +V SLL 
Subjt:  TAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLI

Query:  AYEARL
         YE RL
Subjt:  AYEARL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]2.3e-4849.07Show/hide
Query:  FPSPQP----------ATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLD---------------------------TAKT
        FP P P          + NPFP+L  PLN+KL  +N+LLWKNQLLN ++A  +   ++GT   PP+FLD                            ++ 
Subjt:  FPSPQP----------ATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLD---------------------------TAKT

Query:  QIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS
        ++GE++   T  +IW  L  VY+S +TARIMGL+++LQ +RKDG +VSQ LAKIK+IA++F+A+GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PS
Subjt:  QIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS

Query:  LADVRSLLIAYEARLE
        L DVRSLL+AYEARL+
Subjt:  LADVRSLLIAYEARLE

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]5.1e-4073.91Show/hide
Query:  IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL
        +GEI+G  +AF+IWE LR VYESSS A IMG  SQLQKI+KDGLTVSQ LA+IKD+ + F+AIGEPLSYRDHL YILEGLG+EYNPFV+SI NRT+RPS+
Subjt:  IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL

Query:  ADVRSLLIAYEARLE
        ADVR+LLI Y++RLE
Subjt:  ADVRSLLIAYEARLE

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein3.7e-3644.61Show/hide
Query:  PQPATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAK-------------------------TQ--IGEIIGCSTAFE
        P P     PS+  P  IKL   NYL+WKNQLLN I+A  +E  I+G+ P PP+F D A+                         TQ  +G+I+G ++AFE
Subjt:  PQPATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAK-------------------------TQ--IGEIIGCSTAFE

Query:  IWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEA
        IWE L  +Y SSS A+I  LR++LQ +RKDGLT  + + K K+I N  +A+GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLL++YE 
Subjt:  IWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEA

Query:  RLEN
        RLE+
Subjt:  RLEN

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.5e-3245.15Show/hide
Query:  PSPQ--PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLIN-GTPAPPKFLDTAKTQ---------------------------IGEIIGCS
        P+PQ    T P PSL+  L+IKL  +N LL K+QLLN I+A  +E  I+    +PPK+LD A  Q                           +G+I+  S
Subjt:  PSPQ--PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLIN-GTPAPPKFLDTAKTQ---------------------------IGEIIGCS

Query:  TAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLI
        TA +IW  L   YES S A +M L SQLQ+I+K  + +S+ L+++K + ++F+ IGEPLSYRD L  ILEGL  EY+ FVTSI NR+DRPSL +V SLL 
Subjt:  TAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLI

Query:  AYEARL
         YE RL
Subjt:  AYEARL

A0A6J1DQX7 uncharacterized protein LOC1110223151.1e-4849.07Show/hide
Query:  FPSPQP----------ATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLD---------------------------TAKT
        FP P P          + NPFP+L  PLN+KL  +N+LLWKNQLLN ++A  +   ++GT   PP+FLD                            ++ 
Subjt:  FPSPQP----------ATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLD---------------------------TAKT

Query:  QIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS
        ++GE++   T  +IW  L  VY+S +TARIMGL+++LQ +RKDG +VSQ LAKIK+IA++F+A+GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PS
Subjt:  QIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS

Query:  LADVRSLLIAYEARLE
        L DVRSLL+AYEARL+
Subjt:  LADVRSLLIAYEARLE

A0A7J0EGI5 Uncharacterized protein2.2e-3338.27Show/hide
Query:  SLVSTSVVTSPYPPHSATAYPFVPPFQASAPFFPSPQ-----PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKT
        S  + S  T P PP      P   P  +S+   P+PQ     P  N  PS+  PL +KL   NY++WK QLLN ++A  +E  ++G+   PP+FLD  + 
Subjt:  SLVSTSVVTSPYPPHSATAYPFVPPFQASAPFFPSPQ-----PATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKT

Query:  Q---------------------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDH
        Q                           +G+I+G ++A +IWE L  +Y ++S A +  LR+ LQ I+K+GLT    + K + + N  ++IGEP++Y DH
Subjt:  Q---------------------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDH

Query:  LGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLE
        L Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++Y+ARLE
Subjt:  LGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLE

A0A7J0GPN0 UBX domain-containing protein1.3e-2836.04Show/hide
Query:  ATAYPFVPPFQASAPFFPSPQPATNP----------FPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKTQ----------
        +++ P  PP   S P   S  P  NP           PS+  PL +KL   NY++WK QLLN ++A  +E  ++G+   PP+FLD  + Q          
Subjt:  ATAYPFVPPFQASAPFFPSPQPATNP----------FPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT-PAPPKFLDTAKTQ----------

Query:  -----------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTE
                         +G+I+G ++A +IWE L  +Y ++S A +  LR+ LQ I+K+GLT    + K + + N  ++IGEP++Y DHL Y L GLG +
Subjt:  -----------------IGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTE

Query:  YNPFVTSIQNRTDRPSLADVRS
        YNPFVTSIQ++  RPS+ +  S
Subjt:  YNPFVTSIQNRTDRPSLADVRS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-1123.66Show/hide
Query:  KLTHSNYLLWKNQLLNHILAFDMESLING-TPAPPKFLDT----------------AKTQIGEIIG------------CSTAFEIWEHLRIVYESSSTAR
        KLT +NYL+W  Q+      +++   ++G T  PP  + T                 K     ++G             +TA +IWE LR +Y + S   
Subjt:  KLTHSNYLLWKNQLLNHILAFDMESLING-TPAPPKFLDT----------------AKTQIGEIIG------------CSTAFEIWEHLRIVYESSSTAR

Query:  IMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARL
        +  LR+QL++  K   T+   +  +    +Q + +G+P+ + + +  +LE L  EY P +  I  +   P+L ++   L+ +E+++
Subjt:  IMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARL

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-1325Show/hide
Query:  PLNIKLTHSNYLLWKNQLLNHILAFDMESLINGTPAPPKFLD------------------TAKTQIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQ
        P+ + +  SNY  W+   L H L+FD+   I+GT  P    D                  T K   G  +  ST+ +IW  ++  + ++  AR + L S+
Subjt:  PLNIKLTHSNYLLWKNQLLNHILAFDMESLINGTPAPPKFLD------------------TAKTQIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQ

Query:  LQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLENNLR
        L+      + V+    K+K +A+    +  P++ R+ + Y+L GL  +++  +  I++R   PS  D  ++L   E RL+  ++
Subjt:  LQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLENNLR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.3e-1123.24Show/hide
Query:  LNIKLTHSNYLLWKNQLLNHILAFDMESLINGTPAPPKFLDTA-KTQIGEI------------------IGCSTAFEIWEHLRIVYESSSTARIMGLRSQ
        + + L   NY +W+       L+F +   I+G+  P    +   K + G +                  +GC TA ++W  L  ++  +  AR +   ++
Subjt:  LNIKLTHSNYLLWKNQLLNHILAFDMESLINGTPAPPKFLDTA-KTQIGEI------------------IGCSTAFEIWEHLRIVYESSSTARIMGLRSQ

Query:  LQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLENNLRS
        L+    D L+V +   K+K +++  + +  P+S R  + ++L GL  +Y+  +  I++++  PS  + RS+L+  E+RL N  +S
Subjt:  LQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLADVRSLLIAYEARLENNLRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAAGCTCTAGCTCACTCTTTTCCGGTCAGGACATCGCCCCGACTGCTCCCACCACCTCGACAGTATTATCAATCCCTTCCCTTGTCTCTACTTCGGTTGTCAC
CTCCCCTTACCCTCCCCACTCTGCTACCGCTTATCCTTTTGTCCCTCCTTTTCAAGCTTCTGCCCCCTTTTTCCCAAGTCCCCAACCTGCAACAAATCCTTTTCCCTCCC
TTACCCCACCCCTCAACATCAAATTGACACATTCCAACTACCTCCTCTGGAAGAACCAACTGTTGAACCACATTCTCGCCTTCGATATGGAGAGTCTTATAAACGGTACC
CCTGCTCCTCCTAAGTTCTTAGACACTGCGAAAACTCAGATAGGTGAAATTATTGGCTGTTCTACTGCTTTTGAGATTTGGGAGCATCTTAGAATAGTGTATGAATCATC
GTCCACTGCTCGCATTATGGGGTTACGGTCTCAGTTACAGAAAATACGCAAAGATGGACTCACAGTTTCTCAAATTCTAGCTAAGATAAAGGACATAGCTAATCAGTTCT
CGGCCATCGGTGAGCCATTATCCTACAGGGATCACCTCGGCTACATTCTTGAGGGATTAGGAACAGAATATAACCCATTTGTTACCTCCATTCAAAATAGAACTGACCGT
CCCTCTCTTGCTGATGTCCGAAGCTTATTGATTGCTTATGAGGCTCGTCTTGAAAACAATCTTCGGTCGATCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAAGCTCTAGCTCACTCTTTTCCGGTCAGGACATCGCCCCGACTGCTCCCACCACCTCGACAGTATTATCAATCCCTTCCCTTGTCTCTACTTCGGTTGTCAC
CTCCCCTTACCCTCCCCACTCTGCTACCGCTTATCCTTTTGTCCCTCCTTTTCAAGCTTCTGCCCCCTTTTTCCCAAGTCCCCAACCTGCAACAAATCCTTTTCCCTCCC
TTACCCCACCCCTCAACATCAAATTGACACATTCCAACTACCTCCTCTGGAAGAACCAACTGTTGAACCACATTCTCGCCTTCGATATGGAGAGTCTTATAAACGGTACC
CCTGCTCCTCCTAAGTTCTTAGACACTGCGAAAACTCAGATAGGTGAAATTATTGGCTGTTCTACTGCTTTTGAGATTTGGGAGCATCTTAGAATAGTGTATGAATCATC
GTCCACTGCTCGCATTATGGGGTTACGGTCTCAGTTACAGAAAATACGCAAAGATGGACTCACAGTTTCTCAAATTCTAGCTAAGATAAAGGACATAGCTAATCAGTTCT
CGGCCATCGGTGAGCCATTATCCTACAGGGATCACCTCGGCTACATTCTTGAGGGATTAGGAACAGAATATAACCCATTTGTTACCTCCATTCAAAATAGAACTGACCGT
CCCTCTCTTGCTGATGTCCGAAGCTTATTGATTGCTTATGAGGCTCGTCTTGAAAACAATCTTCGGTCGATCAATTGA
Protein sequenceShow/hide protein sequence
MASSSSSLFSGQDIAPTAPTTSTVLSIPSLVSTSVVTSPYPPHSATAYPFVPPFQASAPFFPSPQPATNPFPSLTPPLNIKLTHSNYLLWKNQLLNHILAFDMESLINGT
PAPPKFLDTAKTQIGEIIGCSTAFEIWEHLRIVYESSSTARIMGLRSQLQKIRKDGLTVSQILAKIKDIANQFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDR
PSLADVRSLLIAYEARLENNLRSIN