; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034027 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034027
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:3847518..3848276
RNA-Seq ExpressionLag0034027
SyntenyLag0034027
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]8.2e-2542.48Show/hide
Query:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW
        T S   P  P  PTSNP   +  P+PNP           P++  PL +KL D NY++WK QLLN ++A  +E  ++G+   PPRFLD  + Q NP F  W
Subjt:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW

Query:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS
        Q+YNR +MSWIY+S+ E  +G+I+G ++A +IWE L+ +Y ++S A +  LR+
Subjt:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]1.1e-2442.48Show/hide
Query:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW
        T S   P  P  PTSNP   +  P PNP           P++  PL +KL D NY++WK QLLN ++A  +E  ++G+   PPRFLD  + Q NP F  W
Subjt:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW

Query:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS
        Q+YNR +MSWIY+S+ E  +G+I+G ++A +IWE L+ +Y ++S A +  LR+
Subjt:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]2.6e-2343.79Show/hide
Query:  PGFQYPPTSSPGY--PFIPSYPTSNPFFPAPQPSPNPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQ
        P  Q PPT+ P      I + P        P P     P++  P  IKL   NYL+WKNQLLN I+A  +E  I+G+ P PPRF D A   VN  +  WQ
Subjt:  PGFQYPPTSSPGY--PFIPSYPTSNPFFPAPQPSPNPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQ

Query:  KYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ
        ++NR +MSWIY+SLT+  +G+I+G ++A+EIWE L  +Y SSS A++  LR++
Subjt:  KYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ

RVW56403.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.3e-2250.39Show/hide
Query:  PAPQPSPN----PFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIG
        P P  SPN     +P+L+ PL IKL ++N  LWKNQLLN I+A  +E  I G TP P +FLD A+ QVNPLF  W++ N  +MSWIY SLT   +G I+ 
Subjt:  PAPQPSPN----PFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIG

Query:  CSTAYEIWEHLKIVYESSSTARVMGLRSQ
          TA EIW  L  VY+S S   V+ L SQ
Subjt:  CSTAYEIWEHLKIVYESSSTARVMGLRSQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]9.9e-3150Show/hide
Query:  YPTSNPFFPAPQPSP---NPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTED
        +P   P F A  P+P   NPFPTL  PLN+KL D+N+LLWKNQLLN ++A  +   ++GT   PP+FLD  + Q NP +  W++YNR LM WIYSSL+E+
Subjt:  YPTSNPFFPAPQPSP---NPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTED

Query:  KIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ
        K+GE++   T ++IW  L  VY+S +TAR+MGL+++
Subjt:  KIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein1.3e-2343.79Show/hide
Query:  PGFQYPPTSSPGY--PFIPSYPTSNPFFPAPQPSPNPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQ
        P  Q PPT+ P      I + P        P P     P++  P  IKL   NYL+WKNQLLN I+A  +E  I+G+ P PPRF D A   VN  +  WQ
Subjt:  PGFQYPPTSSPGY--PFIPSYPTSNPFFPAPQPSPNPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQ

Query:  KYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ
        ++NR +MSWIY+SLT+  +G+I+G ++A+EIWE L  +Y SSS A++  LR++
Subjt:  KYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ

A0A438F8X3 Retrovirus-related Pol polyprotein from transposon RE16.3e-2350.39Show/hide
Query:  PAPQPSPN----PFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIG
        P P  SPN     +P+L+ PL IKL ++N  LWKNQLLN I+A  +E  I G TP P +FLD A+ QVNPLF  W++ N  +MSWIY SLT   +G I+ 
Subjt:  PAPQPSPN----PFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIG

Query:  CSTAYEIWEHLKIVYESSSTARVMGLRSQ
          TA EIW  L  VY+S S   V+ L SQ
Subjt:  CSTAYEIWEHLKIVYESSSTARVMGLRSQ

A0A6J1DQX7 uncharacterized protein LOC1110223154.8e-3150Show/hide
Query:  YPTSNPFFPAPQPSP---NPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTED
        +P   P F A  P+P   NPFPTL  PLN+KL D+N+LLWKNQLLN ++A  +   ++GT   PP+FLD  + Q NP +  W++YNR LM WIYSSL+E+
Subjt:  YPTSNPFFPAPQPSP---NPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTED

Query:  KIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ
        K+GE++   T ++IW  L  VY+S +TAR+MGL+++
Subjt:  KIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRSQ

A0A7J0EGI5 Uncharacterized protein3.9e-2542.48Show/hide
Query:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW
        T S   P  P  PTSNP   +  P+PNP           P++  PL +KL D NY++WK QLLN ++A  +E  ++G+   PPRFLD  + Q NP F  W
Subjt:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW

Query:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS
        Q+YNR +MSWIY+S+ E  +G+I+G ++A +IWE L+ +Y ++S A +  LR+
Subjt:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS

A0A7J0GPN0 UBX domain-containing protein5.2e-2542.48Show/hide
Query:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW
        T S   P  P  PTSNP   +  P PNP           P++  PL +KL D NY++WK QLLN ++A  +E  ++G+   PPRFLD  + Q NP F  W
Subjt:  TSSPGYPFIPSYPTSNPFFPAPQPSPNP----------FPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGT-PAPPRFLDTAETQVNPLFPIW

Query:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS
        Q+YNR +MSWIY+S+ E  +G+I+G ++A +IWE L+ +Y ++S A +  LR+
Subjt:  QKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTARVMGLRS

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.2e-0828.97Show/hide
Query:  KLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDT-AETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTAR
        KLT +NYL+W  Q+      +++   ++G T  PP  + T A  +VNP +  W++ ++ + S +  +++      +   +TA +IWE L+ +Y + S   
Subjt:  KLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDT-AETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTAR

Query:  VMGLRSQ
        V  LR+Q
Subjt:  VMGLRSQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-0830.48Show/hide
Query:  KLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDT-AETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTAR
        KLT +NYL+W  Q+      +++   ++G TP PP  + T A  +VNP +  W++ ++ + S I  +++      +   +TA +IWE L+ +Y + S   
Subjt:  KLTDSNYLLWKNQLLNHILAFDMESLING-TPAPPRFLDT-AETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAYEIWEHLKIVYESSSTAR

Query:  VMGLR
        V  LR
Subjt:  VMGLR

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.7e-0830Show/hide
Query:  PLNIKLTDSNYLLWKNQLLNHILAFDMESLINGTPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKI-GEIIGCSTAYEIWEHLKIVYESSS
        P+ + + +SNY  W+   L H L+FD+   I+GT      L T    VN     WQK +  +   +Y +LT  +  G  +  ST+ +IW  +K  + ++ 
Subjt:  PLNIKLTDSNYLLWKNQLLNHILAFDMESLINGTPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKI-GEIIGCSTAYEIWEHLKIVYESSS

Query:  TARVMGLRSQ
         AR + L S+
Subjt:  TARVMGLRSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCAACCTCAACCTTATCTTCCTCCGGCCAAGACACTGTTCCTGCTCCTCCTACTACCTCAACCGTAATCACAGTTCCCTCCCTCATTTCCACCTCCATCGTCAC
CACCCCTGTTTCAACGCCTGGCCCTTCTCATCGATCTCAACTCAGAGGTTCCACTCCCGTTTCCAATACCAATACTAGGCCCTTAAACCCTAACAATCCTCCCTATCAAT
CACATTTCCAATCCAACCCAATTTCATCATCTTTTCCTTTTCCAAATGCAGCGCCTCAACCCGGTTTTCAGTACCCGCCTACGTCTTCCCCTGGTTATCCTTTCATTCCT
TCTTACCCTACTTCAAACCCTTTTTTTCCTGCTCCGCAACCTTCACCCAATCCTTTTCCCACCCTCACTCCACCTCTCAATATAAAGCTCACAGACTCAAATTATCTCCT
TTGGAAGAATCAATTGCTCAACCACATCCTTGCCTTTGACATGGAAAGTCTAATCAACGGTACGCCTGCTCCTCCTAGATTTCTGGATACTGCAGAAACTCAGGTAAACC
CTCTTTTTCCTATTTGGCAGAAGTATAATCGCACGTTAATGAGCTGGATTTACTCTTCACTGACTGAGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTATGAA
ATTTGGGAGCATCTTAAAATTGTTTATGAATCGTCTTCCACTGCTCGTGTTATGGGGTTAAGGTCTCAATACCATTGTCAATATGGTATGCGATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCAACCTCAACCTTATCTTCCTCCGGCCAAGACACTGTTCCTGCTCCTCCTACTACCTCAACCGTAATCACAGTTCCCTCCCTCATTTCCACCTCCATCGTCAC
CACCCCTGTTTCAACGCCTGGCCCTTCTCATCGATCTCAACTCAGAGGTTCCACTCCCGTTTCCAATACCAATACTAGGCCCTTAAACCCTAACAATCCTCCCTATCAAT
CACATTTCCAATCCAACCCAATTTCATCATCTTTTCCTTTTCCAAATGCAGCGCCTCAACCCGGTTTTCAGTACCCGCCTACGTCTTCCCCTGGTTATCCTTTCATTCCT
TCTTACCCTACTTCAAACCCTTTTTTTCCTGCTCCGCAACCTTCACCCAATCCTTTTCCCACCCTCACTCCACCTCTCAATATAAAGCTCACAGACTCAAATTATCTCCT
TTGGAAGAATCAATTGCTCAACCACATCCTTGCCTTTGACATGGAAAGTCTAATCAACGGTACGCCTGCTCCTCCTAGATTTCTGGATACTGCAGAAACTCAGGTAAACC
CTCTTTTTCCTATTTGGCAGAAGTATAATCGCACGTTAATGAGCTGGATTTACTCTTCACTGACTGAGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTATGAA
ATTTGGGAGCATCTTAAAATTGTTTATGAATCGTCTTCCACTGCTCGTGTTATGGGGTTAAGGTCTCAATACCATTGTCAATATGGTATGCGATGTTGA
Protein sequenceShow/hide protein sequence
MASTSTLSSSGQDTVPAPPTTSTVITVPSLISTSIVTTPVSTPGPSHRSQLRGSTPVSNTNTRPLNPNNPPYQSHFQSNPISSSFPFPNAAPQPGFQYPPTSSPGYPFIP
SYPTSNPFFPAPQPSPNPFPTLTPPLNIKLTDSNYLLWKNQLLNHILAFDMESLINGTPAPPRFLDTAETQVNPLFPIWQKYNRTLMSWIYSSLTEDKIGEIIGCSTAYE
IWEHLKIVYESSSTARVMGLRSQYHCQYGMRC