; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028823 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028823
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:31451112..31455520
RNA-Seq ExpressionLag0028823
SyntenyLag0028823
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67902.1 hypothetical protein VITISV_037907 [Vitis vinifera]8.4e-2026.3Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM------------------KVSTAAY---
        VL+GN++HC + G G+VR+K Y  +E+VL DVR++P+LK+NL SLG LD+  ++ KLE   LR+ +GSL+ M                  KVST      
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM------------------KVSTAAY---

Query:  -----------HIN-------------------------------------------------------------------RSPSVALDMKTPQEVWLGK
                   HI+                                                                   RSPS AL  KTPQE W+GK
Subjt:  -----------HIN-------------------------------------------------------------------RSPSVALDMKTPQEVWLGK

Query:  PPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTH--T
          N  HL                                KGYKLW    G+ +CIISRDV F+E DM  +        K+V+      FE+E EN     
Subjt:  PPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTH--T

Query:  VGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAAYNEMVVAPSNEQLSDAVRE
              ++V         D+ T+    +SL   ++   + D     Y E + +    Q   A++E
Subjt:  VGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAAYNEMVVAPSNEQLSDAVRE

CAN69199.1 hypothetical protein VITISV_025494 [Vitis vinifera]3.4e-2132.89Show/hide
Query:  KVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM----------------------KVSTA
        K+L+GN+  C V GIG+V + ++ GM + L +VR  PDLKRNL  L  LD+ G++ K+E G L I K   +                         V T 
Subjt:  KVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM----------------------KVSTA

Query:  AYHINRSPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS
         Y +NR+PS A+D+KT +E+W GKP N  HL                                KGYKLW  +    + IISRDV F+E +M +  + +  
Subjt:  AYHINRSPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS

Query:  FKKFVDNVSSNC------FEIELEN
         KK   N S         FE+EL +
Subjt:  FKKFVDNVSSNC------FEIELEN

CCI55401.1 PH01B015M02.2 [Phyllostachys edulis]6.4e-2036.02Show/hide
Query:  CLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK----------VSTAAYHINRSPSVALDMKTPQEVW
        C V GIGS+R+K++    + L +V+++PD+KRNL SL  LD +G+     GGVL++ KGSL+ MK           STA Y INRSPS+A++ KTP E+W
Subjt:  CLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK----------VSTAAYHINRSPSVALDMKTPQEVW

Query:  LGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWE-LSGSVSFKKFVD
             N S L                                KGYKLW   P   + +ISR + F+E  M  + LS +V  +K ++
Subjt:  LGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWE-LSGSVSFKKFVD

KAE8692398.1 hypothetical protein F3Y22_tig00110839pilonHSYRG00037 [Hibiscus syriacus]3.4e-2128.42Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------
        VL+G+++ C V G G++R++++ G E++L  VR++P+LKRNL S G L+  G+S   E G +R+ KGS++ MK                           
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------

Query:  VSTAAYHIN----------------------------------RSPSVALDMKTPQEVWLGKPPNLSHL-------------------------------
          T  +H+                                   R PS A+ MKTP E+W GKP N ++L                               
Subjt:  VSTAAYHIN----------------------------------RSPSVALDMKTPQEVWLGKPPNLSHL-------------------------------

Query:  -KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVE
         KGYKLW ++PG+ +CIISRDV FDE  M   L   V+     +NV+ +   IE+E       L +Q    ++   ++DQ   VE
Subjt:  -KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVE

RVX00023.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.3e-2430.03Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------
        VL+GN++HC + G G+VR+K Y G+E+VL DVR++P+LKRNL SLG LDK G++ K E   LR+ +GSL  MK                           
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------

Query:  ------------------------VSTAAYHINR----SPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKL
                                 S      NR    SPS AL  KTPQE W GK  +  HL                                KGYKL
Subjt:  ------------------------VSTAAYHINR----SPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKL

Query:  WFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAA
        W    G+G+CIISRDV F+E DM  +        K V+ +    FE+E E      S    S    + + + D++ +++  W+   I V   + D     
Subjt:  WFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAA

Query:  YNEMVVAPSNEQLSDAVREVGVE
        Y E +    N   +D   +VG +
Subjt:  YNEMVVAPSNEQLSDAVREVGVE

TrEMBL top hitse value%identityAlignment
A0A2N9J7Q6 Uncharacterized protein1.8e-2030.73Show/hide
Query:  RNLKVNKVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKG------------SLLEMKVSTAAYHI
        +++    V MGND  C + G+G++++K+  G+ + L +VR +PD+++NL SLG LD +G+S K E G++++ KG               E  V  A Y I
Subjt:  RNLKVNKVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKG------------SLLEMKVSTAAYHI

Query:  NRSPSVALDMKTPQEVWLGKPPNLSHL-----------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS-
        NRSP VALD K  +EVW G+  + S +                                   KGYKLW   P   + +I+RDV FDE  M        S 
Subjt:  NRSPSVALDMKTPQEVWLGKPPNLSHL-----------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS-

Query:  FKKFVDNVSSNCFEIELE
          +  +N+  +  ++EL+
Subjt:  FKKFVDNVSSNCFEIELE

A0A438ITF4 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-2430.03Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------
        VL+GN++HC + G G+VR+K Y G+E+VL DVR++P+LKRNL SLG LDK G++ K E   LR+ +GSL  MK                           
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------

Query:  ------------------------VSTAAYHINR----SPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKL
                                 S      NR    SPS AL  KTPQE W GK  +  HL                                KGYKL
Subjt:  ------------------------VSTAAYHINR----SPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKL

Query:  WFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAA
        W    G+G+CIISRDV F+E DM  +        K V+ +    FE+E E      S    S    + + + D++ +++  W+   I V   + D     
Subjt:  WFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAA

Query:  YNEMVVAPSNEQLSDAVREVGVE
        Y E +    N   +D   +VG +
Subjt:  YNEMVVAPSNEQLSDAVREVGVE

A0A6A2ZKG6 CCHC-type domain-containing protein1.6e-2128.42Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------
        VL+G+++ C V G G++R++++ G E++L  VR++P+LKRNL S G L+  G+S   E G +R+ KGS++ MK                           
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK---------------------------

Query:  VSTAAYHIN----------------------------------RSPSVALDMKTPQEVWLGKPPNLSHL-------------------------------
          T  +H+                                   R PS A+ MKTP E+W GKP N ++L                               
Subjt:  VSTAAYHIN----------------------------------RSPSVALDMKTPQEVWLGKPPNLSHL-------------------------------

Query:  -KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVE
         KGYKLW ++PG+ +CIISRDV FDE  M   L   V+     +NV+ +   IE+E       L +Q    ++   ++DQ   VE
Subjt:  -KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVE

A5AM57 Uncharacterized protein1.6e-2132.89Show/hide
Query:  KVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM----------------------KVSTA
        K+L+GN+  C V GIG+V + ++ GM + L +VR  PDLKRNL  L  LD+ G++ K+E G L I K   +                         V T 
Subjt:  KVLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEM----------------------KVSTA

Query:  AYHINRSPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS
         Y +NR+PS A+D+KT +E+W GKP N  HL                                KGYKLW  +    + IISRDV F+E +M +  + +  
Subjt:  AYHINRSPSVALDMKTPQEVWLGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWELSGSVS

Query:  FKKFVDNVSSNC------FEIELEN
         KK   N S         FE+EL +
Subjt:  FKKFVDNVSSNC------FEIELEN

L0P1Q9 Gag-Pol-p1993.1e-2036.02Show/hide
Query:  CLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK----------VSTAAYHINRSPSVALDMKTPQEVW
        C V GIGS+R+K++    + L +V+++PD+KRNL SL  LD +G+     GGVL++ KGSL+ MK           STA Y INRSPS+A++ KTP E+W
Subjt:  CLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK----------VSTAAYHINRSPSVALDMKTPQEVW

Query:  LGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWE-LSGSVSFKKFVD
             N S L                                KGYKLW   P   + +ISR + F+E  M  + LS +V  +K ++
Subjt:  LGKPPNLSHL--------------------------------KGYKLWFLQPGEGRCIISRDVKFDEFDMPWE-LSGSVSFKKFVD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-0541.1Show/hide
Query:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK
        V MGN  +  + GIG + +K   G   VL DVR VPDL+ NL S   LD+ G+         R+ KGSL+  K
Subjt:  VLMGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGACCATGGTTGAAGAAAGTGACACTGTCCGAGTGGAAGCTCCTAAAAAACAGAAGGATGGAAAAAAGAGCAAAAAGATGAAGAAGACCATGGTTGAAGAAAG
TGACACTGTCCGAGTGGAAGCTCCTAAAAAACAGAAGGGTGGAAAAAAGAGCAAAAAGATGAAAAGACCGTGGTTGAAGAAAGTGACACTGTCCGAGTGCAAAGTAAATA
CATGCATGTCTGTACCGGTTTTGGGCTTATTTGTCTGTGCCACTAATACATTCATAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGTAGAGACGCT
TTCAGAAACATGTGCTTTGGCCACTTGCTTAACTTCTCGTTCAAAAAGACGTCTTCGCAATTACTATTACACCTGATCCAGCATCAGTGCAAGCCCAAACGGACGTCGGA
ACTTTACTTCAAGATTGGTGGAAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCATTACTTTATGTCGCCATAAAAAAGAAGTTTACCCCGATTTTTGCTGTGAGAATAA
AACGGGATCTGGTTAATGATATTGAAAAGCCATTGTGGACTAAGTTGGAGTCACTTTATTTAAATATTACTTTCAAGATTTCGAGAAACTTGAAGGTTAACAAGGTTTTG
ATGGGCAATGATCAACATTGTCTTGTGAAGGGTATAGGATCTGTCAGGTTGAAACTTTATGGTGGAATGGAGAAGGTTCTGTTTGATGTTCGGTTTGTTCCAGATTTAAA
GAGAAATTTATTTTCTCTTGGGCAACTTGATAAAAGAGGTTTCAGTTGCAAGTTAGAGGGAGGTGTTCTTCGAATTTATAAAGGTTCACTTCTTGAAATGAAAGTTTCGA
CAGCAGCGTATCACATAAATAGAAGTCCTTCAGTTGCCTTGGATATGAAAACTCCTCAAGAAGTTTGGTTAGGAAAGCCTCCCAATCTTAGTCATTTGAAGGGTTATAAG
CTATGGTTCCTTCAACCAGGTGAGGGACGATGTATTATCAGTAGAGATGTAAAATTTGATGAATTTGACATGCCTTGGGAGCTATCAGGTAGTGTGAGTTTTAAGAAATT
TGTTGATAACGTGTCTTCAAACTGTTTTGAGATTGAGTTAGAAAACACACACACTGTCGGATCTTTAGGTGAGCAATCTGTTTCTAATAATGAAGCTGTAAGGGTTGATG
ATCAGTTAACTAGAGTAGAGGTCGACTGGAGTCTTGGTGCTATCAATGTACTATACCACCTTCAAGATTTCCCCCACGCAGCATATAATGAGATGGTTGTGGCGCCATCG
AATGAGCAGCTGAGTGATGCCGTGAGGGAAGTCGGTGTTGAAGGGGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGACCATGGTTGAAGAAAGTGACACTGTCCGAGTGGAAGCTCCTAAAAAACAGAAGGATGGAAAAAAGAGCAAAAAGATGAAGAAGACCATGGTTGAAGAAAG
TGACACTGTCCGAGTGGAAGCTCCTAAAAAACAGAAGGGTGGAAAAAAGAGCAAAAAGATGAAAAGACCGTGGTTGAAGAAAGTGACACTGTCCGAGTGCAAAGTAAATA
CATGCATGTCTGTACCGGTTTTGGGCTTATTTGTCTGTGCCACTAATACATTCATAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGTAGAGACGCT
TTCAGAAACATGTGCTTTGGCCACTTGCTTAACTTCTCGTTCAAAAAGACGTCTTCGCAATTACTATTACACCTGATCCAGCATCAGTGCAAGCCCAAACGGACGTCGGA
ACTTTACTTCAAGATTGGTGGAAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCATTACTTTATGTCGCCATAAAAAAGAAGTTTACCCCGATTTTTGCTGTGAGAATAA
AACGGGATCTGGTTAATGATATTGAAAAGCCATTGTGGACTAAGTTGGAGTCACTTTATTTAAATATTACTTTCAAGATTTCGAGAAACTTGAAGGTTAACAAGGTTTTG
ATGGGCAATGATCAACATTGTCTTGTGAAGGGTATAGGATCTGTCAGGTTGAAACTTTATGGTGGAATGGAGAAGGTTCTGTTTGATGTTCGGTTTGTTCCAGATTTAAA
GAGAAATTTATTTTCTCTTGGGCAACTTGATAAAAGAGGTTTCAGTTGCAAGTTAGAGGGAGGTGTTCTTCGAATTTATAAAGGTTCACTTCTTGAAATGAAAGTTTCGA
CAGCAGCGTATCACATAAATAGAAGTCCTTCAGTTGCCTTGGATATGAAAACTCCTCAAGAAGTTTGGTTAGGAAAGCCTCCCAATCTTAGTCATTTGAAGGGTTATAAG
CTATGGTTCCTTCAACCAGGTGAGGGACGATGTATTATCAGTAGAGATGTAAAATTTGATGAATTTGACATGCCTTGGGAGCTATCAGGTAGTGTGAGTTTTAAGAAATT
TGTTGATAACGTGTCTTCAAACTGTTTTGAGATTGAGTTAGAAAACACACACACTGTCGGATCTTTAGGTGAGCAATCTGTTTCTAATAATGAAGCTGTAAGGGTTGATG
ATCAGTTAACTAGAGTAGAGGTCGACTGGAGTCTTGGTGCTATCAATGTACTATACCACCTTCAAGATTTCCCCCACGCAGCATATAATGAGATGGTTGTGGCGCCATCG
AATGAGCAGCTGAGTGATGCCGTGAGGGAAGTCGGTGTTGAAGGGGCGTAG
Protein sequenceShow/hide protein sequence
MKKTMVEESDTVRVEAPKKQKDGKKSKKMKKTMVEESDTVRVEAPKKQKGGKKSKKMKRPWLKKVTLSECKVNTCMSVPVLGLFVCATNTFITEIMDTINNILGDRCRDA
FRNMCFGHLLNFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALLYVAIKKKFTPIFAVRIKRDLVNDIEKPLWTKLESLYLNITFKISRNLKVNKVL
MGNDQHCLVKGIGSVRLKLYGGMEKVLFDVRFVPDLKRNLFSLGQLDKRGFSCKLEGGVLRIYKGSLLEMKVSTAAYHINRSPSVALDMKTPQEVWLGKPPNLSHLKGYK
LWFLQPGEGRCIISRDVKFDEFDMPWELSGSVSFKKFVDNVSSNCFEIELENTHTVGSLGEQSVSNNEAVRVDDQLTRVEVDWSLGAINVLYHLQDFPHAAYNEMVVAPS
NEQLSDAVREVGVEGA