; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023211 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023211
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:45987488..45987953
RNA-Seq ExpressionLag0023211
SyntenyLag0023211
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa]7.3e-1946.22Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GLT   Y+ K + + +  ++IGEP++Y DHL + L GLG DYN FVT+I ++A  PS+E+V SLLL+YDARLERQ++ D L+  QANL+ L  Q      
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTSPSS
          +P+F + S +SF + +S
Subjt:  SPRPQFNHYSKSSFTSPSS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]6.2e-2664Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GL+VSQYLAKIK++  K S+IGEPIS +DH+++I++GLG +YN+FVT+I NR+D  +LEDVR+LLLAYD RLE+QNSVDQLN+ QAN++ L   Q NR+S
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.0e-3667.5Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        G +VSQYLAKIK++ADKF+A+GEP+SYRDHLAH+LDGLGS+YN+FVT+I NRAD+PSLEDVRSLLLAY+ARL++QN+VDQLN+AQANL  L++Q  ++R 
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTSPSSP
         P+  F ++ K SF  P+SP
Subjt:  SPRPQFNHYSKSSFTSPSSP

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]1.5e-2757.76Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GLTVSQYLA+IKDV D F+AIGEP+SYRDHL++IL+GLGS+YN FV++I NR + PS+ DVR+LL+ YD+RLE+Q + D L L QAN++ L+I   NR  
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTS
           PQ+  +++SS  S
Subjt:  SPRPQFNHYSKSSFTS

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]1.2e-2964.22Show/hide
Query:  LTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRSS
        L++SQYL++IKDVADKFS +GE ISYRDHL HILDGLGS+YN+FVT+I N  DN S+EDV SLLL+Y+A+LE+QN++D LN+AQA LS L+ Q  ++R++
Subjt:  LTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRSS

Query:  PRPQFNHYS
         RP  NH S
Subjt:  PRPQFNHYS

TrEMBL top hitse value%identityAlignment
A0A6J1D6N7 uncharacterized protein LOC1110174383.0e-2664Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GL+VSQYLAKIK++  K S+IGEPIS +DH+++I++GLG +YN+FVT+I NR+D  +LEDVR+LLLAYD RLE+QNSVDQLN+ QAN++ L   Q NR+S
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

A0A6J1DQX7 uncharacterized protein LOC1110223151.4e-3667.5Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        G +VSQYLAKIK++ADKF+A+GEP+SYRDHLAH+LDGLGS+YN+FVT+I NRAD+PSLEDVRSLLLAY+ARL++QN+VDQLN+AQANL  L++Q  ++R 
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTSPSSP
         P+  F ++ K SF  P+SP
Subjt:  SPRPQFNHYSKSSFTSPSSP

A0A7J0DER3 Uncharacterized protein3.5e-1946.22Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GLT   Y+ K + + +  ++IGEP++Y DHL + L GLG DYN FVT+I ++A  PS+E+V SLLL+YDARLERQ++ D L+  QANL+ L  Q      
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTSPSS
          +P+F + S +SF + +S
Subjt:  SPRPQFNHYSKSSFTSPSS

A0A7J0E8R3 Uncharacterized protein3.5e-1946.22Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        GLT   Y+ K + + +  ++IGEP++Y DHL + L GLG DYN FVT+I ++A  PS+E+V SLLL+YDARLERQ++ D L+  QANL+ L  Q      
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNHYSKSSFTSPSS
          +P+F + S +SF + +S
Subjt:  SPRPQFNHYSKSSFTSPSS

A0A7J6FPX2 Uncharacterized protein7.9e-1944.63Show/hide
Query:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS
        G+T   YL KI  + +  ++ G+P+S +DHL  +L+GLG  YN+FVT IL R+  PS+E+V SLLL+YDARL+RQ +   L+  QAN + L++ ++N + 
Subjt:  GLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRS

Query:  SPRPQFNH----YSKSSFTSP
         PRP  +H    YS S  T P
Subjt:  SPRPQFNH----YSKSSFTSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.9e-0524.44Show/hide
Query:  LTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTL
        + V+ Y  K+K +AD    +  P++ R+ + ++L+GL   +++ +  I +R   PS +D  ++L   + RL+R    +  ++  ++ ST+
Subjt:  LTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATCGTCGGGTCTGACTGTTTCTCAGTATTTAGCTAAAATAAAAGATGTAGCTGATAAGTTTTCCGCGATTGGCGAGCCTATTTCTTACCGTGATCACTTAGCACA
TATCTTAGATGGATTAGGAAGTGATTACAATTCGTTTGTTACTACTATTTTGAATCGTGCTGATAATCCCTCTCTAGAAGATGTACGCAGTCTCTTACTTGCCTATGATG
CTCGGTTGGAGAGGCAGAATTCTGTTGATCAATTGAACCTTGCTCAAGCCAATCTAAGCACACTTAACATTCAACAAACTAATCGTCGCTCTTCCCCAAGGCCTCAGTTT
AACCATTACTCAAAATCATCTTTCACCTCTCCATCCTCTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTATCGTCGGGTCTGACTGTTTCTCAGTATTTAGCTAAAATAAAAGATGTAGCTGATAAGTTTTCCGCGATTGGCGAGCCTATTTCTTACCGTGATCACTTAGCACA
TATCTTAGATGGATTAGGAAGTGATTACAATTCGTTTGTTACTACTATTTTGAATCGTGCTGATAATCCCTCTCTAGAAGATGTACGCAGTCTCTTACTTGCCTATGATG
CTCGGTTGGAGAGGCAGAATTCTGTTGATCAATTGAACCTTGCTCAAGCCAATCTAAGCACACTTAACATTCAACAAACTAATCGTCGCTCTTCCCCAAGGCCTCAGTTT
AACCATTACTCAAAATCATCTTTCACCTCTCCATCCTCTCCCTAG
Protein sequenceShow/hide protein sequence
MLSSGLTVSQYLAKIKDVADKFSAIGEPISYRDHLAHILDGLGSDYNSFVTTILNRADNPSLEDVRSLLLAYDARLERQNSVDQLNLAQANLSTLNIQQTNRRSSPRPQF
NHYSKSSFTSPSSP