CuGenDBv2

Lag0004526 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0004526
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr6:4664435..4664986
RNA-Seq Expression: Lag0004526
Synteny: Lag0004526
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
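
The genome location field uses a `seqid:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the span can be parsed and measured as below. The names `Location` and `parse_location` are illustrative only and are not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based, inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr6:4664435..4664986'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return Location(seqid, start, end)


loc = parse_location("chr6:4664435..4664986")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the span is 552 bp, which would correspond to 183 codons plus a stop codon with no introns.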


Homology
GenBank top hits (e value, % identity, alignment)
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] (e value: 7.0e-42; % identity: 47.95)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  FLDGS   P +F+D  Q Q NPE+  W+RYNR +M WIY+S++E  +G+IV  +SA++IW +L R Y + + A +  L++ LQ +KKEG    
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQS
         Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVT+IQ+++  P++E+V S LL+Y+ARLE+QS
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQS

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] (e value: 8.8e-37; % identity: 45.91)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  FLDGS   P +F+D  Q Q NPE+  W+RYNR +M WIY+S++E  +G+IV  +SA++IW +L R Y + + A +  L++ LQ +KKEG    
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSP
         Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVT+IQ+++  P++E+  SP
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSP

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] (e value: 5.0e-40; % identity: 48.02)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  F+DGS   P +F D A+  +N EYI W+R+NR IM WIY+SL++  MG+IV  +SA EIW +L++ Y S++ A I  L+++LQNL+K+G    
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLN
        +Y+ K K I +  AA+GEP+S +DHL ++  GL  EYN FVT+I  R D+  LE++ S LL+YE RLE Q+   QL+
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLN

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.4e-42; % identity: 48.09)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  F+D   S+P K++D A  Q+NPE++ W+R N+ +M WIYSSL+   +G+IV  S+A +IW+SL+  Y+S + A +M L SQLQ +KK    +S
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSSP
        +YL+++K + D+FA IGEP+SYRD L  IL GL  EY+ FVT+I NRSD P+L++V S L  YE RL ++S  + LN   ++P
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSSP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 1.9e-68; % identity: 70.33)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL G+LDG+I  P +F+D  Q Q NP Y  WERYNR +MCWIYSSLSEEKMGE+V+L +  +IWSSL+R YDS TTA IMGLK++LQNL+K+G  VS
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSS
        QYLAKIKEI DKFAA+GEP+SYRDHLAH+L+GLGSEYN FVT+I NR+DSP+LEDVRS LLAYEARL+KQ+ V+QLN+A ++
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSS

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BGF8 Uncharacterized protein (e value: 2.4e-40; % identity: 48.02)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  F+DGS   P +F D A+  +N EYI W+R+NR IM WIY+SL++  MG+IV  +SA EIW +L++ Y S++ A I  L+++LQNL+K+G    
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLN
        +Y+ K K I +  AA+GEP+S +DHL ++  GL  EYN FVT+I  R D+  LE++ S LL+YE RLE Q+   QL+
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLN

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 6.8e-43; % identity: 48.09)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  F+D   S+P K++D A  Q+NPE++ W+R N+ +M WIYSSL+   +G+IV  S+A +IW+SL+  Y+S + A +M L SQLQ +KK    +S
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSSP
        +YL+++K + D+FA IGEP+SYRD L  IL GL  EY+ FVT+I NRSD P+L++V S L  YE RL ++S  + LN   ++P
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSSP

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 9.4e-69; % identity: 70.33)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL G+LDG+I  P +F+D  Q Q NP Y  WERYNR +MCWIYSSLSEEKMGE+V+L +  +IWSSL+R YDS TTA IMGLK++LQNL+K+G  VS
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSS
        QYLAKIKEI DKFAA+GEP+SYRDHLAH+L+GLGSEYN FVT+I NR+DSP+LEDVRS LLAYEARL+KQ+ V+QLN+A ++
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSS

A0A7J0EGI5 Uncharacterized protein (e value: 3.4e-42; % identity: 47.95)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +ANGL  FLDGS   P +F+D  Q Q NPE+  W+RYNR +M WIY+S++E  +G+IV  +SA++IW +L R Y + + A +  L++ LQ +KKEG    
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQS
         Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVT+IQ+++  P++E+V S LL+Y+ARLE+QS
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQS

A0A803P233 Uncharacterized protein (e value: 2.7e-39; % identity: 48.82)
Query:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS
        +AN L G++DG++  PS+F+D    QINPE+  W R NR ++ W+Y+SLS+  +G+IV+ ++AAEIW SL R+  + + A     ++ LQNLKKEG   S
Subjt:  MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVS

Query:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQ
         YL K+K + +  A +GEPIS +DHL+++LN LG EYN FVT I  R   PT+E+V + LL YEARLE+Q
Subjt:  QYLAKIKEITDKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 8.0e-12; % identity: 23.18)
Query:  IDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEK-MGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVSQYLAKIKEITDKFAAIGE
        ID      N   + W++ +  +   +Y +L+ ++  G  V  S++ +IW  +   + +N  A  + L S+L+        V+ Y  K+K++ D    +  
Subjt:  IDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEK-MGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVSQYLAKIKEITDKFAAIGE

Query:  PISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEK
        P++ R+ + ++LNGL  +++  +  I++R   P+ +D  + L   E RL++
Subjt:  PISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEK
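
Each alignment above has an unlabeled middle line in which a letter marks an identical residue, `+` a similar one, and a space a mismatch or gap. A rough sketch of how a % identity figure could be derived from that line follows, assuming identity is identical columns divided by aligned columns (the page does not state its formula). The function name `midline_identity` is hypothetical, and the leading padding that sits under `Query:  ` must be removed before the call.

```python
from typing import Optional


def midline_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    `aligned_length` can be supplied when trailing spaces of the midline
    were lost, for example by taking the length of the query row instead.
    """
    columns = len(midline) if aligned_length is None else aligned_length
    if columns <= 0:
        raise ValueError("alignment has no columns")
    identical = sum(1 for ch in midline if ch.isalpha())
    if identical > columns:
        raise ValueError("more identical columns than aligned columns")
    return 100.0 * identical / columns


# First eight columns of the first GenBank alignment:
#   Query    MANGLSGF
#   midline  +ANGL  F
print(midline_identity("+ANGL  F"))
```

For a multi-block alignment, concatenate the midline blocks (or sum the counts) before dividing, so that the denominator covers the whole aligned region.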


Sequences
CDS sequence
ATGGCGAATGGACTCTCTGGTTTTCTGGATGGATCAATATCAGCTCCCTCAAAATTTATCGACCAAGCTCAGACGCAAATCAACCCTGAATATATCGGCTGGGAAAGGTA
CAATCGTTTCATAATGTGTTGGATTTATTCTTCCTTGTCTGAAGAGAAGATGGGTGAAATAGTGAATTTATCATCTGCGGCTGAAATCTGGTCATCTCTATCTCGTTCTT
ATGACTCTAACACTACTGCTTGCATAATGGGTTTAAAATCTCAGCTACAAAATTTAAAGAAGGAGGGTTTTTTTGTCAGTCAATACTTAGCCAAAATTAAAGAAATAACA
GACAAATTTGCGGCCATTGGGGAGCCCATATCTTATAGGGACCATTTAGCTCATATTCTTAATGGTTTAGGGAGTGAGTACAATCCTTTTGTAACCACTATACAGAACAG
ATCTGATAGCCCAACCTTAGAGGATGTTCGTAGTCCGTTGCTAGCCTATGAGGCACGGCTGGAAAAACAGTCCAATGTTGAACAACTGAATTTGGCTCCAAGCTCACCTT
AG
mRNA sequence
ATGGCGAATGGACTCTCTGGTTTTCTGGATGGATCAATATCAGCTCCCTCAAAATTTATCGACCAAGCTCAGACGCAAATCAACCCTGAATATATCGGCTGGGAAAGGTA
CAATCGTTTCATAATGTGTTGGATTTATTCTTCCTTGTCTGAAGAGAAGATGGGTGAAATAGTGAATTTATCATCTGCGGCTGAAATCTGGTCATCTCTATCTCGTTCTT
ATGACTCTAACACTACTGCTTGCATAATGGGTTTAAAATCTCAGCTACAAAATTTAAAGAAGGAGGGTTTTTTTGTCAGTCAATACTTAGCCAAAATTAAAGAAATAACA
GACAAATTTGCGGCCATTGGGGAGCCCATATCTTATAGGGACCATTTAGCTCATATTCTTAATGGTTTAGGGAGTGAGTACAATCCTTTTGTAACCACTATACAGAACAG
ATCTGATAGCCCAACCTTAGAGGATGTTCGTAGTCCGTTGCTAGCCTATGAGGCACGGCTGGAAAAACAGTCCAATGTTGAACAACTGAATTTGGCTCCAAGCTCACCTT
AG
Protein sequence
MANGLSGFLDGSISAPSKFIDQAQTQINPEYIGWERYNRFIMCWIYSSLSEEKMGEIVNLSSAAEIWSSLSRSYDSNTTACIMGLKSQLQNLKKEGFFVSQYLAKIKEIT
DKFAAIGEPISYRDHLAHILNGLGSEYNPFVTTIQNRSDSPTLEDVRSPLLAYEARLEKQSNVEQLNLAPSSP
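
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard nuclear genetic code and reading frame 0; `translate` and `CODON_TABLE` are illustrative names rather than part of any existing tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon.

    Whitespace (including line breaks from a wrapped sequence) is ignored.
    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 11 codons and last 4 codons of the CDS listed in this record.
print(translate("ATGGCGAATGGACTCTCTGGTTTTCTGGATGGA"))  # MANGLSGFLDG
print(translate("AGCTCACCTTAG"))                       # SSP*
```

Passing the full CDS, with or without its line breaks, and dropping the trailing `*` gives a string that can be compared directly with the listed protein sequence.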