; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038399 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038399
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:16687522..16691909
RNA-Seq ExpressionLag0038399
SyntenyLag0038399
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG52621.1 hypothetical protein EZV62_021790 [Acer yangbiense]3.8e-2245.22Show/hide
Query:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS
        ++ K      A A+VG  +S ED +  I +GL +EY+S+I  +T++V   TV EV  LLL HE RIES S+N DGS PS +L+  +   K+         
Subjt:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS

Query:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP
         N++  +Q F GNRG+GR      GGR  W+N NKPQCQLCGKFGH   KCY  +DP
Subjt:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP

TXG70404.1 hypothetical protein EZV62_005339 [Acer yangbiense]2.2e-2245.22Show/hide
Query:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS
        ++ K      A A+VG  +S ED +  I +GL ++Y+S+I  +T++V   TV EV  LLL HE RIES S+N DGS PS +L+  +  MK+         
Subjt:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS

Query:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP
         N++  +Q F GNRG+GR      GGR  W+N NKPQCQLCGKFGH   KCY  +DP
Subjt:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]6.9e-2447.52Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        AA GK V++EDHI++IL+GL SE++S +SVI+A+  ++T+QEV +LLL+HE R E  SINTDG+LPS +L+ Q+ N    ++   G R     N S N G
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
               N   R +WN+ N+PQCQ+ GKFGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]6.9e-2447.52Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        AA GK V++EDHI++IL+GL SE++S +SVI+A+  ++T+QEV +LLL+HE R E  SINTDG+LPS +L+ Q+ N    ++   G R     N S N G
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
               N   R +WN+ N+PQCQ+ GKFGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]6.5e-2246.1Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        A  GK +S EDHI++IL+GL  E+D++ISVITA+   +T+QEV +LLL  E R E   IN+DGSLPS +L++   + K   N      F   Q+    RG
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
        RG   N     R +W   NKPQCQ+CG+FGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

TrEMBL top hitse value%identityAlignment
A0A5C7H6P1 Uncharacterized protein1.8e-2245.22Show/hide
Query:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS
        ++ K      A A+VG  +S ED +  I +GL +EY+S+I  +T++V   TV EV  LLL HE RIES S+N DGS PS +L+  +   K+         
Subjt:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS

Query:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP
         N++  +Q F GNRG+GR      GGR  W+N NKPQCQLCGKFGH   KCY  +DP
Subjt:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP

A0A5C7INH4 Uncharacterized protein1.1e-2245.22Show/hide
Query:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS
        ++ K      A A+VG  +S ED +  I +GL ++Y+S+I  +T++V   TV EV  LLL HE RIES S+N DGS PS +L+  +  MK+         
Subjt:  FVGKFIRYFTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEI---ENQRS

Query:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP
         N++  +Q F GNRG+GR      GGR  W+N NKPQCQLCGKFGH   KCY  +DP
Subjt:  GNRFQNDQNFSGNRGRGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYDP

A0A6J1C6N9 dr1-associated corepressor homolog isoform X13.4e-2447.52Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        AA GK V++EDHI++IL+GL SE++S +SVI+A+  ++T+QEV +LLL+HE R E  SINTDG+LPS +L+ Q+ N    ++   G R     N S N G
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
               N   R +WN+ N+PQCQ+ GKFGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

A0A6J1C8R2 dr1-associated corepressor homolog isoform X23.4e-2447.52Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        AA GK V++EDHI++IL+GL SE++S +SVI+A+  ++T+QEV +LLL+HE R E  SINTDG+LPS +L+ Q+ N    ++   G R     N S N G
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
               N   R +WN+ N+PQCQ+ GKFGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

A0A6J1DLT9 uncharacterized protein LOC1110217573.1e-2246.1Show/hide
Query:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG
        A  GK +S EDHI++IL+GL  E+D++ISVITA+   +T+QEV +LLL  E R E   IN+DGSLPS +L++   + K   N      F   Q+    RG
Subjt:  AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRG

Query:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD
        RG   N     R +W   NKPQCQ+CG+FGHTA++CY+ ++
Subjt:  RGRSYNQNRGGRSSWNNQNKPQCQLCGKFGHTAIKCYICYD

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-0732.17Show/hide
Query:  FTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFS
        F   A +GKP+  ++ +  +L  L  EY  +I  I AK    T+ E+   LL HE++I + S  T   +P T  +V   N     N  +GNR  N  +  
Subjt:  FTAQAAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFS

Query:  GNRGRGRSYNQNRGGRSSWNNQNKP---QCQLCGKFGHTAIKC
         N    + + Q+       NNQ+KP   +CQ+CG  GH+A +C
Subjt:  GNRGRGRSYNQNRGGRSSWNNQNKP---QCQLCGKFGHTAIKC

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.5e-0433.88Show/hide
Query:  VGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRF-QNDQNFSGNRGR
        V  P+S    ++++L+GL  +YD +++VI  K    +  E  ++LL  E+R+ +KS     SL  T+    S  +  +  Q+   R+ Q   N + N GR
Subjt:  VGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRF-QNDQNFSGNRGR

Query:  GRSYNQNRGGRSS---WNNQN
        GRS  +NRGG SS   +NN N
Subjt:  GRSYNQNRGGRSS---WNNQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGGAGAATTAGGAATTGTTCGAGTGTTCGGGACCTAAAAAGGGGCAAAAGAATCAAGGAAGATGTTGAAGAAGTGTTGCAGCAAGCCATTTTCCCTAGAGGAAA
GCGTCGAGACACTCCTTTGAGGCGTCTCGATGTTGTTGCAGTCAGTTGCCTAGTAAGAATTCTTGAGGCACCAACATCGAGATACTGGTTTGTAGCGTCCCGATATGTCC
AACATCTTCAGATGCCTGGAATCTCAGATGACCGGAATTTTAGATATCCGGCATCTTCAGATAGCCACGTTTGTTTTGTAGGAAAATTTATAAGGTACTTCACTGCACAA
GCAGCAGTAGGAAAACCTGTATCAATTGAGGATCACATACTATATATTCTATCTGGTCTTGACTCGGAATATGATTCTATGATTTCAGTAATCACAGCAAAAGTTTGTTC
TGAAACAGTACAAGAAGTCATGACACTCCTACTTACACATGAAAATCGAATTGAGAGTAAGTCGATTAATACTGATGGGTCTCTCCCTTCTACTCATCTTTCTGTTCAAA
GCCCAAATATGAAGGAGATTGAAAATCAAAGGAGTGGTAATCGTTTTCAGAATGACCAAAACTTCAGTGGCAATCGAGGTAGAGGTAGATCATATAATCAAAACCGAGGC
GGTCGTTCTTCATGGAATAATCAGAATAAACCTCAATGCCAACTCTGTGGAAAATTTGGTCATACAGCTATAAAATGCTATATTTGTTATGATCCGCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGGAGAATTAGGAATTGTTCGAGTGTTCGGGACCTAAAAAGGGGCAAAAGAATCAAGGAAGATGTTGAAGAAGTGTTGCAGCAAGCCATTTTCCCTAGAGGAAA
GCGTCGAGACACTCCTTTGAGGCGTCTCGATGTTGTTGCAGTCAGTTGCCTAGTAAGAATTCTTGAGGCACCAACATCGAGATACTGGTTTGTAGCGTCCCGATATGTCC
AACATCTTCAGATGCCTGGAATCTCAGATGACCGGAATTTTAGATATCCGGCATCTTCAGATAGCCACGTTTGTTTTGTAGGAAAATTTATAAGGTACTTCACTGCACAA
GCAGCAGTAGGAAAACCTGTATCAATTGAGGATCACATACTATATATTCTATCTGGTCTTGACTCGGAATATGATTCTATGATTTCAGTAATCACAGCAAAAGTTTGTTC
TGAAACAGTACAAGAAGTCATGACACTCCTACTTACACATGAAAATCGAATTGAGAGTAAGTCGATTAATACTGATGGGTCTCTCCCTTCTACTCATCTTTCTGTTCAAA
GCCCAAATATGAAGGAGATTGAAAATCAAAGGAGTGGTAATCGTTTTCAGAATGACCAAAACTTCAGTGGCAATCGAGGTAGAGGTAGATCATATAATCAAAACCGAGGC
GGTCGTTCTTCATGGAATAATCAGAATAAACCTCAATGCCAACTCTGTGGAAAATTTGGTCATACAGCTATAAAATGCTATATTTGTTATGATCCGCCGTGA
Protein sequenceShow/hide protein sequence
MNWRIRNCSSVRDLKRGKRIKEDVEEVLQQAIFPRGKRRDTPLRRLDVVAVSCLVRILEAPTSRYWFVASRYVQHLQMPGISDDRNFRYPASSDSHVCFVGKFIRYFTAQ
AAVGKPVSIEDHILYILSGLDSEYDSMISVITAKVCSETVQEVMTLLLTHENRIESKSINTDGSLPSTHLSVQSPNMKEIENQRSGNRFQNDQNFSGNRGRGRSYNQNRG
GRSSWNNQNKPQCQLCGKFGHTAIKCYICYDPP