; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011911 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011911
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr1:34891383..34892123
RNA-Seq ExpressionLag0011911
SyntenyLag0011911
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW77188.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.8e-3540.35Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M++IV    A  IWN  N  Y ++++ R+  L  +LQ  KKDGL     + K+K I +    IGE +S ++HL ++  GL  EYN FVTSIQNRSD P++
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK
        E + SLLL+Y   LE+Q ++  L+ AQ +++ LN     + + +  QP FN+       S   F  + SQ   ++    +PQ   P P   P  S  + K
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK

Query:  PQCQICGKFGHTVLICHHRTNLAYKTPP
        PQCQICGKFGH  LIC H TNL Y   P
Subjt:  PQCQICGKFGHTVLICHHRTNLAYKTPP

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]3.4e-3650Show/hide
Query:  QLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQANLSILN
        ++Q++KKDGL VS  L KIKEIT K  +IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD  +LEDVR+LLLAY  RLE+Q +VDQLN+ QAN++ L 
Subjt:  QLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQANLSILN

Query:  IHHSGRRFSNKSQPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPN---PWPSKSNS---SKPQCQICGKFGHTVLICHHRTNLAYK
        ++       N   P           S +S   F+  K        PN N   PWP    S    K QCQIC K GHT   C+HR NL YK
Subjt:  IHHSGRRFSNKSQPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPN---PWPSKSNS---SKPQCQICGKFGHTVLICHHRTNLAYK

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]4.5e-6055.79Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M E+V+L+    IW+ L   YDS T ARIM LK +LQ ++KDG  VS  L KIKEI DKF A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D+PSL
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNY--FSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQI
        EDVRSLLLAY+ARL++Q TVDQLN+AQANL  L++ H+ +R      P F++    + SF +S    + S   L KPQ+     + WP K +SSK QCQI
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNY--FSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQI

Query:  CGKFGHTVLICHHRTNLAYKTPPHNLKPCCTHH
        CGK GH+  +C+HRTN+AY    HN  P   +H
Subjt:  CGKFGHTVLICHHRTNLAYKTPPHNLKPCCTHH

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]5.0e-3552.8Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M EIV  + A  IW  L   Y+S++IA IM   +QLQKIKKDGL VS  L +IK++ D F AIGEP+SYRDHL++IL+GLGSEYN FV+SI NR++ PS+
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYFSRSSFISS
         DVR+LL+ Y +RLE+QT  D L L QAN++ L+I+   R       P +   +RSS  SS
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYFSRSSFISS

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]3.3e-3948.46Show/hide
Query:  MPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQAN
        M LKA+LQKI+KD L +S  L++IK++ DKF  +GE ISYRDHL HILDGLGSEYNAFVTSIQN  DN S+EDV SLLL+Y+A+LE+Q  +D LN+AQA 
Subjt:  MPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQAN

Query:  LSILNIHHSGRR-----FSNKS-----QPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPNPW-PSKSNSSKPQCQICGKFGHTVLICHHRTNLAYK
        LS L+  H+ +R     F N S      P+F+    S+  S +++PSF+                W PSK  SSKPQCQI  KFGH V  CH   + AY+
Subjt:  LSILNIHHSGRR-----FSNKS-----QPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPNPW-PSKSNSSKPQCQICGKFGHTVLICHHRTNLAYK

Query:  --TPPHNLKPC--CTHHSLLSLLLITL
           P  ++      T  S++ +L ITL
Subjt:  --TPPHNLKPC--CTHHSLLSLLLITL

TrEMBL top hitse value%identityAlignment
A0A438GYC1 Retrovirus-related Pol polyprotein from transposon RE11.8e-3540.35Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M++IV    A  IWN  N  Y ++++ R+  L  +LQ  KKDGL     + K+K I +    IGE +S ++HL ++  GL  EYN FVTSIQNRSD P++
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK
        E + SLLL+Y   LE+Q ++  L+ AQ +++ LN     + + +  QP FN+       S   F  + SQ   ++    +PQ   P P   P  S  + K
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK

Query:  PQCQICGKFGHTVLICHHRTNLAYKTPP
        PQCQICGKFGH  LIC H TNL Y   P
Subjt:  PQCQICGKFGHTVLICHHRTNLAYKTPP

A0A6J1D6N7 uncharacterized protein LOC1110174381.7e-3650Show/hide
Query:  QLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQANLSILN
        ++Q++KKDGL VS  L KIKEIT K  +IGEPIS +DH+++I++GLG EYNAFVTSIQNRSD  +LEDVR+LLLAY  RLE+Q +VDQLN+ QAN++ L 
Subjt:  QLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKARLERQTTVDQLNLAQANLSILN

Query:  IHHSGRRFSNKSQPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPN---PWPSKSNS---SKPQCQICGKFGHTVLICHHRTNLAYK
        ++       N   P           S +S   F+  K        PN N   PWP    S    K QCQIC K GHT   C+HR NL YK
Subjt:  IHHSGRRFSNKSQPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPN---PWPSKSNS---SKPQCQICGKFGHTVLICHHRTNLAYK

A0A6J1DQX7 uncharacterized protein LOC1110223152.2e-6055.79Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M E+V+L+    IW+ L   YDS T ARIM LK +LQ ++KDG  VS  L KIKEI DKF A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NR+D+PSL
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNY--FSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQI
        EDVRSLLLAY+ARL++Q TVDQLN+AQANL  L++ H+ +R      P F++    + SF +S    + S   L KPQ+     + WP K +SSK QCQI
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNY--FSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQI

Query:  CGKFGHTVLICHHRTNLAYKTPPHNLKPCCTHH
        CGK GH+  +C+HRTN+AY    HN  P   +H
Subjt:  CGKFGHTVLICHHRTNLAYKTPPHNLKPCCTHH

A0A7J0E8R3 Uncharacterized protein9.2e-3542.34Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        + +IV    AS IW  L   Y + + A +  L+  LQ IKKDGL     + K + + +   +IGEP++Y DHL + L GLG +YN FVTSIQ+++  PS+
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHF-NYFSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQIC
        E+V SLLL+Y ARLERQ+  D L+  QANL+  N+ +   +F N S   F N  S S     N  PS+S      P  +SP P          +P+CQIC
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHF-NYFSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQIC

Query:  GKFGHTVLICHHRTNLAYKTPP
         K GHT   C+HRTNL Y+ PP
Subjt:  GKFGHTVLICHHRTNLAYKTPP

A5BWU6 Reverse transcriptase Ty1/copia-type domain-containing protein2.4e-3540.35Show/hide
Query:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL
        M++IV    A  IWN  N  Y ++++ R+  L  +LQ  KKDGL     + K+K I +    IGE +S ++HL ++  GL  EYN FVTSIQNRSD P++
Subjt:  MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSL

Query:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK
        E + SLLL+Y   LE+Q ++  L+ AQ +++ LN     + + +  QP FN+       S   F  + SQ   ++    +PQ   P P   P  S  + K
Subjt:  EDVRSLLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYF------SRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKS-NSSK

Query:  PQCQICGKFGHTVLICHHRTNLAYKTPP
        PQCQICGKFGH  LIC H TNL Y   P
Subjt:  PQCQICGKFGHTVLICHHRTNLAYKTPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-0524.04Show/hide
Query:  IWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKA
        IW  +   + +N  AR + L ++L+      + V+    K+K++ D    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   + 
Subjt:  IWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLAYKA

Query:  RLER
        RL+R
Subjt:  RLER

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-0829.36Show/hide
Query:  ASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLA
        A  +W  L   +  N  AR +  + +L+    D L V     K+K ++D    +  PIS R  + H+L+GL  +Y+  +  I+++S  PS  + RS+LL 
Subjt:  ASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRSLLLA

Query:  YKARLERQT
         ++RL  ++
Subjt:  YKARLERQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGAGATTGTTAATTTAGATTATGCTTCTACTATTTGGAATTGCCTTAATTGGTCTTATGATTCTAACACTATTGCACGTATTATGCCTTTGAAAGCTCAG
TTACAAAAAATTAAAAAGGATGGTTTGTTTGTGAGTCATGTTTTAACTAAAATCAAGGAGATTACAGATAAATTTGTTGCCATTGGTGAACCTATTTCCTATAGG
GATCATCTAGCTCACATACTTGATGGTTTAGGTAGTGAATACAATGCATTTGTTACCTCCATCCAAAATAGATCCGATAATCCTAGTTTGGAAGATGTTAGGAGT
CTCCTTCTGGCCTATAAAGCTCGATTGGAAAGGCAAACTACTGTTGATCAACTAAACTTAGCTCAAGCTAACCTAAGTATTCTCAACATTCATCATTCTGGTCGT
CGTTTTTCTAATAAGTCTCAACCTCATTTCAACTATTTCTCTAGGTCATCCTTTATATCATCCAACTCTCAACCTTCCTTTTCTCTCGAGAAACTTGATAAACCT
CAAACTGCTAGCCCAAACCCCAACCCTTGGCCTTCAAAATCAAACAGCTCGAAGCCTCAATGCCAAATTTGTGGAAAATTTGGCCATACTGTCTTAATATGTCAT
CATAGAACCAATTTAGCCTATAAAACCCCTCCCCATAACCTCAAGCCCTGCTGCACACATCACAGCCTTCTGTCTCTCCTACTGATAACTCTCCAAAAGGGAGTT
ATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGAGATTGTTAATTTAGATTATGCTTCTACTATTTGGAATTGCCTTAATTGGTCTTATGATTCTAACACTATTGCACGTATTATGCCTTTGAAAGCTCAG
TTACAAAAAATTAAAAAGGATGGTTTGTTTGTGAGTCATGTTTTAACTAAAATCAAGGAGATTACAGATAAATTTGTTGCCATTGGTGAACCTATTTCCTATAGG
GATCATCTAGCTCACATACTTGATGGTTTAGGTAGTGAATACAATGCATTTGTTACCTCCATCCAAAATAGATCCGATAATCCTAGTTTGGAAGATGTTAGGAGT
CTCCTTCTGGCCTATAAAGCTCGATTGGAAAGGCAAACTACTGTTGATCAACTAAACTTAGCTCAAGCTAACCTAAGTATTCTCAACATTCATCATTCTGGTCGT
CGTTTTTCTAATAAGTCTCAACCTCATTTCAACTATTTCTCTAGGTCATCCTTTATATCATCCAACTCTCAACCTTCCTTTTCTCTCGAGAAACTTGATAAACCT
CAAACTGCTAGCCCAAACCCCAACCCTTGGCCTTCAAAATCAAACAGCTCGAAGCCTCAATGCCAAATTTGTGGAAAATTTGGCCATACTGTCTTAATATGTCAT
CATAGAACCAATTTAGCCTATAAAACCCCTCCCCATAACCTCAAGCCCTGCTGCACACATCACAGCCTTCTGTCTCTCCTACTGATAACTCTCCAAAAGGGAGTT
ATGTAG
Protein sequenceShow/hide protein sequence
MSEIVNLDYASTIWNCLNWSYDSNTIARIMPLKAQLQKIKKDGLFVSHVLTKIKEITDKFVAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRSDNPSLEDVRS
LLLAYKARLERQTTVDQLNLAQANLSILNIHHSGRRFSNKSQPHFNYFSRSSFISSNSQPSFSLEKLDKPQTASPNPNPWPSKSNSSKPQCQICGKFGHTVLICH
HRTNLAYKTPPHNLKPCCTHHSLLSLLLITLQKGVM