; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036293 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036293
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:43564689..43565192
RNA-Seq ExpressionLag0036293
SyntenyLag0036293
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045111.1 putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa]1.2e-2952.48Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL Q+LHCT+AK+IW  L  I++SR+LA+ M+ K KL N++KG  S+ EY  KI++C+DALA+I K +  +DHI+YILAGLG+EY+ ++ VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR
         + +VQDV++LLLT ES+IESK  I ++  LPT N+    R
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR

KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]8.1e-2949.01Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M+E IL Q+LH T+AK+IW  L  I++SR+LA+ M+ K KL N++KG  S+ EY  KI++C+DALA+I K +  +DHI+YILAGLG+EY+ I+ +I+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR----IQKEND
         + +VQD ++LLLT ES+IESK  I ++  LPT N+    R    ++KE++
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR----IQKEND

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.5e-3048.75Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL +++HC TA+E+W  L  ++ SR+LA++M++K KL+NI+KG   + +Y  K+K  +D+LAA GK V VEDHIM+IL GL SE+E  V VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN
         TQ +Q+V +LLL+HE R E +NSIN DG LP+ NL  Q +    N +Q +  Q+   QN
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]1.5e-3048.75Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL +++HC TA+E+W  L  ++ SR+LA++M++K KL+NI+KG   + +Y  K+K  +D+LAA GK V VEDHIM+IL GL SE+E  V VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN
         TQ +Q+V +LLL+HE R E +NSIN DG LP+ NL  Q +    N +Q +  Q+   QN
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia]1.8e-2844.83Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL  +LHC+TAKEIW  L Q+F +++L ++M++K +LQN++KGG S+ EY+ +IK  +D+L A GK++  EDHIM+IL+GLGSEYE  V VIT K 
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLI-------NQKQQQQNFGNNRGR
            +QDV ALLL+H+ RIE + S   D  LP+A++ + ++   +N+    +       ++ QQ  +  NNRGR
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLI-------NQKQQQQNFGNNRGR

TrEMBL top hitse value%identityAlignment
A0A5A7TUB3 Putative glutathione S-transferase isoform X16.0e-3052.48Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL Q+LHCT+AK+IW  L  I++SR+LA+ M+ K KL N++KG  S+ EY  KI++C+DALA+I K +  +DHI+YILAGLG+EY+ ++ VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR
         + +VQDV++LLLT ES+IESK  I ++  LPT N+    R
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon3.9e-2949.01Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M+E IL Q+LH T+AK+IW  L  I++SR+LA+ M+ K KL N++KG  S+ EY  KI++C+DALA+I K +  +DHI+YILAGLG+EY+ I+ +I+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR----IQKEND
         + +VQD ++LLLT ES+IESK  I ++  LPT N+    R    ++KE++
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNR----IQKEND

A0A6J1C6N9 dr1-associated corepressor homolog isoform X17.1e-3148.75Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL +++HC TA+E+W  L  ++ SR+LA++M++K KL+NI+KG   + +Y  K+K  +D+LAA GK V VEDHIM+IL GL SE+E  V VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN
         TQ +Q+V +LLL+HE R E +NSIN DG LP+ NL  Q +    N +Q +  Q+   QN
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X27.1e-3148.75Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL +++HC TA+E+W  L  ++ SR+LA++M++K KL+NI+KG   + +Y  K+K  +D+LAA GK V VEDHIM+IL GL SE+E  V VI+A+T
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN
         TQ +Q+V +LLL+HE R E +NSIN DG LP+ NL  Q +    N +Q +  Q+   QN
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQN

A0A6J1DYD5 uncharacterized protein LOC1110246588.7e-2944.83Show/hide
Query:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT
        M E IL  +LHC+TAKEIW  L Q+F +++L ++M++K +LQN++KGG S+ EY+ +IK  +D+L A GK++  EDHIM+IL+GLGSEYE  V VIT K 
Subjt:  MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKT

Query:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLI-------NQKQQQQNFGNNRGR
            +QDV ALLL+H+ RIE + S   D  LP+A++ + ++   +N+    +       ++ QQ  +  NNRGR
Subjt:  STQNVQDVIALLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLI-------NQKQQQQNFGNNRGR

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-0626.58Show/hide
Query:  VLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKTSTQNVQDV
        V   TTA +IW  L +I+ +     + +++ +L+   KG  ++++Y+  +    D LA +GK +  ++ +  +L  L  EY+ ++  I AK +   + ++
Subjt:  VLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKTSTQNVQDV

Query:  IALLLTHESRIESKNSINADGVLP-TANLIV-QNRIQKENDSQKLINQKQQQQNFGNN
           LL HES+I + +S     V+P TAN +  +N     N++    N +   +N  NN
Subjt:  IALLLTHESRIESKNSINADGVLP-TANLIV-QNRIQKENDSQKLINQKQQQQNFGNN

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.1e-0424.07Show/hide
Query:  TTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKTSTQNVQDVIALL
        +T+++IW  +   F +   A+ +++  +L+    G   + +Y  K+KK  D+L  +   V   + +MY+L GL  +++ I+ VI  +    +  D   +L
Subjt:  TTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKTSTQNVQDVIALL

Query:  LTHESRIE
           E R++
Subjt:  LTHESRIE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.3e-0524.12Show/hide
Query:  MTETILEQVLHC-TTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAK
        +T+++L+ ++    TA+++W  L  +F     A+ ++ + +L+       S++EY  K+K   D L  +   +     +M++L GL  +Y+ I+ VI  K
Subjt:  MTETILEQVLHC-TTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAK

Query:  TSTQNVQDVIALLLTHESRI--ESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQNFGNNRGR
        +   +  +  ++LL  ESR+  +SK+S++       +N++     Q+E   Q+  N      N G  R +
Subjt:  TSTQNVQDVIALLLTHESRI--ESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQNFGNNRGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAAACGATCTTAGAACAAGTTTTACACTGTACCACTGCAAAAGAAATTTGGTCATATCTTCTTCAGATTTTTAATTCTAGACACCTAGCACAGATAATGAAGAT
TAAAGTCAAATTACAGAATATACAAAAAGGAGGATCGTCTATGAATGAGTATGTATCGAAAATTAAAAAATGCATTGATGCCCTAGCTGCAATAGGAAAAACAGTCCTTG
TGGAAGATCATATTATGTATATTTTGGCTGGATTGGGATCTGAATATGAAATAATAGTTTATGTTATTACTGCCAAGACTAGTACTCAAAATGTGCAAGATGTTATTGCT
TTATTACTAACACATGAGAGTAGAATCGAAAGCAAAAATTCAATCAATGCAGATGGAGTCCTTCCTACAGCAAATCTTATAGTACAAAATCGTATTCAGAAGGAAAATGA
TTCTCAGAAATTGATAAATCAGAAGCAACAACAGCAAAATTTTGGTAATAATAGAGGTAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGAAACGATCTTAGAACAAGTTTTACACTGTACCACTGCAAAAGAAATTTGGTCATATCTTCTTCAGATTTTTAATTCTAGACACCTAGCACAGATAATGAAGAT
TAAAGTCAAATTACAGAATATACAAAAAGGAGGATCGTCTATGAATGAGTATGTATCGAAAATTAAAAAATGCATTGATGCCCTAGCTGCAATAGGAAAAACAGTCCTTG
TGGAAGATCATATTATGTATATTTTGGCTGGATTGGGATCTGAATATGAAATAATAGTTTATGTTATTACTGCCAAGACTAGTACTCAAAATGTGCAAGATGTTATTGCT
TTATTACTAACACATGAGAGTAGAATCGAAAGCAAAAATTCAATCAATGCAGATGGAGTCCTTCCTACAGCAAATCTTATAGTACAAAATCGTATTCAGAAGGAAAATGA
TTCTCAGAAATTGATAAATCAGAAGCAACAACAGCAAAATTTTGGTAATAATAGAGGTAGATGA
Protein sequenceShow/hide protein sequence
MTETILEQVLHCTTAKEIWSYLLQIFNSRHLAQIMKIKVKLQNIQKGGSSMNEYVSKIKKCIDALAAIGKTVLVEDHIMYILAGLGSEYEIIVYVITAKTSTQNVQDVIA
LLLTHESRIESKNSINADGVLPTANLIVQNRIQKENDSQKLINQKQQQQNFGNNRGR