CuGenDBv2

Lag0031890 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031890
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr11:17894812..17896326
RNA-Seq Expression: Lag0031890
Synteny: Lag0031890
Gene Ontology terms: GO:0006807 - nitrogen compound metabolic process (biological process)
                     GO:0071704 - organic substance metabolic process (biological process)
                     GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
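
As a small illustration, the genome location string above can be split into its parts and the span length computed. This is a minimal Python sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr11:17894812..17896326")
print(chrom, start, end, span)
```

Under that assumption the gene spans 1515 bp on chr11.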


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 2.1e-23 | % identity: 40.97
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LAQ M+ K+KL NI+KG   + EY  KI +C+DALA+I K VS +DHILYI +GL             +T S  VQ+V++LLLT ES+ ESK  + S
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFG----NGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQVF
        +  LP+ N+  Q   ++  E+    NQ+    N       GRG G+SN   GR G    NRN+PQCQ+C K G++A +C+ R      Y  +  S G   
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFG----NGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQVF

Query:  SSGQNFGQQFGNQFPQMQAMMAAQNFN
        +S  N      N  PQM AM+AA + N
Subjt:  SSGQNFGQQFGNQFPQMQAMMAAQNFN

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] | e value: 2.8e-28 | % identity: 40
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LA++M++KSKL+NI+KG   + +Y  K+K  +D+LAA GKKV+VEDHI++I +GL             +T +Q +Q+V +LLL+HE R E ++SIN+
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V
        DG LP+ NLT Q       +++     + Q     N R +   N GN    R+WN+ NRPQCQ+  KFGHTA++CY R +      T  G  GQ      
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V

Query:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN
        FS G N         FG Q   F N F       M A +A Q+FN
Subjt:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] | e value: 2.8e-28 | % identity: 40
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LA++M++KSKL+NI+KG   + +Y  K+K  +D+LAA GKKV+VEDHI++I +GL             +T +Q +Q+V +LLL+HE R E ++SIN+
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V
        DG LP+ NLT Q       +++     + Q     N R +   N GN    R+WN+ NRPQCQ+  KFGHTA++CY R +      T  G  GQ      
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V

Query:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN
        FS G N         FG Q   F N F       M A +A Q+FN
Subjt:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 8.3e-25 | % identity: 37.7
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR LA++M++K KL+N +KG  S+ +Y  KIK  +D+LA  GKK+S EDHI++I +GL             +   Q +Q+V +LLL  E R E ++ INS
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQM----PGAYNTQFGSPGQVF
        DG LP+ NLT+ +    +  NL         Q+  + RGRG +N  + R  R+W   N+PQCQ+C +FGHTA++CY R +     P      F   G  F
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQM----PGAYNTQFGSPGQVF

Query:  SSGQNFGQQFGNQF-----------------PQMQAMMAAQNFN
        SSG        N F                  QMQA+M AQ+FN
Subjt:  SSGQNFGQQFGNQF-----------------PQMQAMMAAQNFN

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia] | e value: 4.0e-27 | % identity: 40.19
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        +++L ++M++K++LQN++KGG S+ EYI +IK  +D+L A GK ++ EDHI++I SGL            VK G   +QDV ALLL+H+ R+E + S   
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTA--NLTVQNHVQEEVENLRSMN---QHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCY---SRVQMPGAYNTQFGSP
        D  LP+A  NL  Q   Q    N  S N    H+ QQ++ N   RG+  F +  GGR WN+RN+ QCQ+C++FGHTA + Y   S VQ    Y+T++   
Subjt:  DGVLPTA--NLTVQNHVQEEVENLRSMN---QHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCY---SRVQMPGAYNTQFGSP

Query:  GQVFSSGQN
           +   QN
Subjt:  GQVFSSGQN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 9.9e-24 | % identity: 40.97
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LAQ M+ K+KL NI+KG   + EY  KI +C+DALA+I K VS +DHILYI +GL             +T S  VQ+V++LLLT ES+ ESK  + S
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFG----NGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQVF
        +  LP+ N+  Q   ++  E+    NQ+    N       GRG G+SN   GR G    NRN+PQCQ+C K G++A +C+ R      Y  +  S G   
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFG----NGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQVF

Query:  SSGQNFGQQFGNQFPQMQAMMAAQNFN
        +S  N      N  PQM AM+AA + N
Subjt:  SSGQNFGQQFGNQFPQMQAMMAAQNFN

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 | e value: 1.3e-28 | % identity: 40
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LA++M++KSKL+NI+KG   + +Y  K+K  +D+LAA GKKV+VEDHI++I +GL             +T +Q +Q+V +LLL+HE R E ++SIN+
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V
        DG LP+ NLT Q       +++     + Q     N R +   N GN    R+WN+ NRPQCQ+  KFGHTA++CY R +      T  G  GQ      
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V

Query:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN
        FS G N         FG Q   F N F       M A +A Q+FN
Subjt:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 | e value: 1.3e-28 | % identity: 40
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR+LA++M++KSKL+NI+KG   + +Y  K+K  +D+LAA GKKV+VEDHI++I +GL             +T +Q +Q+V +LLL+HE R E ++SIN+
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V
        DG LP+ NLT Q       +++     + Q     N R +   N GN    R+WN+ NRPQCQ+  KFGHTA++CY R +      T  G  GQ      
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQ-----V

Query:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN
        FS G N         FG Q   F N F       M A +A Q+FN
Subjt:  FSSGQN---------FGQQ---FGNQF-----PQMQAMMAAQNFN

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 4.0e-25 | % identity: 37.7
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        SR LA++M++K KL+N +KG  S+ +Y  KIK  +D+LA  GKK+S EDHI++I +GL             +   Q +Q+V +LLL  E R E ++ INS
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQM----PGAYNTQFGSPGQVF
        DG LP+ NLT+ +    +  NL         Q+  + RGRG +N  + R  R+W   N+PQCQ+C +FGHTA++CY R +     P      F   G  F
Subjt:  DGVLPTANLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQM----PGAYNTQFGSPGQVF

Query:  SSGQNFGQQFGNQF-----------------PQMQAMMAAQNFN
        SSG        N F                  QMQA+M AQ+FN
Subjt:  SSGQNFGQQFGNQF-----------------PQMQAMMAAQNFN

A0A6J1DYD5 uncharacterized protein LOC111024658 | e value: 1.9e-27 | % identity: 40.19
Query:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS
        +++L ++M++K++LQN++KGG S+ EYI +IK  +D+L A GK ++ EDHI++I SGL            VK G   +QDV ALLL+H+ R+E + S   
Subjt:  SRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGL------------VKTGSQLVQDVIALLLTHESRLESKSSINS

Query:  DGVLPTA--NLTVQNHVQEEVENLRSMN---QHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCY---SRVQMPGAYNTQFGSP
        D  LP+A  NL  Q   Q    N  S N    H+ QQ++ N   RG+  F +  GGR WN+RN+ QCQ+C++FGHTA + Y   S VQ    Y+T++   
Subjt:  DGVLPTA--NLTVQNHVQEEVENLRSMN---QHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCY---SRVQMPGAYNTQFGSP

Query:  GQVFSSGQN
           +   QN
Subjt:  GQVFSSGQN

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found
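
The % identity figures can be related to the alignments shown above. In BLAST-style pairwise output, the middle line carries a letter where the two residues are identical, `+` where the substitution scores positively, and a space otherwise. The Python sketch below counts these over one alignment segment. The function name `midline_stats` is hypothetical, and the result need not match the reported values, since it assumes the displayed segment is the whole region the reported figure was computed on.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style match line.

    Assumption: `query` and `midline` are the same alignment columns;
    trailing spaces trimmed from the match line are padded back.
    """
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("match line is longer than the query line")
    identities = sum(ch.isalpha() for ch in midline)   # identical residues
    positives = identities + midline.count("+")        # identical or conservative
    length = len(query)                                # alignment columns, gaps included
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "pct_identity": round(100.0 * identities / length, 2),
    }

# First 20 alignment columns of the KAA0048297.1 hit, copied from the page.
stats = midline_stats("SRHLAQMMKIKSKLQNIQKG", "SR+LAQ M+ K+KL NI+KG")
print(stats)
```

To score a full hit, the segments of each line would be concatenated in order before calling the function.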

Sequences
CDS sequence
ATGGTCTTTCTCCTCCATTCCTTTCACTGCATTTTTACTAATCTTGCGACGACATATTACCCGAAATTTTATCTTCGGCCACACACGCCAACATTGTCGGTGATGACCGT
CGTCGACCATTCTCCTTTTCTTCTCATTTTCTTCCGCCTTCTTCCTAACTTTGTGGTCACTGCTATGTTCTCTCTGTTAAGGCAGCCCATCAAATACTCAGTCCAAGATG
AGTTTCAGTGCAAGCCCAAGTGTGTTGTTAGTACGTGGCTTCTAGAATTTTTCAGTTTCAGAGTTCTTAGGATTTTTCCATTTTTTGTTGCTGTAAATATGGACATGAAC
TGGAGAATCATATTGAGGAAGATTGTCAACCTTCTCCGTAAACAATCAAGACATCTTGCTCAGATGATGAAAATTAAGTCTAAACTTCAAAATATTCAGAAAGGAGGTTC
TTCCATGAATGAGTACATTTCTAAAATTAAGAAATGCATTGATGCTCTAGCTGCGATAGGAAAAAAAGTCTCAGTAGAAGATCATATATTGTATATTTTTTCTGGACTGG
TAAAAACAGGATCTCAATTAGTCCAAGATGTAATAGCACTTTTGTTAACCCATGAAAGTCGATTAGAAAGTAAATCTTCAATAAATTCTGATGGTGTTTTACCTACGGCT
AATTTGACTGTTCAAAATCATGTTCAAGAGGAAGTTGAAAATCTAAGAAGTATGAACCAGCATCAACAACAGCAGAATTTTGGTAACGGTAGAGGTAGAGGACAATCTAA
TTTTGGTAATGGTAGAGGTGGAAGGTCATGGAATAATCGTAATAGACCTCAGTGTCAATTGTGCAATAAGTTTGGACACACTGCTATAAAATGTTACTCACGTGTTCAAA
TGCCAGGAGCTTATAATACTCAGTTTGGTTCCCCTGGTCAAGTCTTTTCTTCTGGACAAAATTTTGGACAGCAATTTGGAAATCAATTTCCTCAAATGCAAGCAATGATG
GCTGCTCAAAACTTCAATTAA
mRNA sequence
ATGGTCTTTCTCCTCCATTCCTTTCACTGCATTTTTACTAATCTTGCGACGACATATTACCCGAAATTTTATCTTCGGCCACACACGCCAACATTGTCGGTGATGACCGT
CGTCGACCATTCTCCTTTTCTTCTCATTTTCTTCCGCCTTCTTCCTAACTTTGTGGTCACTGCTATGTTCTCTCTGTTAAGGCAGCCCATCAAATACTCAGTCCAAGATG
AGTTTCAGTGCAAGCCCAAGTGTGTTGTTAGTACGTGGCTTCTAGAATTTTTCAGTTTCAGAGTTCTTAGGATTTTTCCATTTTTTGTTGCTGTAAATATGGACATGAAC
TGGAGAATCATATTGAGGAAGATTGTCAACCTTCTCCGTAAACAATCAAGACATCTTGCTCAGATGATGAAAATTAAGTCTAAACTTCAAAATATTCAGAAAGGAGGTTC
TTCCATGAATGAGTACATTTCTAAAATTAAGAAATGCATTGATGCTCTAGCTGCGATAGGAAAAAAAGTCTCAGTAGAAGATCATATATTGTATATTTTTTCTGGACTGG
TAAAAACAGGATCTCAATTAGTCCAAGATGTAATAGCACTTTTGTTAACCCATGAAAGTCGATTAGAAAGTAAATCTTCAATAAATTCTGATGGTGTTTTACCTACGGCT
AATTTGACTGTTCAAAATCATGTTCAAGAGGAAGTTGAAAATCTAAGAAGTATGAACCAGCATCAACAACAGCAGAATTTTGGTAACGGTAGAGGTAGAGGACAATCTAA
TTTTGGTAATGGTAGAGGTGGAAGGTCATGGAATAATCGTAATAGACCTCAGTGTCAATTGTGCAATAAGTTTGGACACACTGCTATAAAATGTTACTCACGTGTTCAAA
TGCCAGGAGCTTATAATACTCAGTTTGGTTCCCCTGGTCAAGTCTTTTCTTCTGGACAAAATTTTGGACAGCAATTTGGAAATCAATTTCCTCAAATGCAAGCAATGATG
GCTGCTCAAAACTTCAATTAA
Protein sequence
MVFLLHSFHCIFTNLATTYYPKFYLRPHTPTLSVMTVVDHSPFLLIFFRLLPNFVVTAMFSLLRQPIKYSVQDEFQCKPKCVVSTWLLEFFSFRVLRIFPFFVAVNMDMN
WRIILRKIVNLLRKQSRHLAQMMKIKSKLQNIQKGGSSMNEYISKIKKCIDALAAIGKKVSVEDHILYIFSGLVKTGSQLVQDVIALLLTHESRLESKSSINSDGVLPTA
NLTVQNHVQEEVENLRSMNQHQQQQNFGNGRGRGQSNFGNGRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGAYNTQFGSPGQVFSSGQNFGQQFGNQFPQMQAMM
AAQNFN
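
The protein sequence above should be recoverable from the CDS by translation. A minimal Python sketch follows, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. `translate` is a hypothetical helper, not part of CuGenDB. The two fragments used are the first 30 and the last 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a nucleotide sequence in frame 1.

    Unknown codons become 'X'; translation stops at the first stop
    codon when `to_stop` is true, otherwise stops are written as '*'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGTCTTTCTCCTCCATTCCTTTCACTGC"))  # start of the CDS
print(translate("GCTGCTCAAAACTTCAATTAA"))           # end of the CDS
```

The two fragments translate to `MVFLLHSFHC` and `AAQNFN`, which match the start and end of the listed protein sequence. Joining all CDS lines into one string before calling `translate` would check the full sequence the same way.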