CuGenDBv2

Lag0000732 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000732
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr4:14541978..14545957
RNA-Seq Expression: Lag0000732
Synteny: Lag0000732
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
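
As a small illustration, the genome location string above can be split into a chromosome name and coordinates. This is a minimal Python sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, span)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    span = end - start + 1  # assumption: 1-based, inclusive coordinates
    return chrom, start, end, span

print(parse_location("chr4:14541978..14545957"))
```

Under that assumption the gene spans 3980 bp on chr4.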


Homology
GenBank top hits (e value, % identity, alignment)
KAA0053390.1 polyprotein [Cucumis melo var. makuwa] | e value: 7.4e-43 | % identity: 56.6
Query:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF
        ERG+SST S+  + ERV EL+ SQ T+L + N M+EDFRAT++ +R E+ ++N RL+LTM+A+ NQ P +G    +++K+PEPK F G RDAK LEN+IF
Subjt:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF

Query:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
        D+EQYF+AT T +EE K+TLATMHLS DAKLWWR++  DIQ G  T+++W+ LK+ELR+
Subjt:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

WP_217833205.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | e value: 6.5e-47 | % identity: 67.88
Query:  SQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIFDVEQYFKATVTTSEEMKITLAT
        S   ML+LFN++T+DFRAT+ET+R EM EM T++NLTM+AV NQTPNQ   + N+ K+PEPKAFSGNRDAK+LENFIFD++QYFKA+ T SEE+K+TLA+
Subjt:  SQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIFDVEQYFKATVTTSEEMKITLAT

Query:  MHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
        MHLS+DAKLWWR+KVND+Q G  T+++W DLKK+LRA
Subjt:  MHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | e value: 9.0e-57 | % identity: 68.86
Query:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD
        +KAT P S ERG+SST+   ++  R+ ELN S + M++LFNEMTEDF+ TI+TLR EM E++TR+NLTM+AV NQ PNQ +   NK+KVPEPK F+GNRD
Subjt:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD

Query:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR
        AK+LENF+FDVEQYFKAT TTSEEMK+TLATMHL++DAKLWWR+KVNDIQ+G  T+NSW+DLKKELR
Subjt:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] | e value: 1.6e-53 | % identity: 66.47
Query:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD
        +KAT P S ERG+SST+   ++  R+ ELN S +TM++LFNEMTEDF+ TI+TLR EM E++TR+NLTM+AV NQ PNQ +   NK+KVPEPK F+GNR 
Subjt:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD

Query:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR
         K+LENF FDVEQYFK T T SE MK+TLATMHL++DAKLWWR+KVNDIQ+G  T+NSW+DLKKELR
Subjt:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR

XP_031745591.1 uncharacterized protein LOC116406038 isoform X1 [Cucumis sativus] | e value: 4.4e-43 | % identity: 58.49
Query:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF
        ERG SS+ SV +M ERV EL+ SQ  ++ + N MTEDFRAT++ +R E+ E+NT++NLTM+A+ NQ P  G     KIK+PEPK F G RDAK LENFIF
Subjt:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF

Query:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
        D+E+YFKAT T +EE K+TLATMHLS DAKLWWR++  DIQ G   +++W+ LK+ELR+
Subjt:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UDW8 Polyprotein | e value: 3.6e-43 | % identity: 56.6
Query:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF
        ERG+SST S+  + ERV EL+ SQ T+L + N M+EDFRAT++ +R E+ ++N RL+LTM+A+ NQ P +G    +++K+PEPK F G RDAK LEN+IF
Subjt:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF

Query:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
        D+EQYF+AT T +EE K+TLATMHLS DAKLWWR++  DIQ G  T+++W+ LK+ELR+
Subjt:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

A0A5A7UIP7 Reverse transcriptase | e value: 3.6e-43 | % identity: 57.23
Query:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF
        ERG+SST SV  + ERV EL++SQ T+L + N M+EDFRAT++ +R E+ ++N RL+LTM+A+ NQ P  G    +++K+PEPK F G RDAK LEN+IF
Subjt:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF

Query:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
        D+EQYF+AT T +EE K+TLATMHLS DAKLWWR++  DIQ G  T+++W+ LK+ELR+
Subjt:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

A0A5A7V4W1 Uncharacterized protein | e value: 3.6e-43 | % identity: 57.86
Query:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF
        ERG+SS+     M ERV EL++SQ T+L + N+M+EDFR T++ +R E+ +MNTRLNLTM+A+ NQ P  G    +K+K+PEPK F G RDAK LEN+IF
Subjt:  ERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIF

Query:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
         +EQYFKAT T +EE K+TLATMHLS DAKLWWR++  DIQ G  T+++W+ LK+ELR+
Subjt:  DVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA

A0A6J1D906 Reverse transcriptase | e value: 4.4e-57 | % identity: 68.86
Query:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD
        +KAT P S ERG+SST+   ++  R+ ELN S + M++LFNEMTEDF+ TI+TLR EM E++TR+NLTM+AV NQ PNQ +   NK+KVPEPK F+GNRD
Subjt:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD

Query:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR
        AK+LENF+FDVEQYFKAT TTSEEMK+TLATMHL++DAKLWWR+KVNDIQ+G  T+NSW+DLKKELR
Subjt:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR

A0A6J1DK29 uncharacterized protein LOC111021829 | e value: 7.7e-54 | % identity: 66.47
Query:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD
        +KAT P S ERG+SST+   ++  R+ ELN S +TM++LFNEMTEDF+ TI+TLR EM E++TR+NLTM+AV NQ PNQ +   NK+KVPEPK F+GNR 
Subjt:  NKATNPCSTERGESSTSSVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRD

Query:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR
         K+LENF FDVEQYFK T T SE MK+TLATMHL++DAKLWWR+KVNDIQ+G  T+NSW+DLKKELR
Subjt:  AKELENFIFDVEQYFKATVTTSEEMKITLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATAAATGTAGAAGATTTACCCGCTTCTAAAGATCTGCCGCGAATCAAACGGAAATCACGTGGGTCTCAAACCGGGGTGAATTTGAGGCTCAAATTCGAGTGGTTCAC
ACTGAAACTCAAAGAGAACGCGTGGGCAAGGCCGCCGAGATCGAGAAGTCGCGCAGCAATCGTATGGGTGCCGCGCAGAGGTGAAGATAAACGGTTTCTGATCACCGGAG
GAGGAGGAAAACGCTGGACAGAGGCTGCTCGCGGTTGCTACTGGAACTCACGAAAATTTCCTGTAGGCTGCTGTGGCTGCAAGCCCGATCGCGAGGAGCTGTCGTTGACG
AAGCTCGTGGCTGCCGCGCGGAGGACCCTTGGGTGGGGTGAAAGACATCTCATTCATTCTCGTAGAGAATTCCCCTGCACCAGTCTCCTGCCTCTAGAAGTCTCAGAGTC
ATACTGGTGTAGCCATAGTGCAATTGCGCAGCCCAAGGAAGCAAAATTTTCAGCCTTCAGCTTCTTCTCCATCACGTGCAGCCGCCAGCAGATTCCCCTTCTTTTTCTTG
TTAGTTCCAGCCGCCCAAGCTTGCTTTTCGTGCGTTTGTTCTTCTCCAGCCGTGAAAGACTCCTCTCACCGAACGTGGTATTCACCTTTAGCTCGCCGTCGACAGCCTCT
GCACCACCGTTCTCTCTCCGGCGACCAGAAGTCTGCAGTGGCGCACGTGTGAACCCCTCAAACGTTCTCTCCAATGAAGCCTCACAGCGGTTTGGTCCGAGATCCAGCGC
GGCAGCAACGTTTTTCTTCGTGGTTTGGGTTTCAACAGCAACTTGTAGGTGGGCTAAGTGTCTTGAGCATGCTTGTCCAGGGCGAGTTGTGCCTATGACCGCATGTCCAA
ACCATCCTCACCCATCATGTTTTAATGCCATAGTTGGCAATAAGGGTAGAGACCCTGGAAACAAAGCTACGAACCCTTGTAGCACTGAACGTGGAGAAAGTTCCACGAGT
TCCGTCGACAAGATGAACGAGCGGGTCGACGAGTTGAATACATCCCAAACAACTATGCTGCGGTTGTTCAACGAAATGACAGAAGACTTTAGAGCAACCATCGAAACCTT
GAGAGGTGAGATGGACGAGATGAATACTCGCCTGAATCTAACCATGCAAGCAGTGGAGAACCAGACCCCGAACCAAGGGCACGGGGTGCACAACAAGATTAAGGTCCCGG
AACCCAAAGCCTTCAGTGGGAACCGTGACGCGAAAGAACTTGAAAATTTCATCTTCGACGTGGAACAGTATTTCAAAGCCACTGTAACTACTTCAGAAGAGATGAAAATC
ACGTTGGCCACAATGCATCTCTCTAATGATGCAAAGCTATGGTGGCGCACTAAGGTGAACGACATTCAGAGTGGCCTGGCCACAGTTAACTCTTGGGAAGACCTCAAGAA
AGAGTTGAGGGCTTAG
mRNA sequence
ATGATAAATGTAGAAGATTTACCCGCTTCTAAAGATCTGCCGCGAATCAAACGGAAATCACGTGGGTCTCAAACCGGGGTGAATTTGAGGCTCAAATTCGAGTGGTTCAC
ACTGAAACTCAAAGAGAACGCGTGGGCAAGGCCGCCGAGATCGAGAAGTCGCGCAGCAATCGTATGGGTGCCGCGCAGAGGTGAAGATAAACGGTTTCTGATCACCGGAG
GAGGAGGAAAACGCTGGACAGAGGCTGCTCGCGGTTGCTACTGGAACTCACGAAAATTTCCTGTAGGCTGCTGTGGCTGCAAGCCCGATCGCGAGGAGCTGTCGTTGACG
AAGCTCGTGGCTGCCGCGCGGAGGACCCTTGGGTGGGGTGAAAGACATCTCATTCATTCTCGTAGAGAATTCCCCTGCACCAGTCTCCTGCCTCTAGAAGTCTCAGAGTC
ATACTGGTGTAGCCATAGTGCAATTGCGCAGCCCAAGGAAGCAAAATTTTCAGCCTTCAGCTTCTTCTCCATCACGTGCAGCCGCCAGCAGATTCCCCTTCTTTTTCTTG
TTAGTTCCAGCCGCCCAAGCTTGCTTTTCGTGCGTTTGTTCTTCTCCAGCCGTGAAAGACTCCTCTCACCGAACGTGGTATTCACCTTTAGCTCGCCGTCGACAGCCTCT
GCACCACCGTTCTCTCTCCGGCGACCAGAAGTCTGCAGTGGCGCACGTGTGAACCCCTCAAACGTTCTCTCCAATGAAGCCTCACAGCGGTTTGGTCCGAGATCCAGCGC
GGCAGCAACGTTTTTCTTCGTGGTTTGGGTTTCAACAGCAACTTGTAGGTGGGCTAAGTGTCTTGAGCATGCTTGTCCAGGGCGAGTTGTGCCTATGACCGCATGTCCAA
ACCATCCTCACCCATCATGTTTTAATGCCATAGTTGGCAATAAGGGTAGAGACCCTGGAAACAAAGCTACGAACCCTTGTAGCACTGAACGTGGAGAAAGTTCCACGAGT
TCCGTCGACAAGATGAACGAGCGGGTCGACGAGTTGAATACATCCCAAACAACTATGCTGCGGTTGTTCAACGAAATGACAGAAGACTTTAGAGCAACCATCGAAACCTT
GAGAGGTGAGATGGACGAGATGAATACTCGCCTGAATCTAACCATGCAAGCAGTGGAGAACCAGACCCCGAACCAAGGGCACGGGGTGCACAACAAGATTAAGGTCCCGG
AACCCAAAGCCTTCAGTGGGAACCGTGACGCGAAAGAACTTGAAAATTTCATCTTCGACGTGGAACAGTATTTCAAAGCCACTGTAACTACTTCAGAAGAGATGAAAATC
ACGTTGGCCACAATGCATCTCTCTAATGATGCAAAGCTATGGTGGCGCACTAAGGTGAACGACATTCAGAGTGGCCTGGCCACAGTTAACTCTTGGGAAGACCTCAAGAA
AGAGTTGAGGGCTTAG
Protein sequence
MINVEDLPASKDLPRIKRKSRGSQTGVNLRLKFEWFTLKLKENAWARPPRSRSRAAIVWVPRRGEDKRFLITGGGGKRWTEAARGCYWNSRKFPVGCCGCKPDREELSLT
KLVAAARRTLGWGERHLIHSRREFPCTSLLPLEVSESYWCSHSAIAQPKEAKFSAFSFFSITCSRQQIPLLFLVSSSRPSLLFVRLFFSSRERLLSPNVVFTFSSPSTAS
APPFSLRRPEVCSGARVNPSNVLSNEASQRFGPRSSAAATFFFVVWVSTATCRWAKCLEHACPGRVVPMTACPNHPHPSCFNAIVGNKGRDPGNKATNPCSTERGESSTS
SVDKMNERVDELNTSQTTMLRLFNEMTEDFRATIETLRGEMDEMNTRLNLTMQAVENQTPNQGHGVHNKIKVPEPKAFSGNRDAKELENFIFDVEQYFKATVTTSEEMKI
TLATMHLSNDAKLWWRTKVNDIQSGLATVNSWEDLKKELRA
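
The relationship between the CDS and protein sequences above can be spot-checked by translating codons. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene. `translate` is a hypothetical helper, and only short fragments copied from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGATAAATGTAGAAGATTTACCC"))  # first 24 bases of the CDS
print(translate("GAGTTGAGGGCTTAG"))           # last 15 bases of the CDS
```

The two fragments translate to `MINVEDLP` and `ELRA`, which match the first eight and last four residues of the protein sequence listed above.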