; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006296 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006296
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr6:40768766..40774842
RNA-Seq ExpressionLag0006296
SyntenyLag0006296
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.0e-1333.64Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI
        +SSDDHIL IL GLG  Y   +SVI++  + P +Q V SLLLTQES+    + + +  +  LPSVN+                        S + + G  
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI

Query:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR
           SNR     +NK Q      LG    +            G+SP  +        +    +A     D+N ++ WYPDSGA+NH+T+ LSNLSIG EY 
Subjt:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR

Query:  RDNKIHIDNGAVLSQCH
          N+I+  NG+ L   H
Subjt:  RDNKIHIDNGAVLSQCH

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.0e-1333.64Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI
        +SSDDHIL IL GLG  Y   +SVI++  + P +Q V SLLLTQES+    + + +  +  LPSVN+                        S + + G  
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI

Query:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR
           SNR     +NK Q      LG    +            G+SP  +        +    +A     D+N ++ WYPDSGA+NH+T+ LSNLSIG EY 
Subjt:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR

Query:  RDNKIHIDNGAVLSQCH
          N+I+  NG+ L   H
Subjt:  RDNKIHIDNGAVLSQCH

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.1e-1837.61Show/hide
Query:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----
        +DHI+ IL GL  +++ TVSVI++  +   LQ VYSLLL+ E R +R+S   +N DGTLPSVNLTQ    S S QS  G  P   N ++K+ G PN    
Subjt:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----

Query:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN
                                      LG   Q H  H FS   N            QQQ     FQ  S     A+  Q D N++  WYPDSGA+N
Subjt:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN

Query:  HVTNDLSNLSIGMEYRRDNKIHIDNG
        HVT++ +NL+   EY  DN++ I NG
Subjt:  HVTNDLSNLSIGMEYRRDNKIHIDNG

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]1.1e-1837.61Show/hide
Query:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----
        +DHI+ IL GL  +++ TVSVI++  +   LQ VYSLLL+ E R +R+S   +N DGTLPSVNLTQ    S S QS  G  P   N ++K+ G PN    
Subjt:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----

Query:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN
                                      LG   Q H  H FS   N            QQQ     FQ  S     A+  Q D N++  WYPDSGA+N
Subjt:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN

Query:  HVTNDLSNLSIGMEYRRDNKIHIDNG
        HVT++ +NL+   EY  DN++ I NG
Subjt:  HVTNDLSNLSIGMEYRRDNKIHIDNG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.5e-1736.17Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLT-------------------QSRSMQSGPIPNN
        +S++DHI+ IL GLGP++D  +SVIT+      LQ V SLLL QE R +R+    +N DG+LPSVNLT                   QS   Q G   NN
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLT-------------------QSRSMQSGPIPNN

Query:  --SNRK-----NKDQ-----------------------GPN------SPS-LGKGF----QQHHGFSPAPNQQQQFQHQSQQP--FTAYTLQHDMNKENQ
          SNR+     NK Q                       GPN      SP+    GF      H+ FS    Q   F   S  P    A  +  D N+++ 
Subjt:  --SNRK-----NKDQ-----------------------GPN------SPS-LGKGF----QQHHGFSPAPNQQQQFQHQSQQP--FTAYTLQHDMNKENQ

Query:  WYPDSGASNHVTNDLSNLSIGMEYRRDNKIHIDNG
        WY DSG +NHVTN+  N S+G EY  D KI + NG
Subjt:  WYPDSGASNHVTNDLSNLSIGMEYRRDNKIHIDNG

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-1433.64Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI
        +SSDDHIL IL GLG  Y   +SVI++  + P +Q V SLLLTQES+    + + +  +  LPSVN+                        S + + G  
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI

Query:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR
           SNR     +NK Q      LG    +            G+SP  +        +    +A     D+N ++ WYPDSGA+NH+T+ LSNLSIG EY 
Subjt:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR

Query:  RDNKIHIDNGAVLSQCH
          N+I+  NG+ L   H
Subjt:  RDNKIHIDNGAVLSQCH

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-1433.64Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI
        +SSDDHIL IL GLG  Y   +SVI++  + P +Q V SLLLTQES+    + + +  +  LPSVN+                        S + + G  
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNL----------------------TQSRSMQSGPI

Query:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR
           SNR     +NK Q      LG    +            G+SP  +        +    +A     D+N ++ WYPDSGA+NH+T+ LSNLSIG EY 
Subjt:  PNNSNR-----KNKDQGPNSPSLGKGFQQ----------HHGFSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYR

Query:  RDNKIHIDNGAVLSQCH
          N+I+  NG+ L   H
Subjt:  RDNKIHIDNGAVLSQCH

A0A6J1C6N9 dr1-associated corepressor homolog isoform X15.1e-1937.61Show/hide
Query:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----
        +DHI+ IL GL  +++ TVSVI++  +   LQ VYSLLL+ E R +R+S   +N DGTLPSVNLTQ    S S QS  G  P   N ++K+ G PN    
Subjt:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----

Query:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN
                                      LG   Q H  H FS   N            QQQ     FQ  S     A+  Q D N++  WYPDSGA+N
Subjt:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN

Query:  HVTNDLSNLSIGMEYRRDNKIHIDNG
        HVT++ +NL+   EY  DN++ I NG
Subjt:  HVTNDLSNLSIGMEYRRDNKIHIDNG

A0A6J1C8R2 dr1-associated corepressor homolog isoform X25.1e-1937.61Show/hide
Query:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----
        +DHI+ IL GL  +++ TVSVI++  +   LQ VYSLLL+ E R +R+S   +N DGTLPSVNLTQ    S S QS  G  P   N ++K+ G PN    
Subjt:  DDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQ----SRSMQS--GPIPNNSNRKNKDQG-PN----

Query:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN
                                      LG   Q H  H FS   N            QQQ     FQ  S     A+  Q D N++  WYPDSGA+N
Subjt:  ---------------------------SPSLGKGFQQH--HGFSPAPN-----------QQQQ-----FQHQSQQPFTAYTLQHDMNKENQWYPDSGASN

Query:  HVTNDLSNLSIGMEYRRDNKIHIDNG
        HVT++ +NL+   EY  DN++ I NG
Subjt:  HVTNDLSNLSIGMEYRRDNKIHIDNG

A0A6J1DLT9 uncharacterized protein LOC1110217572.2e-1736.17Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLT-------------------QSRSMQSGPIPNN
        +S++DHI+ IL GLGP++D  +SVIT+      LQ V SLLL QE R +R+    +N DG+LPSVNLT                   QS   Q G   NN
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLT-------------------QSRSMQSGPIPNN

Query:  --SNRK-----NKDQ-----------------------GPN------SPS-LGKGF----QQHHGFSPAPNQQQQFQHQSQQP--FTAYTLQHDMNKENQ
          SNR+     NK Q                       GPN      SP+    GF      H+ FS    Q   F   S  P    A  +  D N+++ 
Subjt:  --SNRK-----NKDQ-----------------------GPN------SPS-LGKGF----QQHHGFSPAPNQQQQFQHQSQQP--FTAYTLQHDMNKENQ

Query:  WYPDSGASNHVTNDLSNLSIGMEYRRDNKIHIDNG
        WY DSG +NHVTN+  N S+G EY  D KI + NG
Subjt:  WYPDSGASNHVTNDLSNLSIGMEYRRDNKIHIDNG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.8e-0825.7Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQSRSMQSGPIPNNSNRKNKDQGPNSPSLGKGF
        M  D+ + R+L  L  +Y P +  I + +  P L  ++  LL  ES++   S+ TV P     + N    R+  +    NN NR N+    N+ +  K +
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQSRSMQSGPIPNNSNRKNKDQGPNSPSLGKGF

Query:  QQHH-GFSPAPNQQQ--------------------QFQH--------QSQQPFTAYTLQHDM-----NKENQWYPDSGASNHVTNDLSNLSIGMEYRRDN
        QQ    F P  NQ +                    Q QH        Q   PFT +  + ++        N W  DSGA++H+T+D +NLS+   Y   +
Subjt:  QQHH-GFSPAPNQQQ--------------------QFQH--------QSQQPFTAYTLQHDM-----NKENQWYPDSGASNHVTNDLSNLSIGMEYRRDN

Query:  KIHIDNGAVLSQCH
         + + +G+ +   H
Subjt:  KIHIDNGAVLSQCH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.3e-0727.03Show/hide
Query:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNP------DGTLPSVNLTQSRSMQSGPIPNNSNRKNKDQGPNS-
        M  D+ + R+L  L   Y P +  I + +  P L  ++  L+ +ES+L   ++  V P           + N  Q+    +    NN+NR N  Q  +S 
Subjt:  MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNP------DGTLPSVNLTQSRSMQSGPIPNNSNRKNKDQGPNS-

Query:  ---------PSLG------------KGFQQHHGFSPAPNQQQQFQHQSQQPFTAYTLQHDM-----NKENQWYPDSGASNHVTNDLSNLSIGMEYRRDNK
                 P LG            K   Q H F    NQQ     QS  PFT +  + ++        N W  DSGA++H+T+D +NLS    Y   + 
Subjt:  ---------PSLG------------KGFQQHHGFSPAPNQQQQFQHQSQQPFTAYTLQHDM-----NKENQWYPDSGASNHVTNDLSNLSIGMEYRRDNK

Query:  IHIDNGAVLSQCHR-DAERPTS
        + I +G+ +   H   A  PTS
Subjt:  IHIDNGAVLSQCHR-DAERPTS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTGATGATCATATTCTACGAATTCTTGTCGGTCTTGGTCCTAAATATGACCCTACGGTCTCTGTGATTACTAGTAATGAGGAGATGCCTCCTCTGCAA
AGAGTCTATTCACTATTGTTAACTCAAGAGAGTCGTCTACAACGTCATTCTACCACCACTGTGAATCCTGATGGAACTCTTCCCTCGGTAAACTTGACTCAGTCC
AGATCGATGCAATCAGGACCCATTCCTAATAACTCTAATCGAAAGAACAAGGACCAAGGTCCAAATTCTCCATCTCTAGGGAAAGGTTTTCAGCAACACCATGGC
TTTTCCCCTGCTCCTAATCAGCAGCAACAGTTTCAACATCAATCTCAACAGCCCTTTACCGCATATACCTTGCAACATGATATGAATAAAGAAAATCAGTGGTAC
CCGGATTCAGGAGCATCCAATCATGTTACAAATGACTTATCCAATCTGTCTATTGGAATGGAGTATCGAAGAGATAATAAAATCCACATCGACAATGGTGCAGTT
TTGTCGCAATGTCATCGCGACGCGGAGCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGTGAGAGGTTTGGAGTCGCCACCAATCATCTTGGGGAGCCTTGT
TGGAAAAAAAAGCGGCGGCACGGTGGCGGCGGTGACAGTGGTGGCGGCAACGGCGGTACGGCGGTGAGCGGCGGCGGCGGTACGACGGTGAGCGGCGGCGACGGT
ACGGTGACCAGGCTTCGACCTCTACTGACTAGCGGGGTTCGATTTAACCAAAAGAAAAGGGAATGGGACCTGGCTCAGACCAGGATTCGACCTCTACTGACTAGC
AGGACCAGGCTTCGACCTCTACTGACTAGCGGGTTCGATTTAACCAAAAGAAAAGGGGAATGGGACCTGGCTCAGACCAGGCTTCGACCTCTACTGACTAGCAGG
ACCAGGCTTCGACCTCTACTGACTAGCGGGGTTCGATTTAACCAAAAGAAAAGGGGAATGGACCTGGCTCAGACCAGGCTTCGACCTCTACTGACTAGCAGGATT
GAGAGACATAGACGAAGTGAAGTAAGCTGTTTTGCTGACGATGAACCAAAACACTTTGAGCTTTCTATTCTTGAATGTAGAGAACTGCTCGAGATTGGAAGATTC
TTCATAAAGAAGCGTGATTCTCTTAACTTGGCTCTTGCCCCAGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTGATGATCATATTCTACGAATTCTTGTCGGTCTTGGTCCTAAATATGACCCTACGGTCTCTGTGATTACTAGTAATGAGGAGATGCCTCCTCTGCAA
AGAGTCTATTCACTATTGTTAACTCAAGAGAGTCGTCTACAACGTCATTCTACCACCACTGTGAATCCTGATGGAACTCTTCCCTCGGTAAACTTGACTCAGTCC
AGATCGATGCAATCAGGACCCATTCCTAATAACTCTAATCGAAAGAACAAGGACCAAGGTCCAAATTCTCCATCTCTAGGGAAAGGTTTTCAGCAACACCATGGC
TTTTCCCCTGCTCCTAATCAGCAGCAACAGTTTCAACATCAATCTCAACAGCCCTTTACCGCATATACCTTGCAACATGATATGAATAAAGAAAATCAGTGGTAC
CCGGATTCAGGAGCATCCAATCATGTTACAAATGACTTATCCAATCTGTCTATTGGAATGGAGTATCGAAGAGATAATAAAATCCACATCGACAATGGTGCAGTT
TTGTCGCAATGTCATCGCGACGCGGAGCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGTGAGAGGTTTGGAGTCGCCACCAATCATCTTGGGGAGCCTTGT
TGGAAAAAAAAGCGGCGGCACGGTGGCGGCGGTGACAGTGGTGGCGGCAACGGCGGTACGGCGGTGAGCGGCGGCGGCGGTACGACGGTGAGCGGCGGCGACGGT
ACGGTGACCAGGCTTCGACCTCTACTGACTAGCGGGGTTCGATTTAACCAAAAGAAAAGGGAATGGGACCTGGCTCAGACCAGGATTCGACCTCTACTGACTAGC
AGGACCAGGCTTCGACCTCTACTGACTAGCGGGTTCGATTTAACCAAAAGAAAAGGGGAATGGGACCTGGCTCAGACCAGGCTTCGACCTCTACTGACTAGCAGG
ACCAGGCTTCGACCTCTACTGACTAGCGGGGTTCGATTTAACCAAAAGAAAAGGGGAATGGACCTGGCTCAGACCAGGCTTCGACCTCTACTGACTAGCAGGATT
GAGAGACATAGACGAAGTGAAGTAAGCTGTTTTGCTGACGATGAACCAAAACACTTTGAGCTTTCTATTCTTGAATGTAGAGAACTGCTCGAGATTGGAAGATTC
TTCATAAAGAAGCGTGATTCTCTTAACTTGGCTCTTGCCCCAGTCTAA
Protein sequenceShow/hide protein sequence
MSSDDHILRILVGLGPKYDPTVSVITSNEEMPPLQRVYSLLLTQESRLQRHSTTTVNPDGTLPSVNLTQSRSMQSGPIPNNSNRKNKDQGPNSPSLGKGFQQHHG
FSPAPNQQQQFQHQSQQPFTAYTLQHDMNKENQWYPDSGASNHVTNDLSNLSIGMEYRRDNKIHIDNGAVLSQCHRDAERPTSFPKGLFRGERFGVATNHLGEPC
WKKKRRHGGGGDSGGGNGGTAVSGGGGTTVSGGDGTVTRLRPLLTSGVRFNQKKREWDLAQTRIRPLLTSRTRLRPLLTSGFDLTKRKGEWDLAQTRLRPLLTSR
TRLRPLLTSGVRFNQKKRGMDLAQTRLRPLLTSRIERHRRSEVSCFADDEPKHFELSILECRELLEIGRFFIKKRDSLNLALAPV