; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020050 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020050
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr5:47666319..47668429
RNA-Seq ExpressionLag0020050
SyntenyLag0020050
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW15385.1 putative ribonuclease H protein [Vitis vinifera]2.5e-3133.89Show/hide
Query:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR
        G  ++K+   R ++  +   NPDVV+LQE++    DRR+V SIW  + + W+A  A  ++GGI+++W        + V G+F VT+  +   +   W++ 
Subjt:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR

Query:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL
        VYGP     R     EL DL GL    WC+ GDFN                         +  EALP   R  S H  +        WGPTPFRFEN WL
Subjt:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL

Query:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
         + +FK K   WW++    GW G K M KL+ +K K+ +
Subjt:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

RVW55793.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.5e-3133.89Show/hide
Query:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR
        G  ++K+   R ++  +   NPDVV+LQE++    DRR+V SIW  + + W+A  A  ++GGI+++W        + V G+F VT+  +   +   W++ 
Subjt:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR

Query:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL
        VYGP     R     EL DL GL    WC+ GDFN                         +  EALP   R  S H  +        WGPTPFRFEN WL
Subjt:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL

Query:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
         + +FK K   WW++    GW G K M KL+ +K K+ +
Subjt:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

RVW70784.1 hypothetical protein CK203_058030 [Vitis vinifera]6.1e-3336.21Show/hide
Query:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR
        G  +KK+   R ++  +   NPD+V+LQE++  + DRR V S+W  + + W A  A  ++GGI+++W  S  E    V G+F VT+  +   +   W++ 
Subjt:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR

Query:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------KSG-----------EALPSYLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKG
        VY P N   R     EL DL GL    WC+ GDFN       K G           EALP   R  S H ++      LKWG TPFRF+N WL + +FK 
Subjt:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------KSG-----------EALPSYLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKG

Query:  KVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
        K  +WW++    GW G K M KL+ +K K+ +
Subjt:  KVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

RVW83303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.1e-3133.89Show/hide
Query:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR
        G  ++K+   R ++  +   NPDVV+LQE++    DRR+V SIW  + + W+A  A  ++GGI+++W        + V G+F VT+  +   ++  W++ 
Subjt:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR

Query:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL
        VYGP     R     EL DL GL    WC+ GDFN                         +  EALP   R  S H  +        WGPTPFRFEN WL
Subjt:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------------------------KSGEALPSYLRPLSHHDVV----RALKWGPTPFRFENAWL

Query:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
         + +FK K   WW++    GW G K M KL+ +K K+ +
Subjt:  DNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

XP_010263157.1 PREDICTED: uncharacterized protein LOC104601500 [Nelumbo nucifera]4.3e-3137.56Show/hide
Query:  LKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWI-SRVYGPCNYKDRT
        +K+++ + NPD+ ++QES+L ++D+R V+S+W +  ++W+   +  S+GGI+ +WKD V+E  + + G F V++       +  W+ + VYGP  YK R 
Subjt:  LKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWI-SRVYGPCNYKDRT

Query:  QCLQELYDLKGLYQGIWCLVGDFNKSGEALPSYLRPLSHHDVVRALK-----WGPTPFRFENAWLDNRDFKGKVELWWKDLNP-FGWAGFKLMEKLRGLK
            EL D++G ++  W   GD      ALP   R  S H  +   K      GP+PFRFEN WL + DFK KV+ WW ++NP   WAG K   KL+ LK
Subjt:  QCLQELYDLKGLYQGIWCLVGDFNKSGEALPSYLRPLSHHDVVRALK-----WGPTPFRFENAWLDNRDFKGKVELWWKDLNP-FGWAGFKLMEKLRGLK

Query:  LKITQ
        +KI +
Subjt:  LKITQ

TrEMBL top hitse value%identityAlignment
A0A438GEZ6 Uncharacterized protein2.9e-3336.21Show/hide
Query:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR
        G  +KK+   R ++  +   NPD+V+LQE++  + DRR V S+W  + + W A  A  ++GGI+++W  S  E    V G+F VT+  +   +   W++ 
Subjt:  GTSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISR

Query:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------KSG-----------EALPSYLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKG
        VY P N   R     EL DL GL    WC+ GDFN       K G           EALP   R  S H ++      LKWG TPFRF+N WL + +FK 
Subjt:  VYGPCNYKDRTQCLQELYDLKGLYQGIWCLVGDFN-------KSG-----------EALPSYLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKG

Query:  KVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
        K  +WW++    GW G K M KL+ +K K+ +
Subjt:  KVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

A0A803PZR9 Uncharacterized protein1.5e-3741.41Show/hide
Query:  RGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCNYKDR
        R +K  IC+IN D+VILQE +  ++DR  + +IW SR  AWI   A   +GG LL+W    + V DS+ G F ++     +G+   W   +YGPC+YK R
Subjt:  RGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCNYKDR

Query:  TQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPSYLRPLSHHDVV---RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLR
             EL  LK +    WCL GDFN   + GE L S     +H  VV      KWG +PFRF+N WL+N+ F    E+WW   N  GW G + M KLR
Subjt:  TQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPSYLRPLSHHDVV---RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLR

A0A803QEA6 Uncharacterized protein1.3e-3333.21Show/hide
Query:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN
        +G    +K  IC+ NPD+VILQE +  +VDRR + SIW SR  AWI   AI  +GG LL+W    I V DS+ G F +++  + +G++  W S VYGPC+
Subjt:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN

Query:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------
        YK R +   EL  L  +    WC+ GDFN   + GE L S                                                            
Subjt:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------

Query:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
            +R +S H  V       KWGP PFRF+N WLD++ F    E WWK+    GW G K M+KL+ L+ K+ +
Subjt:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

A0A803QI00 Uncharacterized protein2.5e-3232.12Show/hide
Query:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN
        +G    +K  IC+ NPD+VILQE +  SVDRR + SIW SR  AWI   AI  +GG LL+W    I V DS+ G F +++    +G+   W S VYGPC+
Subjt:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN

Query:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------
        YK R     EL  L  +    WC+ GDFN   + GE L S                                                            
Subjt:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------

Query:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
            +R +S H  V       +WGP PFRF+N WL++  F      WWK+ +  GW G K M KL+  + K+ +
Subjt:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

A0A803QQM3 Uncharacterized protein1.1e-3232.48Show/hide
Query:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN
        +G    +K  IC+ NPD+VILQE +  +VDRR + SIW SR  AWI   A+  +GG LL+W    I V DS+ G F +++  + +G++  W S VYGPC+
Subjt:  EGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCN

Query:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------
        YK R +   EL  L  +    WC+ GDFN   + GE L S                                                            
Subjt:  YKDRTQCLQELYDLKGLYQGIWCLVGDFN---KSGEALPS------------------------------------------------------------

Query:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ
            +R +S H  V       KWGP PFRF+N WL+++ F    E WWK+    GW G K M+KL+ L+ K+ +
Subjt:  ---YLRPLSHHDVV----RALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTGGCGCCTCAACGTATTTAAAGCTATAGGTGATTGCCTTGGAGGCTTCATTGAATATGAAGAGTCGAATTCTCTCCTCATTGATTGCGTGGACTTAAAGATGAA
AATTAAATATAATTACTATGGATTTATTCCTGCTGAAGTTCGAATCATGGATGGAAATAACCACTTTCATATTCAGATTGTCACTTTTCAAGAAGGAAATTTGCTCATTG
ACAGGGTCGCCGGCATCCATGGTAGCTTCTCACCGACAGCGGCTCATGCCTTTCATCGCGGTCCAAATGATCCCTTCTTTTGCCCGGTAGACATATGGAGGATTGAAGAT
GGCTTAGTTTATCAGTTGGTTAGTATCCAAAAGAAGCCCGTCAGTTTGAACGAGGATAATAGACAGCTGGAAAGCGCTAATGAGATTTTTGAAATATACCGCCAAAAGTC
CTTGCTAGGGTTTAGTGAGGAAGCTGGGCTGTCCCATGTGACTAATAGTCAGGCCCAAGCCGAGGATGCAAAACAATGTACGAGCGATAGCAAAAAGCCAATGGGACCAA
GTGACACGTTTGCTGCTGAGTGGAATGGGAAGGAGGCCTCGTCTCTGATTTCACCATGTCAAGCCCAGTTAGCAAAGGCTCCGAAATCTGGGAAGAAGAAATCTCGTGTA
GAAATTAGTGAGACCCTGCCTGACGGATATTATCAATGTTTTGAAGTTGATAATGAGATAGTTAGAAGTTTGCAACCGCAAGAGGATGAGATAGGAATGAATCTTTCGAA
CAGCATGGTGAACCAAGATACGGATCCTCCTCGAGAGGAACTCTTAGAAAGAACTTGCACTGATAATGTTACTGAGACTTCTCCAAAGGCTTTGGCTTGCATACCGCCGA
TAGAGGTCGCTTCGAACAAGAACAATCAGTGTGTTGAAGGATTTGCTATCAGCAAAGAGCTGGTGATAACGCTTAGGAAGAATAACTTGTGTATTAGACCCATTGTGGGG
ACTAGTAACAAAAAAGAGGGTTGGGAGCGAGGTCTAAAAGACCTCATTTGTAGGATTAATCCTGATGTAGTTATCTTGCAAGAGTCTAGATTGAATTCTGTTGATAGGCG
GATTGTGAAATCCATTTGGAGTTCCAGGCATATAGCGTGGATTGCCCGGGATGCTATTAGTTCGGCAGGAGGAATTTTACTCATGTGGAAGGATTCTGTTATAGAGGTTC
CGGACTCTGTGTTCGGGGCATTTTTTGTTACTTTGCATTGTTCTTTTCAGGGCCAGAAGGTGGGGTGGATCTCTAGGGTCTATGGGCCATGTAACTATAAGGATAGAACG
CAATGTCTGCAAGAATTGTATGACTTGAAGGGTCTTTACCAAGGCATCTGGTGCCTGGTGGGAGATTTTAATAAGAGTGGAGAAGCTCTACCGTCCTACCTCAGACCACT
TTCCCATCATGATGTCGTTAGGGCTCTTAAGTGGGGCCCAACCCCGTTTAGATTTGAAAATGCTTGGCTTGATAACCGAGACTTCAAAGGCAAAGTTGAGTTGTGGTGGA
AGGATTTAAACCCGTTTGGCTGGGCTGGTTTCAAGCTGATGGAGAAGCTGAGAGGTTTGAAACTCAAAATCACTCAAAATCAAGGAATGGAGCAAGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTGGCGCCTCAACGTATTTAAAGCTATAGGTGATTGCCTTGGAGGCTTCATTGAATATGAAGAGTCGAATTCTCTCCTCATTGATTGCGTGGACTTAAAGATGAA
AATTAAATATAATTACTATGGATTTATTCCTGCTGAAGTTCGAATCATGGATGGAAATAACCACTTTCATATTCAGATTGTCACTTTTCAAGAAGGAAATTTGCTCATTG
ACAGGGTCGCCGGCATCCATGGTAGCTTCTCACCGACAGCGGCTCATGCCTTTCATCGCGGTCCAAATGATCCCTTCTTTTGCCCGGTAGACATATGGAGGATTGAAGAT
GGCTTAGTTTATCAGTTGGTTAGTATCCAAAAGAAGCCCGTCAGTTTGAACGAGGATAATAGACAGCTGGAAAGCGCTAATGAGATTTTTGAAATATACCGCCAAAAGTC
CTTGCTAGGGTTTAGTGAGGAAGCTGGGCTGTCCCATGTGACTAATAGTCAGGCCCAAGCCGAGGATGCAAAACAATGTACGAGCGATAGCAAAAAGCCAATGGGACCAA
GTGACACGTTTGCTGCTGAGTGGAATGGGAAGGAGGCCTCGTCTCTGATTTCACCATGTCAAGCCCAGTTAGCAAAGGCTCCGAAATCTGGGAAGAAGAAATCTCGTGTA
GAAATTAGTGAGACCCTGCCTGACGGATATTATCAATGTTTTGAAGTTGATAATGAGATAGTTAGAAGTTTGCAACCGCAAGAGGATGAGATAGGAATGAATCTTTCGAA
CAGCATGGTGAACCAAGATACGGATCCTCCTCGAGAGGAACTCTTAGAAAGAACTTGCACTGATAATGTTACTGAGACTTCTCCAAAGGCTTTGGCTTGCATACCGCCGA
TAGAGGTCGCTTCGAACAAGAACAATCAGTGTGTTGAAGGATTTGCTATCAGCAAAGAGCTGGTGATAACGCTTAGGAAGAATAACTTGTGTATTAGACCCATTGTGGGG
ACTAGTAACAAAAAAGAGGGTTGGGAGCGAGGTCTAAAAGACCTCATTTGTAGGATTAATCCTGATGTAGTTATCTTGCAAGAGTCTAGATTGAATTCTGTTGATAGGCG
GATTGTGAAATCCATTTGGAGTTCCAGGCATATAGCGTGGATTGCCCGGGATGCTATTAGTTCGGCAGGAGGAATTTTACTCATGTGGAAGGATTCTGTTATAGAGGTTC
CGGACTCTGTGTTCGGGGCATTTTTTGTTACTTTGCATTGTTCTTTTCAGGGCCAGAAGGTGGGGTGGATCTCTAGGGTCTATGGGCCATGTAACTATAAGGATAGAACG
CAATGTCTGCAAGAATTGTATGACTTGAAGGGTCTTTACCAAGGCATCTGGTGCCTGGTGGGAGATTTTAATAAGAGTGGAGAAGCTCTACCGTCCTACCTCAGACCACT
TTCCCATCATGATGTCGTTAGGGCTCTTAAGTGGGGCCCAACCCCGTTTAGATTTGAAAATGCTTGGCTTGATAACCGAGACTTCAAAGGCAAAGTTGAGTTGTGGTGGA
AGGATTTAAACCCGTTTGGCTGGGCTGGTTTCAAGCTGATGGAGAAGCTGAGAGGTTTGAAACTCAAAATCACTCAAAATCAAGGAATGGAGCAAGGATAA
Protein sequenceShow/hide protein sequence
MAWRLNVFKAIGDCLGGFIEYEESNSLLIDCVDLKMKIKYNYYGFIPAEVRIMDGNNHFHIQIVTFQEGNLLIDRVAGIHGSFSPTAAHAFHRGPNDPFFCPVDIWRIED
GLVYQLVSIQKKPVSLNEDNRQLESANEIFEIYRQKSLLGFSEEAGLSHVTNSQAQAEDAKQCTSDSKKPMGPSDTFAAEWNGKEASSLISPCQAQLAKAPKSGKKKSRV
EISETLPDGYYQCFEVDNEIVRSLQPQEDEIGMNLSNSMVNQDTDPPREELLERTCTDNVTETSPKALACIPPIEVASNKNNQCVEGFAISKELVITLRKNNLCIRPIVG
TSNKKEGWERGLKDLICRINPDVVILQESRLNSVDRRIVKSIWSSRHIAWIARDAISSAGGILLMWKDSVIEVPDSVFGAFFVTLHCSFQGQKVGWISRVYGPCNYKDRT
QCLQELYDLKGLYQGIWCLVGDFNKSGEALPSYLRPLSHHDVVRALKWGPTPFRFENAWLDNRDFKGKVELWWKDLNPFGWAGFKLMEKLRGLKLKITQNQGMEQG