CuGenDBv2

Tan0007282 (gene) of Snake gourd v1 genome

Gene ID: Tan0007282
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG09:48689696..48692625
RNA-Seq Expression: Tan0007282
Synteny: Tan0007282
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAF4347816.1 hypothetical protein G4B88_012929 [Cannabis sativa] | 1.3e-33 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa] | 1.0e-35 | 49.68
Query:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF
        +G+SGGLL+LW D  ++SV S+S  H+D+ +K      WRFTGFYG+P    R +SW LL RL GLFDLPW+ GGDFNE+LS  EK GG  +S   ++ F
Subjt:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF

Query:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD
         + + +C LVD GF+G  FTW  K +GV   +ERLDRYF NQ   N    ++   GD
Subjt:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD

KAF4372682.1 hypothetical protein F8388_000849, partial [Cannabis sativa] | 2.3e-35 | 49.68
Query:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF
        +G+SGGLL+LW D  ++SV S+S  H+D+ +K      WRFTGFYG+P    R  SW LL RL GLFDLPW+ GGDFNE+LS  EK GG  +S   ++ F
Subjt:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF

Query:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD
         + + +C LVD GF+G  FTW  K +GV   +ERLDRYF NQ   N    ++   GD
Subjt:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD

KAF4381998.1 hypothetical protein G4B88_006630 [Cannabis sativa] | 1.3e-33 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

KAF4383622.1 hypothetical protein F8388_014122 [Cannabis sativa] | 1.3e-33 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

TrEMBL top hits | e value | % identity | Alignment
A0A7J6DPD3 CCHC-type domain-containing protein | 6.1e-34 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

A0A7J6DZ24 CCHC-type domain-containing protein | 5.0e-36 | 49.68
Query:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF
        +G+SGGLL+LW D  ++SV S+S  H+D+ +K      WRFTGFYG+P    R +SW LL RL GLFDLPW+ GGDFNE+LS  EK GG  +S   ++ F
Subjt:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF

Query:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD
         + + +C LVD GF+G  FTW  K +GV   +ERLDRYF NQ   N    ++   GD
Subjt:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD

A0A7J6FPV7 CCHC-type domain-containing protein | 1.1e-35 | 49.68
Query:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF
        +G+SGGLL+LW D  ++SV S+S  H+D+ +K      WRFTGFYG+P    R  SW LL RL GLFDLPW+ GGDFNE+LS  EK GG  +S   ++ F
Subjt:  MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRN-GEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSF

Query:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD
         + + +C LVD GF+G  FTW  K +GV   +ERLDRYF NQ   N    ++   GD
Subjt:  GECIFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGD

A0A7J6GGL8 CCHC-type domain-containing protein | 6.1e-34 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

A0A7J6GL46 CCHC-type domain-containing protein | 6.1e-34 | 53.24
Query:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC
        SGGLL+LW D  ++SV S++  H+D+ +K    E WRFTGFYG+P    R ESW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ +S   M  F   
Subjt:  SGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGE-WRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGEC

Query:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ
        + RC L D GF+G  FTW  K +G    +ERLDRYF NQ
Subjt:  IFRCKLVDAGFKGDKFTW-RKSRGVDTTKERLDRYFLNQ

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G40390.1 DNAse I-like superfamily protein | 1.7e-04 | 30.85
Query:  KRHESWLLLERLSG---LFDLPWLVGGDFNELLSAGEKNGGAPK--SKKTMNSFGECIFRCKLVDAGFKGDKFTWRKSRGVDTTKERLDRYFLN
        +R   W  + RLS    L + PWLV GDFN++ S  E     P   S + +     C+    LVD   +G  +TW   +  +    +LDR  +N
Subjt:  KRHESWLLLERLSG---LFDLPWLVGGDFNELLSAGEKNGGAPK--SKKTMNSFGECIFRCKLVDAGFKGDKFTWRKSRGVDTTKERLDRYFLN


Sequences
CDS sequence
ATGGGAAGAAGTGGAGGCCTTTTGATGTTGTGGAAGGATAGCATCCAGCTCTCGGTGGTCTCTTACTCAAAGGCGCATATGGACTCTACTATTAAAGATCGTAATGGGGA
GTGGAGGTTTACAGGATTTTATGGGGATCCTTCGGTGGAAAAAAGACATGAGTCATGGCTCCTCCTTGAGCGGCTGAGTGGGTTGTTTGACCTTCCCTGGCTGGTGGGAG
GAGATTTTAATGAGCTCCTCTCGGCTGGAGAAAAAAATGGTGGAGCTCCTAAGAGCAAAAAGACTATGAACTCTTTTGGAGAGTGCATTTTCAGGTGTAAGTTGGTTGAT
GCTGGGTTCAAGGGAGATAAGTTCACGTGGAGAAAGAGCAGAGGTGTTGACACCACTAAGGAGCGCCTCGACAGGTATTTTCTAAACCAAAGCTTGATGAACCGGGTTAA
CAAGCTCAGATGGAGAGTTGGGGATGGTCGGTACATTGACATTGGTGATGACTCATGGATCGCTAGAAAGGGGATTAAGAGACCTATTCTCGCTAGTTATGATCTGGCCA
AAATAAGACTCTGTGAGTTGATAGGAAATGATGGGAGTTGGAAGGAGGAGGATGTTAGAGATGGTTTCACCCATCAAGATATTGATGATATATTAAACACTCCGATCGGC
CCTAGAGGTTCCAAGGACGAGATTATTTGGGGAGAGGACCCAAAAGGTCTCTTTTCGGTCAAAAGTGCTTATATTTTAGCCAAAAATCTTGCTTGA
mRNA sequence
ATGGGAAGAAGTGGAGGCCTTTTGATGTTGTGGAAGGATAGCATCCAGCTCTCGGTGGTCTCTTACTCAAAGGCGCATATGGACTCTACTATTAAAGATCGTAATGGGGA
GTGGAGGTTTACAGGATTTTATGGGGATCCTTCGGTGGAAAAAAGACATGAGTCATGGCTCCTCCTTGAGCGGCTGAGTGGGTTGTTTGACCTTCCCTGGCTGGTGGGAG
GAGATTTTAATGAGCTCCTCTCGGCTGGAGAAAAAAATGGTGGAGCTCCTAAGAGCAAAAAGACTATGAACTCTTTTGGAGAGTGCATTTTCAGGTGTAAGTTGGTTGAT
GCTGGGTTCAAGGGAGATAAGTTCACGTGGAGAAAGAGCAGAGGTGTTGACACCACTAAGGAGCGCCTCGACAGGTATTTTCTAAACCAAAGCTTGATGAACCGGGTTAA
CAAGCTCAGATGGAGAGTTGGGGATGGTCGGTACATTGACATTGGTGATGACTCATGGATCGCTAGAAAGGGGATTAAGAGACCTATTCTCGCTAGTTATGATCTGGCCA
AAATAAGACTCTGTGAGTTGATAGGAAATGATGGGAGTTGGAAGGAGGAGGATGTTAGAGATGGTTTCACCCATCAAGATATTGATGATATATTAAACACTCCGATCGGC
CCTAGAGGTTCCAAGGACGAGATTATTTGGGGAGAGGACCCAAAAGGTCTCTTTTCGGTCAAAAGTGCTTATATTTTAGCCAAAAATCTTGCTTGA
Protein sequence
MGRSGGLLMLWKDSIQLSVVSYSKAHMDSTIKDRNGEWRFTGFYGDPSVEKRHESWLLLERLSGLFDLPWLVGGDFNELLSAGEKNGGAPKSKKTMNSFGECIFRCKLVD
AGFKGDKFTWRKSRGVDTTKERLDRYFLNQSLMNRVNKLRWRVGDGRYIDIGDDSWIARKGIKRPILASYDLAKIRLCELIGNDGSWKEEDVRDGFTHQDIDDILNTPIG
PRGSKDEIIWGEDPKGLFSVKSAYILAKNLA