CuGenDBv2

Tan0004328 (gene) of Snake gourd v1 genome

Gene ID: Tan0004328
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG08:13383650..13384411
RNA-Seq Expression: Tan0004328
Synteny: Tan0004328
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits (e value, % identity, alignment)

XP_022158565.1 uncharacterized protein LOC111025018 [Momordica charantia] (e value: 4.9e-45; % identity: 47.27)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED
        GFGWN++ KC+EAEKEVFD WVKSH NAKG+RNKP PHYD+L V FGKDRATG   + P +MA + A  + E+    +QDF   +          D  E+
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED

Query:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKIDC
        DLP+TPTS + T G SS   GSKRKRS + +E++DVVRT       H++++ +W  +K E + +RRK V D L QI  L  +D V L+ +L  +++K   
Subjt:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKIDC

Query:  FLQVPPQSRKAYCMRLLGRT
        FL+VP + +K +CM+LLG++
Subjt:  FLQVPPQSRKAYCMRLLGRT

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] (e value: 1.7e-45; % identity: 49.55)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGF WNEEFKCV+ E+E+FD WV+SH NAKGM  KPFPHYD+L+ VFGKDRA      TP                         E R  E+P   D  +
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID
        ++  +  T R +    SSR  GSKRKRSSFQ E+ID+V++       HM +L SWQ EKYELE    KEVV+ +Y I+ L E+D+V+LIDL+  DI+K D
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+RLLGR
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e value: 1.2e-43; % identity: 46.82)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGF WNEEFKCV+ E+E+FD WV+SH NAKGM NKPFPHYD+L+ VFGK +A G  +E P  M  N   + E+EIRLGSQD                   
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID
             TP S                                      HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V+LIDLL  DI+K +
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+RLLGR
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] (e value: 5.8e-46; % identity: 50)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGFGWNEEFKCV+ EKE+FD WV+SH NAKGM NK F HYD+L+ VFGKDRA      TP                         E    E+P   D  +
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID
        ++  +  T R +    SSR  GSKRKR SFQ E+ID++R+       HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V+ IDLL  DI+K D
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKID

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+ LL R
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida] (e value: 7.3e-41; % identity: 52.69)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGFGWNEEFKCV+ EKE+F+   +SH NAKGM NK FPHYD+L+ VFGKDRA G  +E P  MA N   + E+EIRLGSQD    E R  E+P   D  +
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRV
        ++  +  T R +    +S+  GSKRKR SFQ E+ID++R+       HM +L SWQKEKYELE    KEVV+ +Y I+GL E DR+
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRV

TrEMBL top hits (e value, % identity, alignment)

A0A1S3B4L3 uncharacterized protein LOC103485953 (e value: 7.2e-34; % identity: 38.16)
Query:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPG
        M G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AKG+ +K FP+YD+L+ VFGKDRATG  +ET   +  NV+    + I LG  D    +  TM + G
Subjt:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPG

Query:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLID
        +  +  D++        + RRN S +      SKRKR S + E ++V+R+   + +  ++ +  W KEK  +E   R +VV  L  I  L   DR  L+ 
Subjt:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLID

Query:  LLWIDIRKIDCFLQVPPQSRKAYCMRLL
        +L+  +  I+ FL +P + +  YC  LL
Subjt:  LLWIDIRKIDCFLQVPPQSRKAYCMRLL

A0A5A7U0H7 Retrotransposon protein (e value: 7.2e-34; % identity: 38.16)
Query:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPG
        M G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AKG+ +K FP+YD+L+ VFGKDRATG  +ET   +  NV+    + I LG  D    +  TM + G
Subjt:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPG

Query:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLID
        +  +  D++        + RRN S +      SKRKR S + E ++V+R+   + +  ++ +  W KEK  +E   R +VV  L  I  L   DR  L+ 
Subjt:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLID

Query:  LLWIDIRKIDCFLQVPPQSRKAYCMRLL
        +L+  +  I+ FL +P + +  YC  LL
Subjt:  LLWIDIRKIDCFLQVPPQSRKAYCMRLL

A0A5A7U4M2 Retrotransposon protein (e value: 8.2e-30; % identity: 39.27)
Query:  CSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVG
        CSGFGWN+E KC+ AEKEVFD WVKSH  AKG+ NK F HYDEL+ VFGKDRATG  AE+  ++  N     +      +   + T+   M +PGL ++ 
Subjt:  CSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVG

Query:  EDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKI
         DDL +T T+R   S   +  +GSK KR    T+  D+VRT   Y +  + ++  W   + +     R+E+V  L  I  LT  DR  L+ ++  ++  +
Subjt:  EDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKI

Query:  DCFLQVPPQSRKAYCMRLL
          FL+VP   +  YC  +L
Subjt:  DCFLQVPPQSRKAYCMRLL

A0A5D3C7T4 Uncharacterized protein (e value: 9.7e-31; % identity: 40.85)
Query:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENP-
        M+G  CSGFGWNE  KC+E EK VFD WVK H NA+G+ NKPFP++ +L VVFG+DRATG   +TP EM+   A   EE+      D I  E   + NP 
Subjt:  MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENP-

Query:  GLGDVGEDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLW
        GL     +D+P TPTS  + +G SSR     +KR S+  +L+D  R         + ++ +WQ+EK E+E+S  K +   L  I G+   D + + + L 
Subjt:  GLGDVGEDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLW

Query:  IDIRKIDCFLQVP
         D   +  FL  P
Subjt:  IDIRKIDCFLQVP

A0A6J1DW73 uncharacterized protein LOC111025018 (e value: 2.4e-45; % identity: 47.27)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED
        GFGWN++ KC+EAEKEVFD WVKSH NAKG+RNKP PHYD+L V FGKDRATG   + P +MA + A  + E+    +QDF   +          D  E+
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED

Query:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKIDC
        DLP+TPTS + T G SS   GSKRKRS + +E++DVVRT       H++++ +W  +K E + +RRK V D L QI  L  +D V L+ +L  +++K   
Subjt:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKIDC

Query:  FLQVPPQSRKAYCMRLLGRT
        FL+VP + +K +CM+LLG++
Subjt:  FLQVPPQSRKAYCMRLLGRT

SwissProt top hits (e value, % identity, alignment)

No hits found

Arabidopsis top hits (e value, % identity, alignment)

AT1G30140.1 unknown protein (e value: 1.6e-06; % identity: 40.74)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATG
        SGFGW+ E K   A  EV+  ++K+H N K M+ +   H+++L ++FG   ATG
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATG

AT4G02210.1 unknown protein (e value: 1.5e-04; % identity: 34.04)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG
        GF W+ E + V A+  V+  ++K+H +A+    +P P+Y +L V+ G
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG

AT4G02210.2 unknown protein (e value: 1.5e-04; % identity: 34.04)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG
        GF W+ E + V A+  V+  ++K+H +A+    +P P+Y +L V+ G
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG


Sequences

CDS sequence
ATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGG
GATGAGGAACAAGCCATTTCCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCACAGGAAATGGCCTTCAATGTTG
CGGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAGAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCA
GACACTCCTACTAGTAGGCGTAATACATCTGGCATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAGACTGAATTAATTGATGTTGTGCGGACAAA
GAATGGATATGCGCACTACCACATGCAACAACTTCTATCATGGCAGAAGGAGAAGTATGAATTGGAGGCCTCACGAAGGAAAGAAGTAGTCGATCTCTTGTATCAGATAG
AAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTTGGATCGATATCCGTAAGATCGATTGCTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGC
ATGCGTCTTCTAGGAAGGACTGGATGA

mRNA sequence
ATGCTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGG
GATGAGGAACAAGCCATTTCCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCACAGGAAATGGCCTTCAATGTTG
CGGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAGAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCA
GACACTCCTACTAGTAGGCGTAATACATCTGGCATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAGACTGAATTAATTGATGTTGTGCGGACAAA
GAATGGATATGCGCACTACCACATGCAACAACTTCTATCATGGCAGAAGGAGAAGTATGAATTGGAGGCCTCACGAAGGAAAGAAGTAGTCGATCTCTTGTATCAGATAG
AAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTTGGATCGATATCCGTAAGATCGATTGCTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGC
ATGCGTCTTCTAGGAAGGACTGGATGA

Protein sequence
MLGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPQEMAFNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGEDDLP
DTPTSRRNTSGMSSRCTGSKRKRSSFQTELIDVVRTKNGYAHYHMQQLLSWQKEKYELEASRRKEVVDLLYQIEGLTEHDRVSLIDLLWIDIRKIDCFLQVPPQSRKAYC
MRLLGRTG