CuGenDBv2

Tan0002420 (gene) of Snake gourd v1 genome

Gene ID: Tan0002420
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG06:48686482..48687243
RNA-Seq Expression: Tan0002420
Synteny: Tan0002420
Gene Ontology terms: NA
InterPro domains: NA
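
The genome location above packs a sequence name and a coordinate range into one string. As a minimal sketch, the snippet below splits it into its parts and computes the span, assuming the coordinates are 1-based and inclusive (the record does not state the convention). The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates; the record does not say so.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'LG06:48686482..48687243' into its parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG06:48686482..48687243")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 762 bp.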


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022158565.1 uncharacterized protein LOC111025018 [Momordica charantia] (e value: 8.7e-50; % identity: 50)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED
        GFGWN++ KC+EAEKEVFD WVKSH NAKG+RNKP PHYD+L V FGKDRATG   + P++MAS+ A  + E+    +QDF   +          D  E+
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED

Query:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTDC
        DLP+TPTS + T G SS   GSKRKRS +  E++DVVRT M MQT+H++++ +W  +K E ++ARRK V D L QI  L  +D V L+ +L+T+++ +  
Subjt:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTDC

Query:  FLQVPPQSRKAYCMRLLGRT
        FL+VP + +K +CM+LLG++
Subjt:  FLQVPPQSRKAYCMRLLGRT

XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida] (e value: 1.1e-44; % identity: 49.55)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGF WNEEFKCV+ E+E+F+ WV+SH NAKGM NKPFPHYD+L+              TP E+      Q+E  +     D   TEQ T         G 
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD
          +P            SSR  GSKRKRSSFQ+E+ID++R+ ++M ++HM +L SWQK+KYELE  R+KEVV+ +Y I+GL E  +V+LIDL+VTDIQ TD
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  + K YC+RLLGR
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] (e value: 1.0e-50; % identity: 51.82)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGF WNEEFKCV+ E+E+FD WV+SH NAKGM  KPFPHYD+L+ VFGKDRA      TP                         E R  E+P   D  +
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD
        ++  +  T R +    SSR  GSKRKRSSFQ+E+ID+V++ ++MQ++HM +L SWQ EKYELE+   KEVV+ +Y I+ L E+D+V+LIDL+VTDIQ TD
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+RLLGR
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e value: 4.4e-46; % identity: 47.73)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGF WNEEFKCV+ E+E+FD WV+SH NAKGM NKPFPHYD+L+ VFGK +A G  +E P  M +N   + E+EIRLGSQD                   
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD
             TP                                     +++HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V+LIDLLVTDIQ T+
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+RLLGR
Subjt:  CFLQVPPQSRKAYCMRLLGR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] (e value: 1.3e-50; % identity: 52.27)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE
        SGFGWNEEFKCV+ EKE+FD WV+SH NAKGM NK F HYD+L+ VFGKDRA      TP                         E    E+P   D  +
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGE

Query:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD
        ++  +  T R +    SSR  GSKRKR SFQ E+ID++R+ ++MQ++HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V+ IDLLVTDIQ TD
Subjt:  DDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTD

Query:  CFLQVPPQSRKAYCMRLLGR
        CFL VP  +RK YC+ LL R
Subjt:  CFLQVPPQSRKAYCMRLLGR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953 (e value: 8.5e-35; % identity: 39.04)
Query:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPG
        M G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AKG+ +K FP+YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D    +  TM + G
Subjt:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPG

Query:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLID
        +  +  D++        + RRN S +      SKRKR S + E ++V+R+ M+     ++ +  W KEK  +EV  R +VV  L  I  L   DR  L+ 
Subjt:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLID

Query:  LLVTDIQNTDCFLQVPPQSRKAYCMRLL
        +L   ++  + FL +P + +  YC  LL
Subjt:  LLVTDIQNTDCFLQVPPQSRKAYCMRLL

A0A5A7U0H7 Retrotransposon protein (e value: 8.5e-35; % identity: 39.04)
Query:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPG
        M G  CSGFGWNEEF+C+ AE+++FD+W+KSH  AKG+ +K FP+YD+L+ VFGKDRATG  +ET   + SNV+    + I LG  D    +  TM + G
Subjt:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPG

Query:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLID
        +  +  D++        + RRN S +      SKRKR S + E ++V+R+ M+     ++ +  W KEK  +EV  R +VV  L  I  L   DR  L+ 
Subjt:  LGDVGEDDL----PDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLID

Query:  LLVTDIQNTDCFLQVPPQSRKAYCMRLL
        +L   ++  + FL +P + +  YC  LL
Subjt:  LLVTDIQNTDCFLQVPPQSRKAYCMRLL

A0A5A7U4M2 Retrotransposon protein (e value: 7.4e-31; % identity: 39.01)
Query:  CSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVG
        CSGFGWN+E KC+ AEKEVFD WVKSH  AKG+ NK F HYDEL+ VFGKDRATG  AE+  ++ SN     +      +   + T+   M +PGL ++ 
Subjt:  CSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVG

Query:  EDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNT
         DDL +T T+R   S   +  +GSK KR     +  D+VRTA++     + ++  W   + +     R+E+V  L  I  LT  DR  L+ +++ ++ + 
Subjt:  EDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNT

Query:  DCFLQVPPQSRKAYCMRLLGRTR
          FL+VP   +  YC  +L   R
Subjt:  DCFLQVPPQSRKAYCMRLLGRTR

A0A5D3C7T4 Uncharacterized protein (e value: 1.5e-31; % identity: 41.31)
Query:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENP-
        M G  CSGFGWNE  KC+E EK VFD WVK H NA+G+ NKPFP++ +L VVFG+DRATG   +TP+EM+S  A   EE+      D I  E   + NP 
Subjt:  MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENP-

Query:  GLGDVGEDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLV
        GL     +D+P TPTS  + +G SSR     +KR S+  +L+D  R +M   +  + ++ +WQ+EK E+E +  K +   L  I G+   D + + + L+
Subjt:  GLGDVGEDDLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLV

Query:  TDIQNTDCFLQVP
         D      FL  P
Subjt:  TDIQNTDCFLQVP

A0A6J1DW73 uncharacterized protein LOC111025018 (e value: 4.2e-50; % identity: 50)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED
        GFGWN++ KC+EAEKEVFD WVKSH NAKG+RNKP PHYD+L V FGKDRATG   + P++MAS+ A  + E+    +QDF   +          D  E+
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGED

Query:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTDC
        DLP+TPTS + T G SS   GSKRKRS +  E++DVVRT M MQT+H++++ +W  +K E ++ARRK V D L QI  L  +D V L+ +L+T+++ +  
Subjt:  DLPDTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTDC

Query:  FLQVPPQSRKAYCMRLLGRT
        FL+VP + +K +CM+LLG++
Subjt:  FLQVPPQSRKAYCMRLLGRT

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G30140.1 unknown protein (e value: 1.6e-06; % identity: 40.74)
Query:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATG
        SGFGW+ E K   A  EV+  ++K+H N K M+ +   H+++L ++FG   ATG
Subjt:  SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATG

AT4G02210.1 unknown protein (e value: 2.0e-04; % identity: 34.04)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG
        GF W+ E + V A+  V+  ++K+H +A+    +P P+Y +L V+ G
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG

AT4G02210.2 unknown protein (e value: 2.0e-04; % identity: 34.04)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG
        GF W+ E + V A+  V+  ++K+H +A+    +P P+Y +L V+ G
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFG
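
In the alignments above, the middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. As a rough sketch, percent identity can be recovered by counting the letters on that line and dividing by the alignment length. The function name `percent_identity` is hypothetical, and using the query line length as the alignment length is an assumption that holds only for the displayed segment; how CuGenDB itself derives the figure is not stated on this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one displayed alignment block.

    Counts letters on the BLAST-style middle line (identical residues)
    and divides by the length of the query line, which is assumed here
    to equal the alignment length.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100 * identities / len(query), 2)


# Query and middle line copied from the AT1G30140.1 alignment above.
QUERY = "SGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATG"
MIDLINE = "SGFGW+ E K   A  EV+  ++K+H N K M+ +   H+++L ++FG   ATG"

print(percent_identity(QUERY, MIDLINE))
```

For this alignment the count is 22 identical positions out of 54, which agrees with the 40.74 listed for AT1G30140.1.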


Sequences
CDS sequence
ATGTTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGG
GATGAGGAACAAGCCATTTCCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCCTCTAATGTTG
CGGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAGAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCA
GACACTCCTACTAGTAGACGTAATACATCTGGCATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAAATTGAATTAATTGATGTTGTGCGGACAGC
AATGGATATGCAGACCAGTCACATGCAACAACTTCTATCATGGCAGAAGGAGAAGTATGAATTGGAGGTCGCACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAG
AAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAATACTGATTGCTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGC
ATGCGTCTTCTAGGAAGGACTAGATGA
mRNA sequence
ATGTTTGGGAATGGTTGTAGTGGTTTTGGTTGGAATGAAGAATTTAAATGTGTTGAAGCAGAGAAAGAGGTATTCGATGCGTGGGTTAAGAGCCATACAAATGCCAAAGG
GATGAGGAACAAGCCATTTCCGCACTATGATGAGCTGGCAGTTGTCTTCGGAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCCTCTAATGTTG
CGGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATAGGGACAGAACAACGAACGATGGAGAATCCAGGACTTGGTGACGTAGGGGAAGATGACTTGCCA
GACACTCCTACTAGTAGACGTAATACATCTGGCATGTCTTCTAGATGTACTGGGAGCAAAAGAAAACGATCGTCCTTCCAAATTGAATTAATTGATGTTGTGCGGACAGC
AATGGATATGCAGACCAGTCACATGCAACAACTTCTATCATGGCAGAAGGAGAAGTATGAATTGGAGGTCGCACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAG
AAGGATTGACTGAGCATGATCGTGTCTCCCTGATTGACTTGCTTGTGACTGATATCCAGAATACTGATTGCTTTCTACAGGTTCCGCCTCAATCGAGAAAGGCGTATTGC
ATGCGTCTTCTAGGAAGGACTAGATGA
Protein sequence
MFGNGCSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDELAVVFGKDRATGIGAETPMEMASNVAEQMEEEIRLGSQDFIGTEQRTMENPGLGDVGEDDLP
DTPTSRRNTSGMSSRCTGSKRKRSSFQIELIDVVRTAMDMQTSHMQQLLSWQKEKYELEVARRKEVVDLLYQIEGLTEHDRVSLIDLLVTDIQNTDCFLQVPPQSRKAYC
MRLLGRTR
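
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the first CDS line and the whole last CDS line from the record.
print(translate("ATGTTTGGGAATGGTTGTAGTGGTTTTGGTTGG"))
print(translate("ATGCGTCTTCTAGGAAGGACTAGATGA"))
```

The two fragments translate to `MFGNGCSGFGW` and `MRLLGRTR`, matching the start and the final line of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would check the whole record the same way.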