; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013563 (gene) of Snake gourd v1 genome

Gene IDTan0013563
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein Ycf2-like
Genome locationLG02:47054951..47059686
RNA-Seq ExpressionTan0013563
SyntenyTan0013563
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXC30509.1 hypothetical protein L484_010758 [Morus notabilis]5.4e-2735.29Show/hide
Query:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF
        P+   +  D E     +   + +   KINL  K+ ++  + + L  + +E FR+ CFGHL DF   +  SQL+ HLI  QC   + +EL F I G  + F
Subjt:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF

Query:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP
         +++FALIT LNC N                        V+R  LN  F+ N+   ++D+VK+A LYCL+S L+P++ + +I+  H+ MVD+ ELF++YP
Subjt:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP

Query:  WGEL
        WG L
Subjt:  WGEL

KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]3.1e-3844.34Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKI
        ++KGK  P   E SEDS    Y + + RR+ P+KINL  KS+++  I++ L D+   RFR+  FGH  + S T  SSQLL+HLIQ  CKPK +S+L F I
Subjt:  KQKGKGTPKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKI

Query:  GGQALSFLLRDFALITELNC----------------------ENVK---RSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDD
        GG+ L F LR+FALIT L C                      EN+K   R +LN+ F ++    +DD +KMA LY L+SFL+P+QE   ++  HI+MVDD
Subjt:  GGQALSFLLRDFALITELNC----------------------ENVK---RSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDD

Query:  DELFNSYPWGEL
        DE+F+ YPWG +
Subjt:  DELFNSYPWGEL

TYK12922.1 uncharacterized protein E5676_scaffold255G005170 [Cucumis melo var. makuwa]2.4e-2233.18Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK
        K++GK   K    S  SE DD           L  S  S   +INL  K  ++  I+ TL ++  ++F+++CFG+  D   ++ SSQL  HLI+ QC  K
Subjt:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK

Query:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN
           EL F + G+   F ++DFALIT LNC                        + ++R+ L+  F         D+VKMA LY L+ F+L +Q +  I +
Subjt:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN

Query:  GHILMVDDDELFNSYPWGEL
         + L++DD E F+SYPWG +
Subjt:  GHILMVDDDELFNSYPWGEL

XP_024031030.1 uncharacterized protein LOC21394043 [Morus notabilis]5.4e-2735.29Show/hide
Query:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF
        P+   +  D E     +   + +   KINL  K+ ++  + + L  + +E FR+ CFGHL DF   +  SQL+ HLI  QC   + +EL F I G  + F
Subjt:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF

Query:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP
         +++FALIT LNC N                        V+R  LN  F+ N+   ++D+VK+A LYCL+S L+P++ + +I+  H+ MVD+ ELF++YP
Subjt:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP

Query:  WGEL
        WG L
Subjt:  WGEL

XP_031743193.1 uncharacterized protein LOC101221625 isoform X6 [Cucumis sativus]2.4e-2231.36Show/hide
Query:  ERIERKHFWILSKQKGKGTPKKKE-----------DSEDSEDDDYFLSTSRRS--RPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRT
        E  +RK     SK++G+   +KK             S + +D++Y L   R S     +INL  K  ++  I+ TL ++  ++F+++CFG+  D   ++ 
Subjt:  ERIERKHFWILSKQKGKGTPKKKE-----------DSEDSEDDDYFLSTSRRS--RPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRT

Query:  SSQLLVHLIQHQCKPKRSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYC
        SSQL  HLI+ QC  K  +EL F + G+   F ++DFALIT LNC                        + ++R+ L+  F         D+VKMA LY 
Subjt:  SSQLLVHLIQHQCKPKRSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYC

Query:  LKSFLLPRQEKMHIENGHILMVDDDELFNSYPWGEL
        L+ F+L +Q +  I + + L++DD + F+SYPWG +
Subjt:  LKSFLLPRQEKMHIENGHILMVDDDELFNSYPWGEL

TrEMBL top hitse value%identityAlignment
A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X51.1e-2233.18Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK
        K++GK   K    S  SE DD           L  S  S   +INL  K  ++  I+ TL ++  ++F+++CFG+  D   ++ SSQL  HLI+ QC  K
Subjt:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK

Query:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN
           EL F + G+   F ++DFALIT LNC                        + ++R+ L+  F         D+VKMA LY L+ F+L +Q +  I +
Subjt:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN

Query:  GHILMVDDDELFNSYPWGEL
         + L++DD E F+SYPWG +
Subjt:  GHILMVDDDELFNSYPWGEL

A0A1S3B181 uncharacterized protein LOC103484737 isoform X71.1e-2233.18Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK
        K++GK   K    S  SE DD           L  S  S   +INL  K  ++  I+ TL ++  ++F+++CFG+  D   ++ SSQL  HLI+ QC  K
Subjt:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK

Query:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN
           EL F + G+   F ++DFALIT LNC                        + ++R+ L+  F         D+VKMA LY L+ F+L +Q +  I +
Subjt:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN

Query:  GHILMVDDDELFNSYPWGEL
         + L++DD E F+SYPWG +
Subjt:  GHILMVDDDELFNSYPWGEL

A0A5A7U047 Protein Ycf2-like1.5e-3844.34Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKI
        ++KGK  P   E SEDS    Y + + RR+ P+KINL  KS+++  I++ L D+   RFR+  FGH  + S T  SSQLL+HLIQ  CKPK +S+L F I
Subjt:  KQKGKGTPKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKI

Query:  GGQALSFLLRDFALITELNC----------------------ENVK---RSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDD
        GG+ L F LR+FALIT L C                      EN+K   R +LN+ F ++    +DD +KMA LY L+SFL+P+QE   ++  HI+MVDD
Subjt:  GGQALSFLLRDFALITELNC----------------------ENVK---RSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDD

Query:  DELFNSYPWGEL
        DE+F+ YPWG +
Subjt:  DELFNSYPWGEL

A0A5D3CNI7 TF-B3 domain-containing protein1.1e-2233.18Show/hide
Query:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK
        K++GK   K    S  SE DD           L  S  S   +INL  K  ++  I+ TL ++  ++F+++CFG+  D   ++ SSQL  HLI+ QC  K
Subjt:  KQKGKGTPKKKEDSEDSEDDD---------YFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPK

Query:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN
           EL F + G+   F ++DFALIT LNC                        + ++R+ L+  F         D+VKMA LY L+ F+L +Q +  I +
Subjt:  RSSELNFKIGGQALSFLLRDFALITELNC------------------------ENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIEN

Query:  GHILMVDDDELFNSYPWGEL
         + L++DD E F+SYPWG +
Subjt:  GHILMVDDDELFNSYPWGEL

W9SF50 DUF1985 domain-containing protein2.6e-2735.29Show/hide
Query:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF
        P+   +  D E     +   + +   KINL  K+ ++  + + L  + +E FR+ CFGHL DF   +  SQL+ HLI  QC   + +EL F I G  + F
Subjt:  PKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGHLWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSF

Query:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP
         +++FALIT LNC N                        V+R  LN  F+ N+   ++D+VK+A LYCL+S L+P++ + +I+  H+ MVD+ ELF++YP
Subjt:  LLRDFALITELNCEN------------------------VKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDDELFNSYP

Query:  WGEL
        WG L
Subjt:  WGEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)5.6e-0629.46Show/hide
Query:  KINLCCKSSIMPTIRKTLQDKYE-ERFRQACFGHLWDFSFTRTS-SQLLVH-LIQHQCKPKRSSELNFKIGGQALSFLLRDFALITELNC------ENVK
        ++N+  +   + TI   L+   E ER + + FG L++F   R S S  L+H L+  Q   K+  EL F  GG  + F +R+F ++T L C      + VK
Subjt:  KINLCCKSSIMPTIRKTLQDKYE-ERFRQACFGHLWDFSFTRTS-SQLLVH-LIQHQCKPKRSSELNFKIGGQALSFLLRDFALITELNC------ENVK

Query:  R-------SFLNMAFKVNKMAPEDDMVKM
        +       S  N  F   +M    D+++M
Subjt:  R-------SFLNMAFKVNKMAPEDDMVKM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTGAATCCAATATCCGGGAAGAGTCATTACTCTCCACTAAACATGTTTCTACCACAAGTAAATCACATTTACCTTGGTTCTTCCTCTCGGCCAGAATTTGGGAC
AATTTCTCTTCCAATGTCCCAACGAGTTTTGTCCTGCAGGTCGAACCCCGTGATAAGACTCCATGAGGCAACATTTGCCTCAGATTCCTTGACCATGTTGAGGGATTGGA
AATTCTGTAGCTCATTGAGGAGTGTAGCTAATTTTGTTCATAACAGCATTACTACGAAACTGAAGGAAACTCTTCGGAAGAGTCTCCATGATAAAACTGACCTGACTCGA
CTCATCGATCGAAGCCCCATTCATCTTAGCCAGATTAAAGCGGGTCATCATAAGGAAGTTGGTGGGAACTTTGTGTTTTGGAAGAGGTGGAGAAATGAAAGAATTGAGAG
AAAGCACTTTTGGATTCTTAGCAAACAAAAAGGCAAGGGGACCCCTAAAAAAAAGGAAGATTCTGAAGACAGTGAGGATGATGACTACTTCCTGTCGACATCAAGAAGAA
GCCGCCCGATGAAGATAAATTTATGTTGTAAGAGCAGCATCATGCCAACAATTCGGAAAACCCTTCAAGACAAGTATGAAGAGAGATTTCGACAAGCATGTTTTGGTCAT
CTTTGGGACTTCTCGTTCACAAGAACTTCGTCCCAGCTACTAGTCCACCTCATTCAACACCAATGCAAGCCAAAGCGGTCATCGGAGTTAAATTTTAAGATTGGAGGTCA
GGCCCTGAGTTTTTTATTAAGAGACTTCGCTTTGATTACCGAGTTGAATTGTGAAAATGTGAAACGCTCATTTCTGAATATGGCTTTCAAAGTCAACAAAATGGCACCAG
AGGACGACATGGTAAAGATGGCACTGTTGTACTGCTTGAAGAGTTTCCTACTCCCTAGGCAAGAGAAGATGCATATAGAGAACGGTCATATCCTCATGGTTGACGACGAT
GAACTGTTTAATTCTTACCCATGGGGCGAGTTGCATTCCAGTTATTGGTCAAATACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCTGAATCCAATATCCGGGAAGAGTCATTACTCTCCACTAAACATGTTTCTACCACAAGTAAATCACATTTACCTTGGTTCTTCCTCTCGGCCAGAATTTGGGAC
AATTTCTCTTCCAATGTCCCAACGAGTTTTGTCCTGCAGGTCGAACCCCGTGATAAGACTCCATGAGGCAACATTTGCCTCAGATTCCTTGACCATGTTGAGGGATTGGA
AATTCTGTAGCTCATTGAGGAGTGTAGCTAATTTTGTTCATAACAGCATTACTACGAAACTGAAGGAAACTCTTCGGAAGAGTCTCCATGATAAAACTGACCTGACTCGA
CTCATCGATCGAAGCCCCATTCATCTTAGCCAGATTAAAGCGGGTCATCATAAGGAAGTTGGTGGGAACTTTGTGTTTTGGAAGAGGTGGAGAAATGAAAGAATTGAGAG
AAAGCACTTTTGGATTCTTAGCAAACAAAAAGGCAAGGGGACCCCTAAAAAAAAGGAAGATTCTGAAGACAGTGAGGATGATGACTACTTCCTGTCGACATCAAGAAGAA
GCCGCCCGATGAAGATAAATTTATGTTGTAAGAGCAGCATCATGCCAACAATTCGGAAAACCCTTCAAGACAAGTATGAAGAGAGATTTCGACAAGCATGTTTTGGTCAT
CTTTGGGACTTCTCGTTCACAAGAACTTCGTCCCAGCTACTAGTCCACCTCATTCAACACCAATGCAAGCCAAAGCGGTCATCGGAGTTAAATTTTAAGATTGGAGGTCA
GGCCCTGAGTTTTTTATTAAGAGACTTCGCTTTGATTACCGAGTTGAATTGTGAAAATGTGAAACGCTCATTTCTGAATATGGCTTTCAAAGTCAACAAAATGGCACCAG
AGGACGACATGGTAAAGATGGCACTGTTGTACTGCTTGAAGAGTTTCCTACTCCCTAGGCAAGAGAAGATGCATATAGAGAACGGTCATATCCTCATGGTTGACGACGAT
GAACTGTTTAATTCTTACCCATGGGGCGAGTTGCATTCCAGTTATTGGTCAAATACATGA
Protein sequenceShow/hide protein sequence
MALNPISGKSHYSPLNMFLPQVNHIYLGSSSRPEFGTISLPMSQRVLSCRSNPVIRLHEATFASDSLTMLRDWKFCSSLRSVANFVHNSITTKLKETLRKSLHDKTDLTR
LIDRSPIHLSQIKAGHHKEVGGNFVFWKRWRNERIERKHFWILSKQKGKGTPKKKEDSEDSEDDDYFLSTSRRSRPMKINLCCKSSIMPTIRKTLQDKYEERFRQACFGH
LWDFSFTRTSSQLLVHLIQHQCKPKRSSELNFKIGGQALSFLLRDFALITELNCENVKRSFLNMAFKVNKMAPEDDMVKMALLYCLKSFLLPRQEKMHIENGHILMVDDD
ELFNSYPWGELHSSYWSNT