CuGenDBv2

Tan0016057 (gene) of Snake gourd v1 genome

Gene ID: Tan0016057
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1068)
Genome location: LG02:3131412..3133352
RNA-Seq Expression: Tan0016057
Synteny: Tan0016057
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068
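
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed as below. The helper name `parse_location` is hypothetical, and the 1-based inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re


def parse_location(location):
    """Split a string such as 'LG02:3131412..3133352' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("LG02:3131412..3133352")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(scaffold, start, end, span_length)
```

Under that assumption the gene spans 1941 bp on LG02.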


Homology
GenBank top hits | e value | % identity
KAG6572284.1 hypothetical protein SDJN03_29012, partial [Cucurbita argyrosperma subsp. sororia] | 1.8e-93 | 91.19
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQASSCNP+LIKFGL+LIA++IAGYILGPPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

XP_004144504.1 uncharacterized protein LOC101202853 isoform X1 [Cucumis sativus] | 1.1e-93 | 93.26
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIK GL+LIAITI GYILGPPLYWH KEGLAVV+ SSSSSSCPPCFCDCPS PVISIPEELRNSTFADCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEG AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

XP_008455457.1 PREDICTED: uncharacterized protein LOC103495614 isoform X1 [Cucumis melo] | 3.1e-93 | 92.75
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQ +SCNPSLIK GL+LIAITI GYILGPPLYWH KEGLAVVS SS+SSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTA WEQRARQRGWKEG AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata] | 8.3e-94 | 91.71
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQASSCNPSLIKFGL+LIA++IAGYILGPPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

XP_022969266.1 uncharacterized protein LOC111468321 [Cucurbita maxima] | 1.1e-93 | 91.71
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQASSCNPSLIKFGL+LIA++IAGYIL PPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQAN+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
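
The % identity column can be recomputed from the alignment midlines, where a letter marks an identical residue and `+` or a space marks a mismatch. The sketch below does this for the first GenBank hit (KAG6572284.1). The function name `percent_identity` is hypothetical, and the calculation assumes the midlines were copied without losing leading or trailing spaces.

```python
def percent_identity(midlines):
    """Percent identity from BLAST-style midlines.

    A letter in the midline marks an identical residue; '+' (similar)
    and ' ' (mismatch or gap) do not count as identities.
    """
    aligned = sum(len(line) for line in midlines)
    identical = sum(ch.isalpha() for line in midlines for ch in line)
    return 100.0 * identical / aligned


# Midlines of the KAG6572284.1 alignment, copied from the two blocks above.
midlines = [
    "MAIQASSCNP+LIKFGL+LIA++IAGYILGPPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL",
    "EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQ N+QTA",
]
pct = percent_identity(midlines)
print(f"{pct:.2f}")
```

For this hit the midlines hold 176 identical positions out of 193 aligned, which agrees with the 91.19 listed in the table.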

TrEMBL top hits | e value | % identity
A0A0A0K245 Uncharacterized protein | 5.2e-94 | 93.26
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQ SSCNPSLIK GL+LIAITI GYILGPPLYWH KEGLAVV+ SSSSSSCPPCFCDCPS PVISIPEELRNSTFADCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEG AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

A0A1S3C135 uncharacterized protein LOC103495614 isoform X1 | 1.5e-93 | 92.75
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQ +SCNPSLIK GL+LIAITI GYILGPPLYWH KEGLAVVS SS+SSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVS+DTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTA WEQRARQRGWKEG AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

A0A6J1ERF9 uncharacterized protein LOC111437126 isoform X1 | 1.9e-91 | 91.19
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQA SC+PSLIK GL+LIA++IAGYILGPPLYWH+KEG AVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTA WEQRARQRGWKEG AKSR QKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X1 | 4.0e-94 | 91.71
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQASSCNPSLIKFGL+LIA++IAGYILGPPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQ N+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

A0A6J1HXB7 uncharacterized protein LOC111468321 | 5.2e-94 | 91.71
Query:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL
        MAIQASSCNPSLIKFGL+LIA++IAGYIL PPLYWH KEGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRN+TF DC+KHDPEVSQDTEKNFADLLL
Subjt:  MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTAMWEQRARQRGWKEG+AKSRTQKQAN+QTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G05070.1 Protein of unknown function (DUF1068) | 3.6e-63 | 64.64
Query:  IKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE
        +K GL+L+ +++AGYILGPPLYWHL E LA V    S+SSCP C C+C +   ++IP+EL N++FADC KHDPEV++DTEKN+A+LL EELKL+EAE+LE
Subjt:  IKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE

Query:  NQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
          +RADM LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+ WE+RARQ+GW+EG  K   + ++NVQ A
Subjt:  NQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

AT2G32580.1 Protein of unknown function (DUF1068) | 2.3e-57 | 60.22
Query:  IKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE
        +K GL+L+A+++ GYILGPPLYWHL E LAV     S++SC  C CDC S P+++IP  L N +F DC K DPEV++DTEKN+A+LL EELK +EA ++E
Subjt:  IKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALE

Query:  NQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
          +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+MWEQRARQ+G+K+G  KS  + ++  + A
Subjt:  NQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA

AT2G32580.2 Protein of unknown function (DUF1068) | 3.2e-35 | 61.86
Query:  TFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGW
        +  +C K DPEV++DTEKN+A+LL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT+MWEQRARQ+G+
Subjt:  TFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGW

Query:  KEGVAKSRTQKQANVQTA
        K+G  KS  + ++  + A
Subjt:  KEGVAKSRTQKQANVQTA

AT4G04360.1 Protein of unknown function (DUF1068) | 1.4e-46 | 57.93
Query:  LIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRAD
        ++ + I  YI GP LYWHL E +A     S  SSCPPC CDC SQP++SIP+ L N +F DC++H+ E S+++E +F +++ EELKL+EA+A E++ RAD
Subjt:  LIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQRRAD

Query:  MALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKS
          LL+AKK  SQYQKEADKC+ GMETCE AREKAEA L  Q+RL+ MWE RARQ GWKEG   S
Subjt:  MALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKS

AT4G30996.1 Protein of unknown function (DUF1068) | 3.8e-36 | 47.83
Query:  LSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQR
        L + A+  A  + GP LYW   +G   V  + ++S CPPC CDCP    ++ I   L N +  DC   DPE+ Q+ EK F DLL EELKL+EA A E+ R
Subjt:  LSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEALENQR

Query:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWK
          ++ L EAK++ SQYQKEA+KCN+  E CE ARE+AEA+L  ++++T++WE+RARQ GW+
Subjt:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWK


Sequences
CDS sequence
ATGGCAATTCAGGCGAGTTCTTGCAATCCCTCACTGATTAAGTTCGGTTTGTCTTTGATTGCCATCACCATTGCCGGTTACATTCTCGGCCCTCCTCTCTATTGGCACTT
GAAGGAAGGTTTAGCCGTCGTTAGCCGCTCCTCTTCTTCTTCTTCCTGCCCTCCTTGTTTCTGCGATTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGA
ACTCTACCTTTGCAGATTGTGTTAAGCATGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTCGCAGACCTATTGTTAGAGGAACTGAAGTTAAAGGAAGCCGAAGCC
TTGGAAAATCAGCGCCGTGCTGATATGGCTCTGCTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAGTGCAATTCTGGGATGGAAACATGTGAAGA
AGCAAGAGAGAAAGCTGAAGCAGTACTAACTGCACAGAAGAGACTAACAGCAATGTGGGAGCAAAGGGCTCGCCAAAGGGGATGGAAAGAAGGGGTTGCCAAGTCTCGTA
CTCAAAAACAAGCAAATGTTCAAACTGCATAA
mRNA sequence
ATGGCAATTCAGGCGAGTTCTTGCAATCCCTCACTGATTAAGTTCGGTTTGTCTTTGATTGCCATCACCATTGCCGGTTACATTCTCGGCCCTCCTCTCTATTGGCACTT
GAAGGAAGGTTTAGCCGTCGTTAGCCGCTCCTCTTCTTCTTCTTCCTGCCCTCCTTGTTTCTGCGATTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGA
ACTCTACCTTTGCAGATTGTGTTAAGCATGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTCGCAGACCTATTGTTAGAGGAACTGAAGTTAAAGGAAGCCGAAGCC
TTGGAAAATCAGCGCCGTGCTGATATGGCTCTGCTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAAGCAGACAAGTGCAATTCTGGGATGGAAACATGTGAAGA
AGCAAGAGAGAAAGCTGAAGCAGTACTAACTGCACAGAAGAGACTAACAGCAATGTGGGAGCAAAGGGCTCGCCAAAGGGGATGGAAAGAAGGGGTTGCCAAGTCTCGTA
CTCAAAAACAAGCAAATGTTCAAACTGCATAA
Protein sequence
MAIQASSCNPSLIKFGLSLIAITIAGYILGPPLYWHLKEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNSTFADCVKHDPEVSQDTEKNFADLLLEELKLKEAEA
LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLTAQKRLTAMWEQRARQRGWKEGVAKSRTQKQANVQTA
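
The relationship between the CDS and the protein sequence can be checked by translating codons. The sketch below translates only the first 30 nucleotides of the CDS to keep the example short. It assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_prefix = "ATGGCAATTCAGGCGAGTTCTTGCAATCCC"
print(translate(cds_prefix))  # MAIQASSCNP
```

The result matches the first ten residues of the protein sequence. Passing the full 582-nucleotide CDS to the same function would allow the whole protein to be compared.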