; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001965 (gene) of Snake gourd v1 genome

Gene IDTan0001965
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationLG05:7303810..7306249
RNA-Seq ExpressionTan0001965
SyntenyTan0001965
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]1.3e-8389.67Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP VG CSPGLTKVGL  MALC+AAYILGPPLYWHFMEGL AFSS   STCPPCFCDCSS TDFAFT+ELENTTFRDCVKHDSGMNEETEK+FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSR
        SEELKLRE EALE  +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWETRARQRGWRD+IVTSR
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSR

KAG6604871.1 hypothetical protein SDJN03_02188, partial [Cucurbita argyrosperma subsp. sororia]1.9e-8588.14Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G C PGLTKVGL F+ALC+AAYILGPPLYWHFMEGLA  SS  SSTCPPC CDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE+SFAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EA+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIVTS A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

XP_022947173.1 uncharacterized protein LOC111451120 isoform X1 [Cucurbita moschata]1.4e-8587.63Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHF+EGLA  SS  SSTCPPCFCDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE++FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EA+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIV S A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

XP_022971112.1 uncharacterized protein LOC111469880 isoform X1 [Cucurbita maxima]2.4e-8587.63Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHFMEGLA  SS  SSTCPPCFCDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE+SFAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SE+LKLRE +A+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIV S A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]5.8e-8789.18Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHFMEGLA  SS  SSTCPPCFCDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE+SFAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EA+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIVTS A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein6.4e-8489.67Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP VG CSPGLTKVGL  MALC+AAYILGPPLYWHFMEGL AFSS   STCPPCFCDCSS TDFAFT+ELENTTFRDCVKHDSGMNEETEK+FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSR
        SEELKLRE EALE  +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWETRARQRGWRD+IVTSR
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSR

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X11.6e-8285.57Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MA KP VG  SPGLTKVGL FMA+C+AAYILGPPLYWHF EGLAAFSS   STCPPCFCDCSS TDFAFT+EL+NTTFRDCVKHDSGMNEETEK+FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EALE  +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQK+LT LWETRARQRGWRDDIVTSR      VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X16.9e-8687.63Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHF+EGLA  SS  SSTCPPCFCDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE++FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EA+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIV S A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

A0A6J1G609 uncharacterized protein LOC111451120 isoform X22.1e-7984.02Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHF+EGLA  SS  SSTCPPCFCDCSSQTDFAFTD        DCVKHDSGMNEETE++FAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SEELKLRE EA+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIV S A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

A0A6J1I131 uncharacterized protein LOC111469880 isoform X11.2e-8587.63Show/hide
Query:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL
        MAVKP  G CSPGLTKVGL F+ALC+AAYILGPPLYWHFMEGLA  SS  SSTCPPCFCDCSSQTDFAFTDE ENTTFRDCVKHDSGMNEETE+SFAELL
Subjt:  MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELL

Query:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        SE+LKLRE +A+ER +RADISLLEAKK+TSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWE RARQRGWRDDIV S A ARD VQTS
Subjt:  SEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)4.0e-5459.22Show/hide
Query:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ
        K+GLA + L +A YILGPPLYWH  E LAA S   +S+CP C C+CS+ +      EL N +F DC KHD  +NE+TEK++AELL+EELKLRE E+LE+ 
Subjt:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ

Query:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        +RAD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QKKLT+ WE RARQ+GWR+       +++  VQ +
Subjt:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

AT2G32580.1 Protein of unknown function (DUF1068)2.1e-4754.19Show/hide
Query:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ
        KVGLA +AL +  YILGPPLYWH  E LA      +++C  C CDCSS         L N +F DC K D  +NE+TEK++AELL+EELK RE  ++E+ 
Subjt:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ

Query:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS
        +R D  LLEAKKITS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D    S  +++   + +
Subjt:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS

AT2G32580.2 Protein of unknown function (DUF1068)8.2e-3157.39Show/hide
Query:  DCVKHDSGMNEETEKSFAELLSEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDD
        +C K D  +NE+TEK++AELL+EELK RE  ++E+ +R D  LLEAKKITS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D 
Subjt:  DCVKHDSGMNEETEKSFAELLSEELKLREVEALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDD

Query:  IVTSRAQARDAVQTS
           S  +++   + +
Subjt:  IVTSRAQARDAVQTS

AT4G04360.1 Protein of unknown function (DUF1068)1.4e-4355.36Show/hide
Query:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ
        KV    M LC+ AYI GP LYWH  E +A       S+CPPC CDCSSQ   +  D L N +F DC++H+ G +EE+E SF E+++EELKLRE +A E +
Subjt:  KVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQ

Query:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTS
         RAD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q++L+ +WE RARQ GW++  V S
Subjt:  QRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTS

AT4G30996.1 Protein of unknown function (DUF1068)2.5e-3245.68Show/hide
Query:  LAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTD-FAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQQR
        L   A+  A  + GP LYW F +G    S+  +S CPPC CDC            L N +  DC   D  + +E EK F +LL+EELKL+E  A E  + 
Subjt:  LAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTD-FAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVEALERQQR

Query:  ADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDD
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++K+T+LWE RARQ GW  +
Subjt:  ADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGAAGCCGGTGGTGGGTTTGTGCTCTCCAGGACTGACGAAGGTGGGATTGGCTTTTATGGCTCTCTGTTTAGCAGCTTACATTCTGGGTCCGCCTCTCTACTG
GCATTTCATGGAGGGTTTGGCCGCTTTCTCTTCTCCCCCTTCCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCTTCACAGACTGACTTCGCCTTCACTGATGAGCTCG
AAAACACAACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAGTTTCGCAGAGTTGTTGTCAGAGGAACTGAAGCTGAGGGAAGTTGAA
GCTTTGGAACGTCAGCAGCGTGCCGACATATCTCTGCTGGAAGCAAAGAAGATAACATCTCAATACCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGA
AGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAAGCTAACAGCATTATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGATGACATTGTCACATCCC
GTGCTCAGGCTCGTGATGCCGTTCAAACCTCATGA
mRNA sequenceShow/hide mRNA sequence
CAAAAATTGTAACTTAGCTACACACTCTCTTGATTATTCCCCTAATTCCCCACTACATTTGATTTCCATTTCACTTCCCCTTCTGTAATCCATCTTCTCCGACGAGCTCT
TTCCCATTTTCCATTTCTGGACATACGCAGAGAAGTAGAATAGATACAGAAAGGTTTAGAAATGGCAGTGAAGCCGGTGGTGGGTTTGTGCTCTCCAGGACTGACGAAGG
TGGGATTGGCTTTTATGGCTCTCTGTTTAGCAGCTTACATTCTGGGTCCGCCTCTCTACTGGCATTTCATGGAGGGTTTGGCCGCTTTCTCTTCTCCCCCTTCCTCAACT
TGCCCACCTTGCTTTTGTGACTGTTCTTCACAGACTGACTTCGCCTTCACTGATGAGCTCGAAAACACAACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGA
GGAAACAGAAAAGAGTTTCGCAGAGTTGTTGTCAGAGGAACTGAAGCTGAGGGAAGTTGAAGCTTTGGAACGTCAGCAGCGTGCCGACATATCTCTGCTGGAAGCAAAGA
AGATAACATCTCAATACCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAAGCTA
ACAGCATTATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGATGACATTGTCACATCCCGTGCTCAGGCTCGTGATGCCGTTCAAACCTCATGAAAGACCCGACGCTT
ATTCAAGCAGGCTGCTTTGACAGTAGGTTGGAATTCATTGGTCACCAATCTATGGTGGAAGGCCAGGATTTTCTTTTACATATCCCAATTAGGTTACTTCTTCAAACTTC
CACTTGTATGAACATAAGATAGTGATATACTATGTAGTAGAAGTTAGCAACCAGATGCCTCTCTCCCTTCCCTCCAAGATATCAACCCAAAATCTTTGGCAAGGTTATGT
AGCAATTTTTCCTGTGGTGAATGGGC
Protein sequenceShow/hide protein sequence
MAVKPVVGLCSPGLTKVGLAFMALCLAAYILGPPLYWHFMEGLAAFSSPPSSTCPPCFCDCSSQTDFAFTDELENTTFRDCVKHDSGMNEETEKSFAELLSEELKLREVE
ALERQQRADISLLEAKKITSQYQKEADKCNSGMETCEAARERAEATLASQKKLTALWETRARQRGWRDDIVTSRAQARDAVQTS