CuGenDBv2

Tan0017306 (gene) of Snake gourd v1 genome

Gene ID: Tan0017306
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG04:17274936..17275391
RNA-Seq Expression: Tan0017306
Synteny: Tan0017306
Gene Ontology terms: GO:0009507 - chloroplast (cellular component)
InterPro domains: NA
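
The genome location above follows a `scaffold:start..end` pattern. A minimal sketch for parsing it, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention). `Location` and `parse_location` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # scaffold or linkage group, e.g. "LG04"
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive span, so add 1 to the coordinate difference.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG04:17274936..17275391")
print(loc.seqid, loc.length)  # LG04 456
```

Under that coordinate assumption the gene spans 456 bp on LG04.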


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6599936.1 hypothetical protein SDJN03_05169, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.2e-53, identity: 86.92%)
Query:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD
        PFFHRPSAVVF KFRP S ISPPWL RQ  AAAPRCVS GGWGSSVAELEREL+ EGEEWLKL RLEEKCG G KG+VELLE LEREAIM EDEGRDPTD
Subjt:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSDDESEETE
        Y+RRAKIFSTSSRVFQALKQHSDDESEE E
Subjt:  YNRRAKIFSTSSRVFQALKQHSDDESEETE

KAG7030615.1 hypothetical protein SDJN02_04652, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.5e-53, identity: 86.15%)
Query:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD
        PFFHRPSAVVF KFRP S +SPPWL RQ  AAAPRCVS GGWGSSV ELEREL+ EGEEWLKL RLEEKCG G KG+VELLE LEREAIMGEDEGRDPTD
Subjt:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSDDESEETE
        Y+RRAKIFSTSSRVFQALKQHSDDESEE E
Subjt:  YNRRAKIFSTSSRVFQALKQHSDDESEETE

XP_022941969.1 uncharacterized protein LOC111447176 [Cucurbita moschata] (e-value: 3.5e-55, identity: 85.93%)
Query:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD
        PFFHRPSAVVF KFRP S +SPPWL RQ  AAAPRCVSQGGWGSSVAELEREL+ EGEEWLKL RLEEKCG G KG+VELLE LEREAIMGEDEGRDPTD
Subjt:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ
        Y+RRAKIFSTSSRVFQALKQHSDDESEE ERE ++
Subjt:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ

XP_023541783.1 uncharacterized protein LOC111801831 [Cucurbita pepo subsp. pepo] (e-value: 2.5e-53, identity: 82.96%)
Query:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD
        PFFHRPSAVVF KFRP S +SPPWL RQ   AAPRCVSQGGWGSSVAELERE + EGEEWLKL RLEEKCG G KG+VELLE LEREAIMGEDEGRDPT+
Subjt:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ
        Y+RRAKIFSTSSRVFQALK+HSDDESEE ERE ++
Subjt:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ

XP_038892559.1 uncharacterized protein LOC120081603 [Benincasa hispida] (e-value: 5.7e-50, identity: 77.54%)
Query:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE
        MQVSASL+IPPPPP P FHRPS  VF KFRPT  IS PWL  +A A  PRCVSQGGWG SVAEL+       EEWLK  +LEEKCGGG KGVVELLECLE
Subjt:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE

Query:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD
        +EAIMGEDEG+DPTDYNRRAKIFSTSS+VFQALKQHSD
Subjt:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KQQ4 Uncharacterized protein (e-value: 9.2e-46, identity: 74.29%)
Query:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWL-HRQA-AAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLEC
        MQ+SASL+ PPPPPL   HRPS   F   RPT  +S PWL HR A A A PRCVSQGGWGSSV   E ++    EEWLKL RLEEKCGGG KG+VELLEC
Subjt:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWL-HRQA-AAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLEC

Query:  LEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD
        LE+EAIMGEDEGRDPTDYNRRAKIFSTSS VFQALKQHSD
Subjt:  LEREAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD

A0A6J1D0H7 uncharacterized protein LOC111015926 (e-value: 9.6e-43, identity: 73.76%)
Query:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE
        M+VS+   + PPPP P  HRPSAV   KFRPTS ISP W  R     A RCVSQGGWGS  AELERE+AAEGEEWLKL RL+EKCGGG KGVVELLECLE
Subjt:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE

Query:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDDES
         EAIMGEDEGRDP DY+RRAKIFSTSS+VFQALKQ + D S
Subjt:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSDDES

A0A6J1FPZ7 uncharacterized protein LOC111447176 (e-value: 1.7e-55, identity: 85.93%)
Query:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD
        PFFHRPSAVVF KFRP S +SPPWL RQ  AAAPRCVSQGGWGSSVAELEREL+ EGEEWLKL RLEEKCG G KG+VELLE LEREAIMGEDEGRDPTD
Subjt:  PFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ
        Y+RRAKIFSTSSRVFQALKQHSDDESEE ERE ++
Subjt:  YNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQ

A0A6J1FW01 uncharacterized protein LOC111447817 (e-value: 3.1e-49, identity: 78.26%)
Query:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE
        MQV ++ LI PPP  PFFHR S VVF K RP S ISP WL R+ AAAAPRCVSQGGWG SVAE       E EEWLKL RLEEKCGGG KGVVELLECLE
Subjt:  MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLE

Query:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD
        REAIMGEDEGR+PTDYNRRAKIFSTSS VFQALKQHSD
Subjt:  REAIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD

A0A6J1JM89 uncharacterized protein LOC111485933 (e-value: 4.0e-49, identity: 79.41%)
Query:  VSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLERE
        VSASL+IPPP P  FFHR S VVF K RP S ISPPW  R+ AAAAPRCVSQGGWG SVAE       E EEWLKL RL+EKCGGG KGVVELLECLERE
Subjt:  VSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLERE

Query:  AIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD
        AIMGEDEGR+PTDYNRRAKIFSTSS VFQALKQHSD
Subjt:  AIMGEDEGRDPTDYNRRAKIFSTSSRVFQALKQHSD

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT5G05220.1 unknown protein (e-value: 3.4e-16, identity: 44.72%)
Query:  PSAVVFTKFRPTSVISPPWL----HRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGG-EKGVVELLECLEREAIMGEDEGRDPTD
        PS  ++    P + I    L     ++   AA RCV+    GS  A     +  E EE L  RR    CGG   +GV ELLECLE+EAIMG D+GRDP D
Subjt:  PSAVVFTKFRPTSVISPPWL----HRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGG-EKGVVELLECLEREAIMGEDEGRDPTD

Query:  YNRRAKIFSTSSRVFQALKQHSD
        YNRRAKIF  SS++F+ L +  D
Subjt:  YNRRAKIFSTSSRVFQALKQHSD


Sequences
CDS sequence
ATGCAAGTCTCTGCATCCCTTCTAATACCACCACCACCACCGCTACCCTTCTTCCACCGCCCCTCCGCCGTGGTCTTCACTAAATTCAGGCCAACTTCAGTCATTTCTCC
ACCATGGCTCCACCGTCAGGCTGCAGCGGCAGCGCCCAGATGTGTTAGTCAGGGTGGTTGGGGGAGCTCTGTGGCGGAGCTGGAGAGAGAATTGGCAGCAGAGGGAGAAG
AGTGGCTGAAGCTCAGGAGGCTGGAGGAGAAGTGCGGCGGCGGAGAAAAGGGAGTGGTGGAGTTGCTTGAATGTTTGGAAAGAGAAGCCATCATGGGGGAAGATGAAGGT
AGAGACCCTACTGATTACAATAGGAGGGCTAAAATTTTCAGTACCAGTTCTAGAGTTTTTCAAGCTCTCAAGCAACATTCTGATGATGAATCTGAAGAGACAGAGAGAGA
GGAAGAACAAGGATGA
mRNA sequence
ATGCAAGTCTCTGCATCCCTTCTAATACCACCACCACCACCGCTACCCTTCTTCCACCGCCCCTCCGCCGTGGTCTTCACTAAATTCAGGCCAACTTCAGTCATTTCTCC
ACCATGGCTCCACCGTCAGGCTGCAGCGGCAGCGCCCAGATGTGTTAGTCAGGGTGGTTGGGGGAGCTCTGTGGCGGAGCTGGAGAGAGAATTGGCAGCAGAGGGAGAAG
AGTGGCTGAAGCTCAGGAGGCTGGAGGAGAAGTGCGGCGGCGGAGAAAAGGGAGTGGTGGAGTTGCTTGAATGTTTGGAAAGAGAAGCCATCATGGGGGAAGATGAAGGT
AGAGACCCTACTGATTACAATAGGAGGGCTAAAATTTTCAGTACCAGTTCTAGAGTTTTTCAAGCTCTCAAGCAACATTCTGATGATGAATCTGAAGAGACAGAGAGAGA
GGAAGAACAAGGATGA
Protein sequence
MQVSASLLIPPPPPLPFFHRPSAVVFTKFRPTSVISPPWLHRQAAAAAPRCVSQGGWGSSVAELERELAAEGEEWLKLRRLEEKCGGGEKGVVELLECLEREAIMGEDEG
RDPTDYNRRAKIFSTSSRVFQALKQHSDDESEETEREEEQG
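
The CDS and protein sequences above should be related by translation. A minimal sketch of that check, assuming the standard nuclear genetic code (the page does not name a translation table); `translate` is a hypothetical helper, and the input shown is only the first 27 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS above.
print(translate("ATGCAAGTCTCTGCATCCCTTCTAATA"))  # MQVSASLLI
```

Because `translate` strips whitespace, the full multi-line CDS can be pasted in as-is and the result compared against the protein sequence listed here.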