CuGenDBv2

Tan0008169 (gene) of Snake gourd v1 genome

Gene ID: Tan0008169
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: s in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).
Genome location: LG08:10461110..10461688
RNA-Seq Expression: Tan0008169
Synteny: Tan0008169
Gene Ontology terms: NA
InterPro domains: NA
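
The genome location field follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting it and computing the span is given here; the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("LG08:10461110..10461688")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for Tan0008169 covers 579 bases.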


Homology
GenBank top hits (e value, % identity, alignment)
KAG7029810.1 hypothetical protein SDJN02_08153, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.1e-66; % identity: 72.68
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLR--MRLRFWRGGKKSR
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPFKWETQPGTPKDPPPQ++LPPLSPPPAV+SLG+  PCIE+PKTRPR R  M+LRFWR  KKSR
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLR--MRLRFWRGGKKSR

Query:  DSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
        D  RA+QATTIDY+       NDKLETFSF S DCEFMASP+ SM   SSSSSSSSPS L+ESLTVRNMGRVS  RP S S+  +I+RIL+CYH
Subjt:  DSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

XP_022153305.1 uncharacterized protein LOC111020830 [Momordica charantia]; e value: 9.5e-66; % identity: 72.4
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDGKFHWD K+K ISRTSSVGCSSRSIYY GTA+GVPFKWETQPGTPKDPPPQE+LPPLSPPPAVLSLG+ KPCIE+P  RPRLRM LRFWR  +K R++
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPSSSVWQINRILSCYHC
                      D FGH DKLET SF S D EFMASP+ S+SS SSSSSSS PS LLESL VRNMGRVSF RP     WQINR L+CYHC
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPSSSVWQINRILSCYHC

XP_022929606.1 uncharacterized protein LOC111436141 [Cucurbita moschata]; e value: 1.5e-66; % identity: 72.4
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPFKWETQPGTPKDPPPQ++LPPLSPPPAV+SLG+  PCIE+PKTRPR  M+L FWR  KKSRD 
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
         RA+QATTIDY+       NDKLETFSF S DCEFMASP+ SM   SSSSSSSSPS L+ESLTVRNMGRVS  RP S S+  +I+RIL+CYH
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

XP_022997465.1 uncharacterized protein LOC111492375 [Cucurbita maxima]; e value: 2.0e-68; % identity: 73.44
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPFKWETQPGTPKDPPPQ++LPPLSPPPAVLSLG+  PCIE+PKTRPR  M+LRFWR  KKSRD 
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
         RA+QATTIDY+       N KLETFSF S DCEFMASP+ SMSSSSSSSSSSSPS L+ESLTVR+MGRVS  RP S  +  +I+RIL+CYH
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

XP_023546389.1 uncharacterized protein LOC111805512 [Cucurbita pepo subsp. pepo]; e value: 9.5e-66; % identity: 71.35
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPF WETQPGTPKDPPPQ++LPPLSPPPAV+SLG+  PCIE+PKTRPR  M+LRFWR  KKSRD 
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
         RA+QATTIDY+       NDKLETFSF S DCEFMASP+ SM   SSSSSSSSPS L+ESLTV+NMGR S  RP S S+  +I+RIL+CYH
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UK88 Putative OSBP(Oxysterol binding protein)-related protein 4B; e value: 7.6e-61; % identity: 69.04
Query:  MDGK-FHWDVKNKAISRTSSVGCSSRSIYYYGTA-QGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP-CIEQPKTRPRLRMRLRFWRGGKKS
        M+G+ FHWD+KNK ISRTSS+GCSS SIYY GTA QGVPFKWETQPGTPKDPPPQ++LPPLSPPPAVLSLGV KP CI+QPK+R   RMRLRFW+   KS
Subjt:  MDGK-FHWDVKNKAISRTSSVGCSSRSIYYYGTA-QGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP-CIEQPKTRPRLRMRLRFWRGGKKS

Query:  RDSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMA--SPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
        RD  RA+Q T ID+       +NDKLETFSF S DCEFMA  SP+  MSSSS+SSSSSSPS  ++SL V N  +VSF RP S ++ WQINR+L CYH
Subjt:  RDSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMA--SPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

A0A5D3BQA0 Putative OSBP(Oxysterol binding protein)-related protein 4B; e value: 2.0e-61; % identity: 69.54
Query:  MDGK-FHWDVKNKAISRTSSVGCSSRSIYYYGTA-QGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP-CIEQPKTRPRLRMRLRFWRGGKKS
        M+G+ FHWD+KNK ISRTSS+GCSS SIYY GTA QGVPFKWETQPGTPKDPPPQ++LPPLSPPPAVLSLGV KP CI+QPK+R   RMRLRFW+   KS
Subjt:  MDGK-FHWDVKNKAISRTSSVGCSSRSIYYYGTA-QGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP-CIEQPKTRPRLRMRLRFWRGGKKS

Query:  RDSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMA--SPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
        RD  RA+Q T ID+       +NDKLETFSF S DCEFMA  SP+ SMSSSS+SSSSSSPS  ++SL V N  +VSF RP S ++ WQINR+L CYH
Subjt:  RDSERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMA--SPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

A0A6J1DGF8 uncharacterized protein LOC111020830; e value: 4.6e-66; % identity: 72.4
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDGKFHWD K+K ISRTSSVGCSSRSIYY GTA+GVPFKWETQPGTPKDPPPQE+LPPLSPPPAVLSLG+ KPCIE+P  RPRLRM LRFWR  +K R++
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPSSSVWQINRILSCYHC
                      D FGH DKLET SF S D EFMASP+ S+SS SSSSSSS PS LLESL VRNMGRVSF RP     WQINR L+CYHC
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPSSSVWQINRILSCYHC

A0A6J1ENL5 uncharacterized protein LOC111436141; e value: 7.1e-67; % identity: 72.4
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPFKWETQPGTPKDPPPQ++LPPLSPPPAV+SLG+  PCIE+PKTRPR  M+L FWR  KKSRD 
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
         RA+QATTIDY+       NDKLETFSF S DCEFMASP+ SM   SSSSSSSSPS L+ESLTVRNMGRVS  RP S S+  +I+RIL+CYH
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

A0A6J1K9Q8 uncharacterized protein LOC111492375; e value: 9.9e-69; % identity: 73.44
Query:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS
        MDG FHWD+K+K I+RTSS+GCSSRS YY G A+GVPFKWETQPGTPKDPPPQ++LPPLSPPPAVLSLG+  PCIE+PKTRPR  M+LRFWR  KKSRD 
Subjt:  MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDS

Query:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH
         RA+QATTIDY+       N KLETFSF S DCEFMASP+ SMSSSSSSSSSSSPS L+ESLTVR+MGRVS  RP S  +  +I+RIL+CYH
Subjt:  ERASQATTIDYNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPS-SSVWQINRILSCYH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G40475.1 unknown protein; e value: 1.4e-06; % identity: 52.46
Query:  VKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEI--LPPLSPPPAVLS
        + +K I + SS   SS  IYYYG A  VPF WET+PGTPK     E   LPPL+PPP+  S
Subjt:  VKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEI--LPPLSPPPAVLS

AT4G25845.1 BEST Arabidopsis thaliana protein match is: OSBP(oxysterol binding protein)-related protein 4B (TAIR:AT4G25850.2); e value: 7.6e-13; % identity: 48.84
Query:  KNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP--CIEQPKTRP-RLRMRLRFWR
        +  +ISR SSVG      Y     +GVPF+WE QPGTP +  P+E++PPLSPPPA+LSLG+ KP   IE+PK      +++LR W+
Subjt:  KNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP--CIEQPKTRP-RLRMRLRFWR

AT4G25850.2 OSBP(oxysterol binding protein)-related protein 4B; e value: 7.6e-13; % identity: 48.84
Query:  KNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP--CIEQPKTRP-RLRMRLRFWR
        +  +ISR SSVG      Y     +GVPF+WE QPGTP +  P+E++PPLSPPPA+LSLG+ KP   IE+PK      +++LR W+
Subjt:  KNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKP--CIEQPKTRP-RLRMRLRFWR

AT5G01790.1 unknown protein; e value: 1.2e-05; % identity: 31.69
Query:  SVGCSSRSIYYY-GTAQGVPFKWETQPGTPKDPPPQ-EILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRL---RFWRGGKKSRDSER--------ASQ
        S  C S  IYYY G A  VPF+WE+ PGTPK P  +   LPPL+PPP+  S    +      K+  ++   +    FW  G  +  +++         S+
Subjt:  SVGCSSRSIYYY-GTAQGVPFKWETQPGTPKDPPPQ-EILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRL---RFWRGGKKSRDSER--------ASQ

Query:  ATTIDYNKEDPFGHNDKLETF-SFFSYDCEFMASPQGSMSSS
           ID N+ D F    +      F S+D       Q S S+S
Subjt:  ATTIDYNKEDPFGHNDKLETF-SFFSYDCEFMASPQGSMSSS


Sequences
CDS sequence
ATGGATGGAAAATTTCATTGGGATGTGAAGAACAAAGCCATATCAAGAACTTCTTCAGTTGGTTGTTCTTCTCGCAGCATTTATTATTATGGAACAGCTCAAGGAGTTCC
TTTCAAATGGGAAACACAACCAGGAACACCCAAAGATCCACCCCCTCAAGAGATCCTTCCTCCCCTCAGCCCTCCGCCCGCCGTTCTCAGCCTCGGGGTATCGAAACCGT
GCATCGAACAACCGAAAACCCGACCCCGACTGCGAATGAGGCTTAGGTTTTGGAGGGGAGGCAAAAAGAGTAGGGATAGTGAGAGAGCTTCCCAAGCAACAACTATAGAC
TATAACAAAGAAGATCCTTTTGGTCATAATGATAAGTTGGAAACTTTTTCATTTTTTAGTTATGATTGTGAGTTCATGGCATCTCCTCAGGGTTCGATGTCATCGTCCTC
GTCCTCATCGTCTTCTTCATCGCCATCGCTTTTGCTCGAGTCATTAACCGTACGCAATATGGGGAGGGTGTCATTTCGAAGGCCTCCGAGTTCAAGTGTTTGGCAAATTA
ATCGAATTCTGTCTTGTTACCATTGCTAG
mRNA sequence
ATGGATGGAAAATTTCATTGGGATGTGAAGAACAAAGCCATATCAAGAACTTCTTCAGTTGGTTGTTCTTCTCGCAGCATTTATTATTATGGAACAGCTCAAGGAGTTCC
TTTCAAATGGGAAACACAACCAGGAACACCCAAAGATCCACCCCCTCAAGAGATCCTTCCTCCCCTCAGCCCTCCGCCCGCCGTTCTCAGCCTCGGGGTATCGAAACCGT
GCATCGAACAACCGAAAACCCGACCCCGACTGCGAATGAGGCTTAGGTTTTGGAGGGGAGGCAAAAAGAGTAGGGATAGTGAGAGAGCTTCCCAAGCAACAACTATAGAC
TATAACAAAGAAGATCCTTTTGGTCATAATGATAAGTTGGAAACTTTTTCATTTTTTAGTTATGATTGTGAGTTCATGGCATCTCCTCAGGGTTCGATGTCATCGTCCTC
GTCCTCATCGTCTTCTTCATCGCCATCGCTTTTGCTCGAGTCATTAACCGTACGCAATATGGGGAGGGTGTCATTTCGAAGGCCTCCGAGTTCAAGTGTTTGGCAAATTA
ATCGAATTCTGTCTTGTTACCATTGCTAG
Protein sequence
MDGKFHWDVKNKAISRTSSVGCSSRSIYYYGTAQGVPFKWETQPGTPKDPPPQEILPPLSPPPAVLSLGVSKPCIEQPKTRPRLRMRLRFWRGGKKSRDSERASQATTID
YNKEDPFGHNDKLETFSFFSYDCEFMASPQGSMSSSSSSSSSSSPSLLLESLTVRNMGRVSFRRPPSSSVWQINRILSCYHC
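
The CDS and protein sequences above can be cross-checked by translating the CDS. What follows is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 1; the `translate` helper and `CODON_TABLE` name are hypothetical and not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are rendered as 'X';
    trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the Tan0008169 CDS listed above.
print(translate("ATGGATGGAAAATTTCATTGGGATGTGAAG"))  # MDGKFHWDVK
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence gives a quick consistency check on the record.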