CuGenDBv2

Tan0001891 (gene) of Snake gourd v1 genome

Gene ID: Tan0001891
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF761 domain-containing protein
Genome location: LG08:61345451..61346142
RNA-Seq Expression: Tan0001891
Synteny: Tan0001891
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
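
The genome location above is written as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the span of the locus can be computed as follows. The names `Locus` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Locus(seqid, int(start), int(end))


locus = parse_location("LG08:61345451..61346142")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus on LG08 spans 692 bases.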


Homology
GenBank top hits | e value | % identity | Alignment
KAG6580813.1 hypothetical protein SDJN03_20815, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.8e-44 | % identity: 66.47
Query:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA
        MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P 
Subjt:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA

Query:  PVTWEGDS----GDELSGYLQWLEERDEK----KEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
        P+TW  +S     D L+GYLQWLE+RD++    + VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  PVTWEGDS----GDELSGYLQWLEERDEK----KEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

XP_022935250.1 uncharacterized protein LOC111442186 [Cucurbita moschata] | e value: 2.1e-45 | % identity: 67.65
Query:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA
        MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P 
Subjt:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA

Query:  PVTW-EGDSGDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
        P+TW +  + D L+GYLQWLE+RD+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  PVTW-EGDSGDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

XP_022983138.1 uncharacterized protein LOC111481779 [Cucurbita maxima] | e value: 2.7e-45 | % identity: 66.47
Query:  LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWE
        +SLSPSS   IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW+
Subjt:  LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWE

Query:  GDS---GDELSGYLQWLEERDEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
         +S    D L+GYLQWLE+RD+++E+    NEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  GDS---GDELSGYLQWLEERDEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

XP_023526429.1 uncharacterized protein LOC111789933 [Cucurbita pepo subsp. pepo] | e value: 7.9e-45 | % identity: 66.47
Query:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA
        MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P 
Subjt:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA

Query:  PVTWEGDS----GDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
        P+TW+ +S     D L+GYLQWLE +D+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  PVTWEGDS----GDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

XP_038906153.1 uncharacterized protein LOC120092033 [Benincasa hispida] | e value: 5.9e-40 | % identity: 60.45
Query:  GSL--SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS----SHV
        GSL  S SP S   I  S+SS S  + LVKFK +LQ+LILSLA+AISRAKTTA HI KQAN+Q       K+ K KLL+GSFRLHYNWCS SS    SHV
Subjt:  GSL--SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS----SHV

Query:  TPAPVTWE-------GDSGDELSGYLQWLEERDEKKE------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLM
        TP  +TW+       G  GD+L GYL+WLEER+   +      VNEIDKLAEIFIAR HEKF+LEKQESYRRFQ ++
Subjt:  TPAPVTWE-------GDSGDELSGYLQWLEERDEKKE------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLM

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LBV5 Uncharacterized protein | e value: 1.2e-38 | % identity: 60.99
Query:  SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAPVT
        S S SSLQV+   + SSSTL   +KFK LLQ+LILSLA+AISRAKTTA   F+ AN   TA+   K+ K KLL+GSFRLHYNWCS SS   SHVTPA +T
Subjt:  SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAPVT

Query:  WE------GDSGDELSGYLQWLEERD---------------EKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS
         +      G  GD+L GYLQWLEERD               E + VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +MA S
Subjt:  WE------GDSGDELSGYLQWLEERD---------------EKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS

A0A1S3B8H1 uncharacterized protein LOC103487158 | e value: 5.9e-38 | % identity: 59.24
Query:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAP
        S S S SSLQV+   + SSSTL   +KFK LLQ+LI SLA+AISRAKTTA        +QS  +A  K+ K KLL+GSFRLHYNWCS SS   SHVTPA 
Subjt:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAP

Query:  VTWE-----GDSGDELSGYLQWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS
        +T++     G  GD+L GYLQWLEERD  K+                VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +MA S
Subjt:  VTWE-----GDSGDELSGYLQWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS

A0A5A7TJT8 Uncharacterized protein | e value: 5.9e-38 | % identity: 59.24
Query:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAP
        S S S SSLQV+   + SSSTL   +KFK LLQ+LI SLA+AISRAKTTA        +QS  +A  K+ K KLL+GSFRLHYNWCS SS   SHVTPA 
Subjt:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAP

Query:  VTWE-----GDSGDELSGYLQWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS
        +T++     G  GD+L GYLQWLEERD  K+                VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +MA S
Subjt:  VTWE-----GDSGDELSGYLQWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATS

A0A6J1FA41 uncharacterized protein LOC111442186 | e value: 1.0e-45 | % identity: 67.65
Query:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA
        MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P 
Subjt:  MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPA

Query:  PVTW-EGDSGDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
        P+TW +  + D L+GYLQWLE+RD+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  PVTW-EGDSGDELSGYLQWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

A0A6J1J6X1 uncharacterized protein LOC111481779 | e value: 1.3e-45 | % identity: 66.47
Query:  LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWE
        +SLSPSS   IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW+
Subjt:  LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWE

Query:  GDS---GDELSGYLQWLEERDEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
         +S    D L+GYLQWLE+RD+++E+    NEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Subjt:  GDS---GDELSGYLQWLEERDEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G42180.1 unknown protein | e value: 2.2e-13 | % identity: 36.42
Query:  SNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKK----KNKLLFGSFRLHYNWCSSSSSHVT-----PAPVTWEGDSG
        S+SSS +      F  L+   +  L +++SRA++  + I K    +   M  +  K    ++ + FG            SSHV      P P + +G   
Subjt:  SNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKK----KNKLLFGSFRLHYNWCSSSSSHVT-----PAPVTWEGDSG

Query:  DE---LSGYLQWLEER-DEKKEVN-------------EIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
        DE    S YLQWLEER DE   +N             +ID+LA+ FIARCHEKF LEK ESYRRFQ ++A SL
Subjt:  DE---LSGYLQWLEER-DEKKEVN-------------EIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

AT3G57950.1 unknown protein | e value: 7.0e-23 | % identity: 39.77
Query:  NSSSSTLTQLVKFKTLLQSL----ILSLAKAISRAKTTALHIFK-QANYQSTAM-----ANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAP--------
        +SSSS+ +  +K KTL+Q+L    +    +A+++AK+  L I K  +N +   +         K + K+ FGSFRLHYNWC   SSHV P P        
Subjt:  NSSSSTLTQLVKFKTLLQSL----ILSLAKAISRAKTTALHIFK-QANYQSTAM-----ANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAP--------

Query:  -VTWEGDSGDELSGYLQWLEER--DEKKEV---------NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
         +  E +   +LSGYL+WLE +  D+ +E+         ++ID LA++FIA CHEKF LEK ESYRRFQ+++   L
Subjt:  -VTWEGDSGDELSGYLQWLEER--DEKKEV---------NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL

AT5G06790.1 unknown protein | e value: 3.6e-19 | % identity: 39.9
Query:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILS----LAKAISRAKTTALHIFKQANYQSTAMANW-------KKKKNKLLFGSFRLHYNWCSSS
        S S S SS Q     +SSSS  +  +K K+L+Q+LI+S    L + ISR  +  + + ++  Y   ++++        KK+KN +LFGSFRLHYN+C   
Subjt:  SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILS----LAKAISRAKTTALHIFKQANYQSTAMANW-------KKKKNKLLFGSFRLHYNWCSSS

Query:  SSHVTP--APV---------------TWEG----------DSGD----ELSGYLQWLEER-----DEKKE--VNEIDKLAEIFIARCHEKFRLEKQESYR
        SSHV P  APV               TWE           D  D    +LS YL+ LE++     +E+ E  +NEIDKLA+ FIA CHEKF LEK +SYR
Subjt:  SSHVTP--APV---------------TWEG----------DSGD----ELSGYLQWLEER-----DEKKE--VNEIDKLAEIFIARCHEKFRLEKQESYR

Query:  RFQ
        R Q
Subjt:  RFQ


Sequences
CDS sequence
ATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCAACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCA
GAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGCCAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGA
AAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGACTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGAC
GAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTGGCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAG
GCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGA
mRNA sequence
AACTCCTTTACAACATAACTCAAAAAAAGAAAAAAAAAATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCA
ACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCAGAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGC
CAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGAAAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGA
CTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGACGAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTG
GCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGAGGATCTTTTTTTTA
AAAAAAAATTAATTGTGGGGTTTTGTTTTGGTGGGAGGAAAAAAATAAATTTGAATTTCTTGAGGATGGGGATGGTGGTGATATTAATCTGTAAGATTTTTTTTCTTTTT
TTTTTTTTACCAGTAATTGAAATTAATAAGGG
Protein sequence
MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDSGD
ELSGYLQWLEERDEKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
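
The CDS and protein sequences listed for Tan0001891 can be cross-checked against each other. The sketch below translates the CDS and compares the result with the listed protein. It assumes the standard genetic code (an assumption, though the usual one for plant nuclear genes), and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered by base in T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS sequence of Tan0001891, copied from this page.
CDS = (
    "ATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCAACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCA"
    "GAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGCCAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGA"
    "AAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGACTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGAC"
    "GAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTGGCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAG"
    "GCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGA"
)

# Protein sequence of Tan0001891, copied from this page.
PROTEIN = (
    "MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDSGD"
    "ELSGYLQWLEERDEKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL"
)


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


translated = translate(CDS)
# The listed protein omits the terminal stop codon, so strip it before comparing.
print(translated.rstrip("*") == PROTEIN)
```

Comparing against the protein with the trailing stop stripped keeps the check independent of whether a given listing includes the `*` terminator.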