CuGenDBv2

Tan0010220 (gene) of Snake gourd v1 genome

Gene ID: Tan0010220
Organism: Trichosanthes anguina (Snake gourd v1)
Description: WW domain-containing protein
Genome location: LG06:28004207..28005419
RNA-Seq Expression: Tan0010220
Synteny: Tan0010220
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001202 - WW domain
                  IPR036020 - WW domain superfamily
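
The genome location field combines a linkage group with a start..end interval. A minimal Python sketch for splitting it into parts is shown here; the helper name `parse_location` and the `Location` type are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'LG06:28004207..28005419' into its parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG06:28004207..28005419")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Keeping the coordinate convention in a single property makes it easy to adjust if the database turns out to use a different one.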


Homology
GenBank top hits    e value    % identity    Alignment
KAG7013878.1 hypothetical protein SDJN02_24047 [Cucurbita argyrosperma subsp. argyrosperma]    2.4e-67    74.88
Query:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN
        ME P LSLAPTQF+K  +D+INS  SS KL QTN  CSRKRKLLLH H  + H + SQTSVDL LKDPLPLDWEQCLDL   QSGKMYYLNRKTLRKSWN
Subjt:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN

Query:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNS-PP
        WPK+DHHN     +KLDLELELNNI+ SSS    ++L+NFRHG  +GS+SS+SSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP    SSPNS PP
Subjt:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNS-PP

Query:  NTLSLLN
        NTLSLLN
Subjt:  NTLSLLN

XP_011656507.1 uncharacterized protein LOC105435750 isoform X1 [Cucumis sativus]    1.6e-71    77.18
Query:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW
        ME PELSLAPT QFVK  SDE+NS++S   +   + CSRKRKLLLHD  HHHF+KSQTSVDL LKDPLPL WEQCLDLQ VQSGKMYYLNRKTLRKSWNW
Subjt:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW

Query:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN
        PK   H+  HHHQKLDL LELNNIN S S+ AD NL+NF  GH+HG+ S SESSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP +  SSPNSPPN
Subjt:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN

Query:  TLSLLN
        TL LLN
Subjt:  TLSLLN

XP_011656508.1 uncharacterized protein LOC105435750 isoform X2 [Cucumis sativus]    3.3e-69    76.21
Query:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW
        ME PELSLAPT QFVK  SDE+NS++S   +   + CSRKRKLLLHD  HHHF+KSQTSVDL LKDPLPL WEQCLDL   QSGKMYYLNRKTLRKSWNW
Subjt:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW

Query:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN
        PK   H+  HHHQKLDL LELNNIN S S+ AD NL+NF  GH+HG+ S SESSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP +  SSPNSPPN
Subjt:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN

Query:  TLSLLN
        TL LLN
Subjt:  TLSLLN

XP_022992410.1 uncharacterized protein LOC111488725 [Cucurbita maxima]    9.7e-69    75.24
Query:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN
        M+ P LSLAPTQF+K  SD+INS  SS KL QTN  CSRKRKLLLH H  + H + SQTSVDL LKDPLPLDWEQCLDL   QSGKMYYLNRKTLRKSWN
Subjt:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN

Query:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN
        WPK+DHHN     +KLDLELELNNI+ SSS    ++L+NFRHG  +GS+SS+SSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP    SSPNSPPN
Subjt:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN

Query:  TLSLLN
        TLSLLN
Subjt:  TLSLLN

XP_038886190.1 uncharacterized protein LOC120076436 isoform X1 [Benincasa hispida]    9.7e-69    74.02
Query:  MEVPELSLAP-TQFVKSSSDEINSAASSPKLAQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWP
        MEVPELSLAP TQFVK   DE+NS++         CSRKRKLL HDHQ  H++KSQTSVDL LKDPLPL WEQCLDLQ VQSGKMYYLNRKTLRKSWNWP
Subjt:  MEVPELSLAP-TQFVKSSSDEINSAASSPKLAQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWP

Query:  KQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPNTL
        K +      HHQKL+LELELNNIN +SS+T D NL+N RH     S+SSESSNMVALPC+NCHLLVILSKSSPSCPNCKHLHTLF   + SSPNSPPNTL
Subjt:  KQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPNTL

Query:  SLLN
         LLN
Subjt:  SLLN

TrEMBL top hits    e value    % identity    Alignment
A0A0A0K842 WW domain-containing protein    1.6e-69    76.21
Query:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW
        ME PELSLAPT QFVK  SDE+NS++S   +   + CSRKRKLLLHD  HHHF+KSQTSVDL LKDPLPL WEQCLDL   QSGKMYYLNRKTLRKSWNW
Subjt:  MEVPELSLAPT-QFVKSSSDEINSAASSPKL-AQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNW

Query:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN
        PK   H+  HHHQKLDL LELNNIN S S+ AD NL+NF  GH+HG+ S SESSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP +  SSPNSPPN
Subjt:  PKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN

Query:  TLSLLN
        TL LLN
Subjt:  TLSLLN

A0A6J1D9E4 uncharacterized protein LOC111018581    6.6e-63    69.67
Query:  MEVPELSLAPTQFVKSS--SDEINSAASSP-KLAQT-NCSRKRKLL-LHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKS
        MEVPELSLAPT+FVK S   DE+   +SSP KL ++ +CS+KRKLL LHDH        + SVDL LKDPLPLDWEQCLDL   QSGKMYYLNRKTLRKS
Subjt:  MEVPELSLAPTQFVKSS--SDEINSAASSP-KLAQT-NCSRKRKLL-LHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKS

Query:  WNWPKQDHHNHHHHHQKLDLELELNNINCSSSTT--ADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSP
        WNWPK+D    + HHQKL+LELELNNIN SSS++  A++ +INFRHG +HG    + SSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFPT SSS  
Subjt:  WNWPKQDHHNHHHHHQKLDLELELNNINCSSSTT--ADHNLINFRHGHKHGSAS-SESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSP

Query:  NSPPNTLSLLN
         +PPNTL LLN
Subjt:  NSPPNTLSLLN

A0A6J1EVH5 uncharacterized protein LOC111436347    1.2e-67    74.88
Query:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN
        ME P LSLAPTQF+K  +D+INS  SS KL QTN  CSRKRKLLLH H  + H + SQTSVDL LKDPLPLDWEQCLDL   QSGKMYYLNRKTLRKSWN
Subjt:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN

Query:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNS-PP
        WPK+DHHN     +KLDLELELNNI+ SSS    ++L+NFRHG  +GS+SS+SSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP    SSPNS PP
Subjt:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNS-PP

Query:  NTLSLLN
        NTLSLLN
Subjt:  NTLSLLN

A0A6J1JVM3 uncharacterized protein LOC111488725    4.7e-69    75.24
Query:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN
        M+ P LSLAPTQF+K  SD+INS  SS KL QTN  CSRKRKLLLH H  + H + SQTSVDL LKDPLPLDWEQCLDL   QSGKMYYLNRKTLRKSWN
Subjt:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTN--CSRKRKLLLHDHQ-HHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWN

Query:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN
        WPK+DHHN     +KLDLELELNNI+ SSS    ++L+NFRHG  +GS+SS+SSNMVALPCLNCHLLVILSKSSPSCPNCKH HTLFP    SSPNSPPN
Subjt:  WPKQDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPN

Query:  TLSLLN
        TLSLLN
Subjt:  TLSLLN

A0A6N2MYQ7 WW domain-containing protein    2.7e-40    53.14
Query:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWPK
        M++ ELSLAPTQFV        S +SS   ++ N SRKRK L   H   +    Q SVDLH++DPLPLDWEQCLDL   QSG+MYYLNRKTLRKSWNWPK
Subjt:  MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWPK

Query:  QDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSS----SPNSPP
                 +QKLDLEL +++   S+      +  +          +  SSNMVAL CLNCHLLVILSKSSPSCPNCKH+H+L PT  S     SP+   
Subjt:  QDHHNHHHHHQKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSS----SPNSPP

Query:  NTLSLLN
        +TLSL+N
Subjt:  NTLSLLN

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT4G08910.1 unknown protein    4.7e-05    25.35
Query:  INSAASSPKLAQTNC------SRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWPKQDHHNHHHHHQKLD
        +N     PK+A + C      SRKRK+         F +S  S +L +    P D                +  R+    S N        ++HHH  LD
Subjt:  INSAASSPKLAQTNC------SRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWPKQDHHNHHHHHQKLD

Query:  LELELNNINCSSSTTADHNLINFRHGHKH--------------------------------------GSASSESSNMVALPCLNCHLLVILSKSSPSCPN
        LEL L+           H+L N+  G++                                       G        MVA  C+ CH+LV+L K+SP+CPN
Subjt:  LELELNNINCSSSTTADHNLINFRHGHKH--------------------------------------GSASSESSNMVALPCLNCHLLVILSKSSPSCPN

Query:  CKHLHTLFPTSSS
        CK +H+   TS S
Subjt:  CKHLHTLFPTSSS
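
The % identity column can be related to the alignment text through the middle line of each block. The sketch below assumes the usual BLAST-style convention in which the midline shows the residue letter at identical positions, `+` at conservative substitutions, and a space elsewhere; the function name `percent_identity` is hypothetical. The alignment length is passed explicitly because leading or trailing spaces in a midline may be lost when the page text is copied.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent of alignment columns that are identical matches.

    `midline` is the middle line(s) of a BLAST-style alignment, concatenated
    across blocks. Letters are counted as identities; '+' and spaces are not.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Last block of the Arabidopsis hit above: 13 aligned columns.
query_block = "CKHLHTLFPTSSS"
midline_block = "CK +H+   TS S"
print(f"{percent_identity(midline_block, len(query_block)):.2f}")
```

The value printed here covers only that final block; the figure reported in the table is computed over the full alignment.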


Sequences
CDS sequence
ATGGAAGTTCCTGAGTTGTCTTTGGCTCCAACCCAGTTTGTGAAAAGTAGTAGTGATGAGATCAATTCAGCAGCTTCATCACCAAAATTAGCACAAACCAACTGTTCTAG
AAAGAGGAAGCTTCTTCTCCATGATCATCAACATCATCACTTTGTTAAGTCTCAAACAAGTGTTGATCTTCATCTTAAAGATCCTCTTCCTCTTGATTGGGAACAATGTC
TTGATCTTCAGCATGTGCAGTCAGGGAAAATGTACTATCTAAACAGAAAAACATTGAGAAAAAGTTGGAATTGGCCAAAACAAGATCATCATAATCATCACCATCATCAC
CAAAAGCTAGACTTGGAGTTGGAGCTCAACAACATTAATTGTTCTTCATCAACAACAGCTGATCATAATTTGATCAATTTCAGACATGGACATAAACATGGAAGTGCTTC
TTCAGAAAGCAGCAACATGGTGGCTTTGCCTTGTCTCAATTGCCACCTTCTTGTCATTCTCTCCAAATCTTCCCCATCTTGCCCTAATTGCAAGCATCTTCATACCCTTT
TTCCAACTTCTTCTTCTTCTTCACCCAATTCCCCTCCCAACACCTTGTCCCTTTTGAACTAG
mRNA sequence
ATGGAAGTTCCTGAGTTGTCTTTGGCTCCAACCCAGTTTGTGAAAAGTAGTAGTGATGAGATCAATTCAGCAGCTTCATCACCAAAATTAGCACAAACCAACTGTTCTAG
AAAGAGGAAGCTTCTTCTCCATGATCATCAACATCATCACTTTGTTAAGTCTCAAACAAGTGTTGATCTTCATCTTAAAGATCCTCTTCCTCTTGATTGGGAACAATGTC
TTGATCTTCAGCATGTGCAGTCAGGGAAAATGTACTATCTAAACAGAAAAACATTGAGAAAAAGTTGGAATTGGCCAAAACAAGATCATCATAATCATCACCATCATCAC
CAAAAGCTAGACTTGGAGTTGGAGCTCAACAACATTAATTGTTCTTCATCAACAACAGCTGATCATAATTTGATCAATTTCAGACATGGACATAAACATGGAAGTGCTTC
TTCAGAAAGCAGCAACATGGTGGCTTTGCCTTGTCTCAATTGCCACCTTCTTGTCATTCTCTCCAAATCTTCCCCATCTTGCCCTAATTGCAAGCATCTTCATACCCTTT
TTCCAACTTCTTCTTCTTCTTCACCCAATTCCCCTCCCAACACCTTGTCCCTTTTGAACTAG
Protein sequence
MEVPELSLAPTQFVKSSSDEINSAASSPKLAQTNCSRKRKLLLHDHQHHHFVKSQTSVDLHLKDPLPLDWEQCLDLQHVQSGKMYYLNRKTLRKSWNWPKQDHHNHHHHH
QKLDLELELNNINCSSSTTADHNLINFRHGHKHGSASSESSNMVALPCLNCHLLVILSKSSPSCPNCKHLHTLFPTSSSSSPNSPPNTLSLLN
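
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration, assuming the standard genetic code and an unambiguous A/C/G/T sequence; the function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGAAGTTCCTGAGTTGTCTTTGGCTCCA"
print(translate(cds_start))
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence listed above if the record is internally consistent.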