CuGenDBv2

Tan0011032 (gene) of Snake gourd v1 genome

Gene ID: Tan0011032
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cotton fiber protein
Genome location: LG05:77706308..77706988
RNA-Seq Expression: Tan0011032
Synteny: Tan0011032
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8653186.1 hypothetical protein Csa_020019 [Cucumis sativus] | e-value: 3.1e-60 | identity: 71.19%
Query:  MSSNSNS-PRNLPVVDNS--FLA-SKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQ
        MS NS S P+NL VV+NS  FL   K KP  YNK +N+ +LAFFSS LFLLSIFII FISI FKNLSFSSLFNST+FWFFISNTLIFIIA DY LFSLSQ
Subjt:  MSSNSNS-PRNLPVVDNS--FLA-SKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQ

Query:  HKSFDLYE--HFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP------PARTYRRSKSEKPKRRVLSKESKKMMA-RRSESVKYEAK
        HKSF LYE  ++SPPNPK THFQLQ+ SL+V DE++E PDEK + VVQ    D P      P RTY R KSEKPKR V  + SKKMM  RRSESVK E K
Subjt:  HKSFDLYE--HFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP------PARTYRRSKSEKPKRRVLSKESKKMMA-RRSESVKYEAK

Query:  ELE-ENEFSKMTDEELNRRVEEFIQRFNRQMRLQTR
        ELE ENEF+KMTDEELNRRVEEFIQRFN+QMRLQT+
Subjt:  ELE-ENEFSKMTDEELNRRVEEFIQRFNRQMRLQTR

XP_022927149.1 uncharacterized protein LOC111434084 [Cucurbita moschata] | e-value: 2.5e-57 | identity: 66.38%
Query:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKS
        MSS+S    NL  VDN F+    K KP+T NK +N+ ILAFFSSL LL  F      I FKNLSFSSLFNST+FWFFISN LIFIIAADYA FSLSQ+K 
Subjt:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKS

Query:  FDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFK-IVVQGCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF
          LYEH+SPPNPK THFQ Q+ S +V DE++E PD   +  V + CD        P RTYRRSKSEKPKR V SKESKK MA+RSES KYE KELEENE+
Subjt:  FDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFK-IVVQGCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF

Query:  SKMTDEELNRRVEEFIQRFNRQMRLQTRT
        SKMTDEELNRRVEEFIQRFN QMRL+T++
Subjt:  SKMTDEELNRRVEEFIQRFNRQMRLQTRT

XP_023001467.1 uncharacterized protein LOC111495593 [Cucurbita maxima] | e-value: 1.3e-58 | identity: 67.23%
Query:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK
        MSS+S    NL  +DN F+    K KP+TYNK +N+ ILAFFSS LFLLS  II      FKNLSFSSLFNSTIFWFFISNTLIFIIAADYA FSLSQ+K
Subjt:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK

Query:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG-CD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENE
            YEH+SPPNPK THFQ Q+ S +V +E++E PD   + +VQ  CD        P RTY+RSKSEKPKR V SKESKK MA+RSES KYE KELEENE
Subjt:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG-CD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENE

Query:  FSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS
        +SKMTDEELNRRVEEFIQRFN QMRL+T++ ELKS
Subjt:  FSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS

XP_023520225.1 uncharacterized protein LOC111783530 [Cucurbita pepo subsp. pepo] | e-value: 2.0e-51 | identity: 68.53%
Query:  ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEK
        +LAFFSSL    +F++FF  I FKNLSFSSLFNST+FWFFISN LIFIIAADYA FSL+Q+K   LYEH+SPPNPK THFQ Q+ S +V DE++E PD  
Subjt:  ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEK

Query:  FKIVVQ-GCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS
         +  VQ  CD        P RTY RSKSEKPKR V SKESKK MA+R ES  YE KELEENE+SKMTDEELNRRVEEFIQRFN QMRL+T++ ELKS
Subjt:  FKIVVQ-GCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS

XP_038877172.1 uncharacterized protein LOC120069472 [Benincasa hispida] | e-value: 1.8e-63 | identity: 73.68%
Query:  MSSNSNSPRNLPVVDNSFLA-SKSKPHT-YNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK
        MSSNS SP+NL VVDNSFL  +K K  + YNK +N+ ILAFFSS LFLLSIFII FISI FKNLS  SLFNSTIFWFFISNTLIFIIA DY +FSL QHK
Subjt:  MSSNSNSPRNLPVVDNSFLA-SKSKPHT-YNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK

Query:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP----PARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF
        SF LYE FSP NPK  HF+L S SL+V DE+KE   EK ++VVQG   D P    PART RR KSEKPK  ++SKES KMM +RSESVKYEAKELEENEF
Subjt:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP----PARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF

Query:  SKMTDEELNRRVEEFIQRFNRQMRLQTR
         KMTDEELNRRVEEFIQRFNRQMRLQT+
Subjt:  SKMTDEELNRRVEEFIQRFNRQMRLQTR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0M033 Uncharacterized protein | e-value: 2.1e-46 | identity: 58.55%
Query:  MSSNSNS-PRNLPVVDNS--FLA-SKSKPHTYNKNSNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK
        MS NS S P+NL VV+NS  FL   K KP  YNK +N+                                  T+FWFFISNTLIFIIA DY LFSLSQHK
Subjt:  MSSNSNS-PRNLPVVDNS--FLA-SKSKPHTYNKNSNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK

Query:  SFDLYE--HFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP------PARTYRRSKSEKPKRRVLSKESKKMMA-RRSESVKYEAKEL
        SF LYE  ++SPPNPK THFQLQ+ SL+V DE++E PDEK + VVQ    D P      P RTY R KSEKPKR V  + SKKMM  RRSESVK E KEL
Subjt:  SFDLYE--HFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG--CDFP------PARTYRRSKSEKPKRRVLSKESKKMMA-RRSESVKYEAKEL

Query:  E-ENEFSKMTDEELNRRVEEFIQRFNRQMRLQTR
        E ENEF+KMTDEELNRRVEEFIQRFN+QMRLQT+
Subjt:  E-ENEFSKMTDEELNRRVEEFIQRFNRQMRLQTR

A0A2N9GT34 Uncharacterized protein | e-value: 6.3e-22 | identity: 48.83%
Query:  PRNLPVVDNSFLASKSKPHTYNKNSNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSPP
        P NL  V++S   +KS    Y + +N  ++F++  FL SIFI   I   F NLS S+LFN+T FWFF+SNTLI IIA DY  +S S+ K     E+    
Subjt:  PRNLPVVDNSFLASKSKPHTYNKNSNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSPP

Query:  NPKQT-HFQLQSPSLMVIDEEKENPDEKFKIVVQGCDFPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYE-AKELEENEFSKMTDEELNRRVEEF
          K    F  Q P  +    E     EK KI         A+ YRRSKSEK K RVL  ESK ++ R SE+ K+E    LEENEFS M+DEELNRRVEEF
Subjt:  NPKQT-HFQLQSPSLMVIDEEKENPDEKFKIVVQGCDFPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYE-AKELEENEFSKMTDEELNRRVEEF

Query:  IQRFNRQMRLQTR
        IQRFNRQ+RLQ R
Subjt:  IQRFNRQMRLQTR

A0A6J1EH73 uncharacterized protein LOC111434084 | e-value: 1.2e-57 | identity: 66.38%
Query:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKS
        MSS+S    NL  VDN F+    K KP+T NK +N+ ILAFFSSL LL  F      I FKNLSFSSLFNST+FWFFISN LIFIIAADYA FSLSQ+K 
Subjt:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKS

Query:  FDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFK-IVVQGCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF
          LYEH+SPPNPK THFQ Q+ S +V DE++E PD   +  V + CD        P RTYRRSKSEKPKR V SKESKK MA+RSES KYE KELEENE+
Subjt:  FDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFK-IVVQGCD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEF

Query:  SKMTDEELNRRVEEFIQRFNRQMRLQTRT
        SKMTDEELNRRVEEFIQRFN QMRL+T++
Subjt:  SKMTDEELNRRVEEFIQRFNRQMRLQTRT

A0A6J1KGM0 uncharacterized protein LOC111495593 | e-value: 6.4e-59 | identity: 67.23%
Query:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK
        MSS+S    NL  +DN F+    K KP+TYNK +N+ ILAFFSS LFLLS  II      FKNLSFSSLFNSTIFWFFISNTLIFIIAADYA FSLSQ+K
Subjt:  MSSNSNSPRNLPVVDNSFL--ASKSKPHTYNKNSNN-ILAFFSS-LFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHK

Query:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG-CD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENE
            YEH+SPPNPK THFQ Q+ S +V +E++E PD   + +VQ  CD        P RTY+RSKSEKPKR V SKESKK MA+RSES KYE KELEENE
Subjt:  SFDLYEHFSPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQG-CD------FPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENE

Query:  FSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS
        +SKMTDEELNRRVEEFIQRFN QMRL+T++ ELKS
Subjt:  FSKMTDEELNRRVEEFIQRFNRQMRLQTRT-ELKS

A0A7N2LQF9 Uncharacterized protein | e-value: 1.5e-20 | identity: 42.04%
Query:  KPHTYNKN---SNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHF----------------
        KP+  +KN       L+F++SLF  SIFI   +   F NLS S+LF +T FWFF+SNTLI IIA DY  +S S  K  DLY+ +                
Subjt:  KPHTYNKN---SNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHF----------------

Query:  ----SPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQGCDFPP---------ARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEFS
                PKQ     Q    +++ E +  P+   ++V++     P         A+TYRRSKSE+ KR V+  ESK ++ R SE+ K      EENEFS
Subjt:  ----SPPNPKQTHFQLQSPSLMVIDEEKENPDEKFKIVVQGCDFPP---------ARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEFS

Query:  KMTDEELNRRVEEFIQRFNRQMRLQT
         M+DEELNRRVEEFIQRFNR++RLQ+
Subjt:  KMTDEELNRRVEEFIQRFNRQMRLQT

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G30190.1 unknown protein | e-value: 1.3e-11 | identity: 31.46%
Query:  KSKPHTYNKNSNNILAFFSSLF-LLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSF------------DLYEHFSP--
        K +P      S+++L  F  +F  + IF +F +S+       SS+F  T   FFISNTLI IIAADY  FS  + + F            +  +++SP  
Subjt:  KSKPHTYNKNSNNILAFFSSLF-LLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSF------------DLYEHFSP--

Query:  -------------PNPKQTHFQ---------------LQSPS--LMVIDEEKENPD---EKFKIVV-------QGCD----FPPARTYRRSKSEKPKRRV
                      NPK   F+               +  P   + V+ E+K+  D   E++K V        + C+      P + Y RSKS+KP+R+ 
Subjt:  -------------PNPKQTHFQ---------------LQSPS--LMVIDEEKENPD---EKFKIVV-------QGCD----FPPARTYRRSKSEKPKRRV

Query:  LSKES----KKMMARRSESV--------KYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQT
        LS ++    +K   R+            K+E  + E  EFSK+++EELN+RVEEFIQRFNRQ+R Q+
Subjt:  LSKES----KKMMARRSESV--------KYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQT

AT2G34610.1 unknown protein | e-value: 1.5e-04 | identity: 27.07%
Query:  FSSLFLLSIFIIFFISIF-FKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSP---------PNPKQTHFQLQSPSLMVIDEEK
        +SSL  + + I  +I IF   ++S  S+FN T   F ISN LI IIAADY  F  +  ++ D Y  ++          P P+++ + +         E++
Subjt:  FSSLFLLSIFIIFFISIF-FKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSP---------PNPKQTHFQLQSPSLMVIDEEK

Query:  EN------------PDEKFKI----------------VVQGCD---------------------FPPARTYRRSKSEKPKRRVLSKESKKMMAR------
        E             P++K K+                 ++ C+                        ++ Y RSKS+K +  V+SKE ++   +      
Subjt:  EN------------PDEKFKI----------------VVQGCD---------------------FPPARTYRRSKSEKPKRRVLSKESKKMMAR------

Query:  -RSES----------------------VKYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQ
         RS+S                       K+E  + E  EFSKM++EELNRRVE+FIQRFNR ++ Q
Subjt:  -RSES----------------------VKYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQ


Sequences
CDS sequence
ATGTCATCGAATTCAAATTCACCTCGAAATCTACCAGTTGTAGATAATTCATTTCTGGCCTCCAAATCCAAACCACACACTTATAATAAAAATTCAAACAACATATTAGC
CTTTTTCTCTTCCCTATTCCTTCTCTCCATTTTCATCATCTTCTTCATTTCCATCTTCTTCAAAAACCTCTCTTTTTCATCTCTCTTCAACAGCACCATTTTCTGGTTCT
TCATTTCCAACACCCTCATCTTCATCATTGCTGCTGATTATGCCCTTTTCTCTCTCTCCCAACACAAAAGTTTCGATCTTTATGAACATTTCTCACCACCGAACCCAAAG
CAAACTCATTTTCAACTTCAAAGTCCTTCATTGATGGTAATTGATGAAGAAAAAGAAAACCCAGATGAGAAATTCAAAATCGTGGTTCAAGGATGCGATTTTCCGCCGGC
GAGAACTTACCGCCGGAGCAAGTCGGAGAAACCCAAAAGAAGAGTACTGTCGAAGGAGAGCAAGAAAATGATGGCGAGGAGATCGGAAAGTGTTAAATATGAAGCGAAGG
AATTAGAAGAAAATGAGTTTTCAAAGATGACAGATGAGGAATTGAACAGAAGGGTGGAGGAATTTATTCAAAGATTTAACAGACAGATGAGACTTCAAACTAGAACTGAA
TTAAAGTCTCTCGTTTTATGA
mRNA sequence
ATGTCATCGAATTCAAATTCACCTCGAAATCTACCAGTTGTAGATAATTCATTTCTGGCCTCCAAATCCAAACCACACACTTATAATAAAAATTCAAACAACATATTAGC
CTTTTTCTCTTCCCTATTCCTTCTCTCCATTTTCATCATCTTCTTCATTTCCATCTTCTTCAAAAACCTCTCTTTTTCATCTCTCTTCAACAGCACCATTTTCTGGTTCT
TCATTTCCAACACCCTCATCTTCATCATTGCTGCTGATTATGCCCTTTTCTCTCTCTCCCAACACAAAAGTTTCGATCTTTATGAACATTTCTCACCACCGAACCCAAAG
CAAACTCATTTTCAACTTCAAAGTCCTTCATTGATGGTAATTGATGAAGAAAAAGAAAACCCAGATGAGAAATTCAAAATCGTGGTTCAAGGATGCGATTTTCCGCCGGC
GAGAACTTACCGCCGGAGCAAGTCGGAGAAACCCAAAAGAAGAGTACTGTCGAAGGAGAGCAAGAAAATGATGGCGAGGAGATCGGAAAGTGTTAAATATGAAGCGAAGG
AATTAGAAGAAAATGAGTTTTCAAAGATGACAGATGAGGAATTGAACAGAAGGGTGGAGGAATTTATTCAAAGATTTAACAGACAGATGAGACTTCAAACTAGAACTGAA
TTAAAGTCTCTCGTTTTATGA
Protein sequence
MSSNSNSPRNLPVVDNSFLASKSKPHTYNKNSNNILAFFSSLFLLSIFIIFFISIFFKNLSFSSLFNSTIFWFFISNTLIFIIAADYALFSLSQHKSFDLYEHFSPPNPK
QTHFQLQSPSLMVIDEEKENPDEKFKIVVQGCDFPPARTYRRSKSEKPKRRVLSKESKKMMARRSESVKYEAKELEENEFSKMTDEELNRRVEEFIQRFNRQMRLQTRTE
LKSLVL
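
Consistency check (illustrative)
The genome location and the protein length listed in this record can be cross-checked with a few lines of Python. This is only a minimal sketch: the helper name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene is a single uninterrupted coding region (which the identical CDS and mRNA sequences above suggest, but which the record does not state explicitly).

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into its parts plus the inclusive span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("LG05:77706308..77706988")
print(seqid, length)          # LG05 681

# A single-exon coding region should be a whole number of codons;
# dropping the stop codon gives the expected number of residues.
print(length % 3 == 0)        # True
print(length // 3 - 1)        # 226
```

Under those assumptions the span is 681 bp, or 227 codons, which agrees with the 226-residue protein sequence above plus one stop codon.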