; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G008310 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G008310
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF819)
Genome locationCG_Chr05:8991109..8996469
RNA-Seq ExpressionClCG05G008310
SyntenyClCG05G008310
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus]4.0e-20180.86Show/hide
Query:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQSQL  LHSK PE+Q PC SS+KFSVGFSR I+MA  P +Q LSSSSL AE+   RFW+FR+SS GNVQ RRDVAV+SHLKLNLPL+SP+DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        VLFSIGAF                             + SEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAF IVLEFLLPLAVPLLLFRADLRRVI
Subjt:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------
        KSTGTLLLAFLLGSVGTT+GTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN +C V                  
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------

Query:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
            VGKDAE EP NKLPVLQSA+A+AVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVI
Subjt:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+FAFVQI+VHL IIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_008439290.1 PREDICTED: uncharacterized membrane protein YjcL-like [Cucumis melo]3.6e-20281.87Show/hide
Query:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQS LA LHS  PE+QPPC SSSKFSVGFSR IAMAP P +Q LSSSSLAA++   RFW+FR+SS GNVQ RRDVAVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        VLFSIGAF                             + SEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAF  VLEFLLPLAVPLLLFRADLRRVI
Subjt:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------
        KSTGTLLLAFLLGSVGTT+GTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN +C V                  
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------

Query:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
            V KDAE EP NKLPVLQSA+AIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVI
Subjt:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+FAFVQI+VHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]2.2e-19174.57Show/hide
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+   PEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAF                             + SEKTKIGSALSGALVSTLVGLAASN GIIASDA
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA

Query:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG
        PAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAG
Subjt:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG

Query:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA
        LAADN +C V                     +VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLA
Subjt:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA

Query:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG
        PSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFG
Subjt:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG

Query:  IAIATFLGIGFGMMVLKYM
        IAIATFLGIGFG+M LKYM
Subjt:  IAIATFLGIGFGMMVLKYM

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]2.2e-19174.57Show/hide
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+   PEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAF                             + SEKTKIGSALSGALVSTLVGLAASN GIIASDA
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA

Query:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG
        PAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAG
Subjt:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG

Query:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA
        LAADN +C V                     +VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLA
Subjt:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA

Query:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG
        PSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFG
Subjt:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG

Query:  IAIATFLGIGFGMMVLKYM
        IAIATFLGIGFG+M LKYM
Subjt:  IAIATFLGIGFGMMVLKYM

XP_038877446.1 uncharacterized membrane protein YjcL-like [Benincasa hispida]1.4e-20983.74Show/hide
Query:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQP-LSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNW
        MASQSQLA+LHSK P  QPPC SS KFSVGFSRRIAMAP PL+QP LSSSSL AE+ GRRFWNFR+SSTGNVQLRRDVAVKSHLKLN+PLISPHDQWGNW
Subjt:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQP-LSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNW

Query:  TVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRV
        TVLFSIGAF                             + SEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRV
Subjt:  TVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRV

Query:  IKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV-----------------
        IKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN +C V                 
Subjt:  IKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV-----------------

Query:  ----DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSV
            DVGK AE EP NKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGN+WSV
Subjt:  ----DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSV

Query:  INTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        INTAPSIF+F+FVQIAVHLAII+GLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSM+IPGILAGIFGIAIATFLGIGFGM+VLKYM
Subjt:  INTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

TrEMBL top hitse value%identityAlignment
A0A0A0L600 Uncharacterized protein1.9e-20180.86Show/hide
Query:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQSQL  LHSK PE+Q PC SS+KFSVGFSR I+MA  P +Q LSSSSL AE+   RFW+FR+SS GNVQ RRDVAV+SHLKLNLPL+SP+DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        VLFSIGAF                             + SEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAF IVLEFLLPLAVPLLLFRADLRRVI
Subjt:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------
        KSTGTLLLAFLLGSVGTT+GTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN +C V                  
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------

Query:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
            VGKDAE EP NKLPVLQSA+A+AVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVI
Subjt:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+FAFVQI+VHL IIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A1S3AZ41 uncharacterized membrane protein YjcL-like1.8e-20281.87Show/hide
Query:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQS LA LHS  PE+QPPC SSSKFSVGFSR IAMAP P +Q LSSSSLAA++   RFW+FR+SS GNVQ RRDVAVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        VLFSIGAF                             + SEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAF  VLEFLLPLAVPLLLFRADLRRVI
Subjt:  VLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------
        KSTGTLLLAFLLGSVGTT+GTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN +C V                  
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN-LCWV------------------

Query:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
            V KDAE EP NKLPVLQSA+AIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVI
Subjt:  ---DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+FAFVQI+VHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X31.1e-19174.57Show/hide
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+   PEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAF                             + SEKTKIGSALSGALVSTLVGLAASN GIIASDA
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA

Query:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG
        PAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAG
Subjt:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG

Query:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA
        LAADN +C V                     +VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLA
Subjt:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA

Query:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG
        PSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFG
Subjt:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG

Query:  IAIATFLGIGFGMMVLKYM
        IAIATFLGIGFG+M LKYM
Subjt:  IAIATFLGIGFGMMVLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X41.1e-19174.57Show/hide
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+   PEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAF                             + SEKTKIGSALSGALVSTLVGLAASN GIIASDA
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA

Query:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG
        PAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAG
Subjt:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG

Query:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA
        LAADN +C V                     +VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLA
Subjt:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA

Query:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG
        PSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFG
Subjt:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG

Query:  IAIATFLGIGFGMMVLKYM
        IAIATFLGIGFG+M LKYM
Subjt:  IAIATFLGIGFGMMVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X11.1e-19174.57Show/hide
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+   PEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAF                             + SEKTKIGSALSGALVSTLVGLAASN GIIASDA
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDA

Query:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG
        PAFP+VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAG
Subjt:  PAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAG

Query:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA
        LAADN +C V                     +VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLA
Subjt:  LAADN-LCWV---------------------DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLA

Query:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG
        PSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFG
Subjt:  PSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFG

Query:  IAIATFLGIGFGMMVLKYM
        IAIATFLGIGFG+M LKYM
Subjt:  IAIATFLGIGFGMMVLKYM

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL4.0e-3429.95Show/hide
Query:  EKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKI
        ++ K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+PLLLF+ ++R++ K +  LL  FL+ SVGT +G+++A+FL+       D  KI
Subjt:  EKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKI

Query:  AAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN---------------LCWV----------------DVGKDAEAEPGNK-LPVLQSATAIAVSF
           +   +IGG VN+ A++         ++A + ADN               L W                 + G  AE+    K + +   A     +F
Subjt:  AAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADN---------------LCWV----------------DVGKDAEAEPGNK-LPVLQSATAIAVSF

Query:  AICKVGSYLTKYF----------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAI
        A+  V   ++ YF          G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   ++  ++  AP I +F F+    +LA+
Subjt:  AICKVGSYLTKYF----------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAI

Query:  IIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
         +  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  IIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)5.4e-11153.86Show/hide
Query:  VKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIV
        VK + +L  PLISP D W  W  LF+ GAF                             + SEKTKIGS +SGAL STL+GLAASN  +I  + P++   
Subjt:  VKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIV

Query:  LEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNL
        +EFLLP  +PLLLFRADLRR+I+STG+LLLAFL+GSV T +GTVVA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS+AL +SPSV+AAG+A DN+
Subjt:  LEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNL

Query:  CW-------------------------VDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSG
                                    D+ KD + E  N+  V+ ++ A++VSF ICK    LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS 
Subjt:  CW-------------------------VDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSG

Query:  EAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAI
        E +++ILMQVFF ++GA+G+VW+VINTAPSIF+FA +Q+ VHLA+ + LGKL   D+K LL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++I
Subjt:  EAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAI

Query:  ATFLGIGFGMMVLK
        ATFLGIG G+ VLK
Subjt:  ATFLGIGFGMMVLK

AT5G52540.1 Protein of unknown function (DUF819)2.3e-13865.24Show/hide
Query:  RDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPA
        R V V S   L+ PLISP+D+WG WT LF+ GA                            L L SEKTK+G+A+SGALVSTLVGLAASN GII+S APA
Subjt:  RDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPA

Query:  FPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLA
        F +VL FLLPLAVPLLLFRADLRRV++STG LLLAFL+GSV TT+GT +AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+PSVLAAGLA
Subjt:  FPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLA

Query:  ADNLCW-------------------------VDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYL
        ADN+                           VD   +  +E  NK+PVL  AT IAVS AICK G+ LTKYFGI GGS+PAITAV+V+LAT+FP  F  L
Subjt:  ADNLCW-------------------------VDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYL

Query:  APSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIF
        APSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+FA VQI  HLA+I+G+GKLL  +L+ LL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIF
Subjt:  APSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIF

Query:  GIAIATFLGIGFGMMVLKYM
        GIAIATF+GI FG+ VLK+M
Subjt:  GIAIATFLGIGFGMMVLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATACTTTATATCGCCGAATTTTAAGTGATGAAGTCAATTGTTGGCGTTCCGTCTCCGTCCGATACACCTTGCCGGAGATGGCTTCACAGTCACAGCTCGCAGATCT
TCACTCGAAGTTGCCTGAGGTACAGCCTCCATGTTCGTCTTCCTCCAAATTCTCTGTTGGATTCTCCAGGAGGATCGCTATGGCACCTCTACCACTGGTGCAACCGTTAT
CGTCGTCTTCATTAGCTGCTGAAGTTGCAGGCCGGAGATTCTGGAACTTTCGCAGATCCAGCACCGGAAATGTTCAATTGAGACGAGATGTTGCTGTTAAATCTCATCTG
AAATTGAATCTCCCCCTCATTTCTCCGCACGACCAATGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGACCTCAGATCCCTGAGATTAATGGAGATGGAATT
TAGTATGTACACTCATAAGAAGAACTTGGCAATGTTAGGATTTAGCTTGATTCTTAAGTCTGAGAAAACAAAGATTGGCAGTGCATTGAGCGGTGCCTTAGTGAGCACAT
TGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTTTCCTATTGTTTTGGAGTTTTTGCTTCCGCTAGCAGTTCCTTTGCTGTTATTTAGA
GCAGATTTGCGTCGTGTAATAAAGTCAACAGGGACACTTCTCTTGGCTTTTTTGTTAGGTTCAGTTGGAACAACAATTGGAACTGTAGTGGCCTATTTTCTTGTACCAAT
GCGATCACTTGGTCAAGACAGTTGGAAAATTGCCGCGGCACTGATGGGAAGACATATTGGTGGAGCTGTCAACTATGTTGCTATATCTGATGCTCTTGGTGTTTCTCCAT
CAGTATTAGCAGCTGGACTTGCTGCAGATAATTTATGTTGGGTAGATGTTGGGAAGGACGCAGAAGCTGAGCCTGGCAACAAGCTTCCGGTGTTGCAATCTGCCACAGCC
ATTGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATAACGGCCGTGATTGTTGTCTTAGCAAC
CATTTTTCCTAAGCTGTTTGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTGGTGGGAGCAAGTGGAAATGTATGGAGTG
TCATCAACACTGCACCAAGCATCTTCGTATTTGCTTTTGTCCAGATTGCAGTCCATCTTGCCATAATCATTGGTCTTGGAAAGCTGCTTCGCTTCGATCTAAAGTCGTTG
CTGATAGCATCAAATGCCAATGTCGGAGGTCCTACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCTGGAATTTTCGG
AATCGCTATTGCAACTTTCCTAGGGATTGGGTTTGGAATGATGGTCTTGAAATACATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATACTTTATATCGCCGAATTTTAAGTGATGAAGTCAATTGTTGGCGTTCCGTCTCCGTCCGATACACCTTGCCGGAGATGGCTTCACAGTCACAGCTCGCAGATCT
TCACTCGAAGTTGCCTGAGGTACAGCCTCCATGTTCGTCTTCCTCCAAATTCTCTGTTGGATTCTCCAGGAGGATCGCTATGGCACCTCTACCACTGGTGCAACCGTTAT
CGTCGTCTTCATTAGCTGCTGAAGTTGCAGGCCGGAGATTCTGGAACTTTCGCAGATCCAGCACCGGAAATGTTCAATTGAGACGAGATGTTGCTGTTAAATCTCATCTG
AAATTGAATCTCCCCCTCATTTCTCCGCACGACCAATGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGACCTCAGATCCCTGAGATTAATGGAGATGGAATT
TAGTATGTACACTCATAAGAAGAACTTGGCAATGTTAGGATTTAGCTTGATTCTTAAGTCTGAGAAAACAAAGATTGGCAGTGCATTGAGCGGTGCCTTAGTGAGCACAT
TGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTTTCCTATTGTTTTGGAGTTTTTGCTTCCGCTAGCAGTTCCTTTGCTGTTATTTAGA
GCAGATTTGCGTCGTGTAATAAAGTCAACAGGGACACTTCTCTTGGCTTTTTTGTTAGGTTCAGTTGGAACAACAATTGGAACTGTAGTGGCCTATTTTCTTGTACCAAT
GCGATCACTTGGTCAAGACAGTTGGAAAATTGCCGCGGCACTGATGGGAAGACATATTGGTGGAGCTGTCAACTATGTTGCTATATCTGATGCTCTTGGTGTTTCTCCAT
CAGTATTAGCAGCTGGACTTGCTGCAGATAATTTATGTTGGGTAGATGTTGGGAAGGACGCAGAAGCTGAGCCTGGCAACAAGCTTCCGGTGTTGCAATCTGCCACAGCC
ATTGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATAACGGCCGTGATTGTTGTCTTAGCAAC
CATTTTTCCTAAGCTGTTTGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTGGTGGGAGCAAGTGGAAATGTATGGAGTG
TCATCAACACTGCACCAAGCATCTTCGTATTTGCTTTTGTCCAGATTGCAGTCCATCTTGCCATAATCATTGGTCTTGGAAAGCTGCTTCGCTTCGATCTAAAGTCGTTG
CTGATAGCATCAAATGCCAATGTCGGAGGTCCTACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCTGGAATTTTCGG
AATCGCTATTGCAACTTTCCTAGGGATTGGGTTTGGAATGATGGTCTTGAAATACATGTGAAACCATTCAAATTTAGATCATGAAATCCCTTACTCTCTCCAAAAGTGTG
TAAACTTATGTCAAGGGAATGACTTTTTACAATGGTTTTCTCAAAAAATAATAAAGAAAAAAGAAAAGACACATTGGAGGAGAGCATGTTGTCTTAGTGTGCACTTGCAG
GCAGTGTATTTAGAGTATCTCTTTGCATTTCTCGTTTTAATGGATTAAAACACCCATTTTAAGTCTAAAATGAGTTTTGTCCCACTTTTTAAAACTCAAGATTTTTGACC
TCTTGTTTAGATCATTTTTAGGCAAGTTTACAAAATTACCCTCACTTTCTTTTTTCCTCTTCTTTCCTCTTATGTGTTTTTTTTTCCTCCT
Protein sequenceShow/hide protein sequence
MYTLYRRILSDEVNCWRSVSVRYTLPEMASQSQLADLHSKLPEVQPPCSSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHL
KLNLPLISPHDQWGNWTVLFSIGAFDLRSLRLMEMEFSMYTHKKNLAMLGFSLILKSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFR
ADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNLCWVDVGKDAEAEPGNKLPVLQSATA
IAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSL
LIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM