CuGenDBv2

Clc05G10660 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G10660
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Protein of unknown function (DUF819)
Genome location: ClcChr05:8607694..8613616
RNA-Seq Expression: Clc05G10660
Synteny: Clc05G10660
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR008537 - Protein of unknown function DUF819
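
The genome location field above packs the chromosome name and both coordinates into one string. A minimal sketch of splitting it apart follows; the helper name `parse_location` is hypothetical, and the inclusive span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr05:8607694..8613616")
# Inclusive length, ASSUMING 1-based inclusive coordinates (not stated in the record).
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 5923 bp, which includes introns and untranslated regions as well as the coding sequence.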


Homology
GenBank top hits (columns: e value, %identity, alignment)
XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus]; e value: 7.9e-226; %identity: 91.14
Query:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQSQL  LHSK PE+Q PC SS+KFSVGFSR I+MA  P +Q LSSSSL AE+   RFW+FR+SS GNVQ RRDVAV+SHLKLNLPL+SP+DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV
        VLFSIGAFGIWSEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAF IVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPT L+N VGKDAE EP NKLPVLQSA+A+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQI+VHL IIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_008439290.1 PREDICTED: uncharacterized membrane protein YjcL-like [Cucumis melo]; e value: 6.1e-226; %identity: 92.01
Query:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQS LA LHS  PE+QPPC SSSKFSVGFSR IAMAP P +Q LSSSSLAA++   RFW+FR+SS GNVQ RRDVAVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV
        VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAF  VLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPE T L+N V KDAE EP NKLPVLQSA+AIAV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQI+VHLAIIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]; e value: 3.1e-214; %identity: 83.5
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+  SPEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVI
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI
        KSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT 
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI

Query:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
         +++VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVI
Subjt:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]; e value: 3.1e-214; %identity: 83.5
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+  SPEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVI
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI
        KSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT 
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI

Query:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
         +++VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVI
Subjt:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_038877446.1 uncharacterized membrane protein YjcL-like [Benincasa hispida]; e value: 2.1e-234; %identity: 94.4
Query:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQP-LSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNW
        MASQSQLA+LHSK P  QPPCLSS KFSVGFSRRIAMAP PL+QP LSSSSL AE+ GRRFWNFR+SSTGNVQLRRDVAVKSHLKLN+PLISPHDQWGNW
Subjt:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQP-LSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNW

Query:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFL
        TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFL
Subjt:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFL

Query:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIA
        VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPE TIL+NDVGK AE EP NKLPVLQSATAIA
Subjt:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIA

Query:  VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKL
        VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGN+WSVINTAPSIF+F+FVQIAVHLAII+GLGKL
Subjt:  VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKL

Query:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LRFDLK LLIASNANVGGPTTACGMATAKGWSSM+IPGILAGIFGIAIATFLGIGFGM+VLKYM
Subjt:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

TrEMBL top hits (columns: e value, %identity, alignment)
A0A0A0L600 Uncharacterized protein; e value: 3.8e-226; %identity: 91.14
Query:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQSQL  LHSK PE+Q PC SS+KFSVGFSR I+MA  P +Q LSSSSL AE+   RFW+FR+SS GNVQ RRDVAV+SHLKLNLPL+SP+DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV
        VLFSIGAFGIWSEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAF IVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPT L+N VGKDAE EP NKLPVLQSA+A+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQI+VHL IIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A1S3AZ41 uncharacterized membrane protein YjcL-like; e value: 2.9e-226; %identity: 92.01
Query:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT
        MASQS LA LHS  PE+QPPC SSSKFSVGFSR IAMAP P +Q LSSSSLAA++   RFW+FR+SS GNVQ RRDVAVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV
        VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAF  VLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPE T L+N V KDAE EP NKLPVLQSA+AIAV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMA+ILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQI+VHLAIIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X3; e value: 1.5e-214; %identity: 83.5
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+  SPEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVI
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI
        KSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT 
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI

Query:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
         +++VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVI
Subjt:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X4; e value: 1.5e-214; %identity: 83.5
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+  SPEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVI
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI
        KSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT 
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI

Query:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
         +++VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVI
Subjt:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X1; e value: 1.5e-214; %identity: 83.5
Query:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ
        ++ LY  I LSDEV+C  S+SV+  SPEMA  SQ+A L SK P++Q PC SS+K S  F R I MAP P V P+SSSS AAE+  RRFWNF  +S+GN  
Subjt:  MYTLYRRI-LSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQ

Query:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI
        LRR +AVKSHLKLNLPLISPHDQW NWTVLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAFP+VLE LLPL++PLLLFRADLRRVI
Subjt:  LRRDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVI

Query:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI
        KSTGTLLLAFLLGSVGT IGT VAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT 
Subjt:  KSTGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTI

Query:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI
         +++VGKD EAE  NKLPVLQSATA+AVSFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVI
Subjt:  LNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVI

Query:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        NTAPSIF+F+ VQIAVHLA+ IGLGKLLRFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  NTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

SwissProt top hits (columns: e value, %identity, alignment)
O31634 Uncharacterized membrane protein YjcL; e value: 2.4e-39; %identity: 31.11
Query:  LISPHDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+PLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPPEPTI-LNNDVG
        GT +G+++A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A+ F  L ++ +         +P E  +  + + G
Subjt:  GTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPPEPTI-LNNDVG

Query:  KDAEAEPGNK-LPVLQSATAIAVSFAICKVGSYLTKYF----------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGN
          AE+    K + +   A     +FA+  V   ++ YF          G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDAEAEPGNK-LPVLQSATAIAVSFAICKVGSYLTKYF----------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGN

Query:  VWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
        +  ++  AP I +F F+    +LA+ +  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  VWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hits (columns: e value, %identity, alignment)
AT5G24000.1 Protein of unknown function (DUF819); e value: 1.5e-129; %identity: 62.18
Query:  VKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTL
        VK + +L  PLISP D W  W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN  +I  + P++   +EFLLP  +PLLLFRADLRR+I+STG+L
Subjt:  VKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTL

Query:  LLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNN---
        LLAFL+GSV T +GTVVA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS+AL +SPSV+AAG+A DNVICA++F  LFALASK+PPE    ++   
Subjt:  LLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNN---

Query:  DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTA
        D+ KD + E  N+  V+ ++ A++VSF ICK    LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS E +++ILMQVFF ++GA+G+VW+VINTA
Subjt:  DVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTA

Query:  PSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK
        PSIF+FA +Q+ VHLA+ + LGKL   D+K LL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  PSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK

AT5G52540.1 Protein of unknown function (DUF819); e value: 3.2e-156; %identity: 74.05
Query:  RDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKS
        R V V S   L+ PLISP+D+WG WT LF+ GA G+WSEKTK+G+A+SGALVSTLVGLAASN GII+S APAF +VL FLLPLAVPLLLFRADLRRV++S
Subjt:  RDVAVKSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKS

Query:  TGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVP----PEP
        TG LLLAFL+GSV TT+GT +AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+PSVLAAGLAADNVICAVYF TLFAL SK+P    P P
Subjt:  TGTLLLAFLLGSVGTTIGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVP----PEP

Query:  TILNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWS
        T +  D   +  +E  NK+PVL  AT IAVS AICK G+ LTKYFGI GGS+PAITAV+V+LAT+FP  F  LAPSGEAMA+ILMQVFF VVGASGN+WS
Subjt:  TILNNDVGKDAEAEPGNKLPVLQSATAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWS

Query:  VINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        VINTAPSIF+FA VQI  HLA+I+G+GKLL  +L+ LL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIFGIAIATF+GI FG+ VLK+M
Subjt:  VINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
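
In the alignments above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A rough sketch of estimating percent identity from one Query/midline pair follows. The function name `midline_identity` is hypothetical, and the result covers only the columns passed in, so it is not expected to reproduce the page's reported %identity unless the full alignment is supplied.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a row) are easily lost when text is copied.
    """
    if not query:
        raise ValueError("empty alignment row")
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# First 13 columns of the first GenBank alignment row above.
print(round(midline_identity("MASQSQLADLHSK", "MASQSQL  LHSK"), 2))
```

Columns marked `+` are deliberately not counted, since they indicate similarity rather than identity.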


Sequences
CDS sequence
ATGTATACTTTATATCGCCGAATTTTAAGTGATGAAGTCAATTGTTGGCGTTCCGTCTCCGTCCGATACACCTCGCCGGAGATGGCTTCACAGTCACAGCTCGCAGATCT
TCACTCGAAGTTGCCTGAGGTACAGCCTCCATGTTTGTCTTCCTCCAAATTCTCTGTTGGATTCTCCAGGAGGATCGCTATGGCACCTCTACCACTGGTGCAACCGTTAT
CGTCGTCTTCATTAGCTGCTGAAGTTGCAGGCCGGAGATTCTGGAACTTTCGCAGATCCAGCACCGGAAATGTTCAATTGAGACGAGATGTTGCTGTTAAATCTCATCTG
AAATTGAATCTCCCCCTCATTTCTCCGCACGACCAATGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGGTATCTGGTCTGAGAAAACAAAGATTGGCAGTGC
ATTGAGCGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTTTCCTATTGTTTTGGAGTTTTTGCTTCCGC
TAGCAGTTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAAGTCAACAGGGACACTTCTCTTGGCTTTTTTGTTAGGTTCAGTTGGAACAACAATTGGAACT
GTAGTGGCCTATTTTCTTGTACCAATGCGATCACTTGGTCAAGACAGTTGGAAAATTGCCGCGGCACTGATGGGAAGACATATTGGTGGAGCTGTCAACTATGTTGCTAT
ATCTGATGCTCTTGGTGTTTCTCCATCAGTATTAGCAGCTGGACTTGCTGCAGATAATGTAATTTGTGCAGTGTATTTTGCAACACTGTTTGCATTAGCATCTAAAGTAC
CTCCTGAACCTACGATATTGAATAATGATGTTGGGAAGGACGCAGAAGCTGAGCCTGGCAACAAGCTTCCGGTGTTGCAATCTGCCACAGCCATTGCTGTATCATTTGCC
ATTTGTAAAGTTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATAACGGCCGTGATTGTTGTCTTAGCAACCATTTTTCCTAAGCTGTT
TGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTGGTGGGAGCAAGTGGAAATGTATGGAGTGTCATCAACACTGCACCAA
GCATCTTCGTATTTGCTTTTGTCCAGATTGCAGTCCATCTTGCCATAATCATTGGTCTTGGAAAGCTGCTTCGCTTCGATCTAAAGTCGTTGCTGATAGCATCAAATGCC
AATGTCGGAGGTCCTACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCTGGAATTTTCGGAATCGCTATTGCAACTTT
CCTAGGGATTGGGTTTGGAATGATGGTCTTGAAATACATGTGA
mRNA sequence
AAATTTTCCTTTAAATTTATAGGAGGTACTTATCGAAGATTTAGTATGTATACTTTATATCGCCGAATTTTAAGTGATGAAGTCAATTGTTGGCGTTCCGTCTCCGTCCG
ATACACCTCGCCGGAGATGGCTTCACAGTCACAGCTCGCAGATCTTCACTCGAAGTTGCCTGAGGTACAGCCTCCATGTTTGTCTTCCTCCAAATTCTCTGTTGGATTCT
CCAGGAGGATCGCTATGGCACCTCTACCACTGGTGCAACCGTTATCGTCGTCTTCATTAGCTGCTGAAGTTGCAGGCCGGAGATTCTGGAACTTTCGCAGATCCAGCACC
GGAAATGTTCAATTGAGACGAGATGTTGCTGTTAAATCTCATCTGAAATTGAATCTCCCCCTCATTTCTCCGCACGACCAATGGGGCAACTGGACTGTTTTATTCTCCAT
AGGAGCCTTCGGTATCTGGTCTGAGAAAACAAAGATTGGCAGTGCATTGAGCGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCAT
CTGATGCTCCAGCTTTTCCTATTGTTTTGGAGTTTTTGCTTCCGCTAGCAGTTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAAGTCAACAGGGACACTT
CTCTTGGCTTTTTTGTTAGGTTCAGTTGGAACAACAATTGGAACTGTAGTGGCCTATTTTCTTGTACCAATGCGATCACTTGGTCAAGACAGTTGGAAAATTGCCGCGGC
ACTGATGGGAAGACATATTGGTGGAGCTGTCAACTATGTTGCTATATCTGATGCTCTTGGTGTTTCTCCATCAGTATTAGCAGCTGGACTTGCTGCAGATAATGTAATTT
GTGCAGTGTATTTTGCAACACTGTTTGCATTAGCATCTAAAGTACCTCCTGAACCTACGATATTGAATAATGATGTTGGGAAGGACGCAGAAGCTGAGCCTGGCAACAAG
CTTCCGGTGTTGCAATCTGCCACAGCCATTGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAAT
AACGGCCGTGATTGTTGTCTTAGCAACCATTTTTCCTAAGCTGTTTGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTATGATTCTAATGCAGGTTTTCTTTGCTGTGG
TGGGAGCAAGTGGAAATGTATGGAGTGTCATCAACACTGCACCAAGCATCTTCGTATTTGCTTTTGTCCAGATTGCAGTCCATCTTGCCATAATCATTGGTCTTGGAAAG
CTGCTTCGCTTCGATCTAAAGTCGTTGCTGATAGCATCAAATGCCAATGTCGGAGGTCCTACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTAT
TCCTGGAATTCTTGCTGGAATTTTCGGAATCGCTATTGCAACTTTCCTAGGGATTGGGTTTGGAATGATGGTCTTGAAATACATGTGAAACCATTCAAATTTAGATCATG
AAATCCCTTACTCTCTCCAAAAGTGTGTAAACTTATGTCAAGGGAATGACTTTTTACAATGGTTTTCTCAAAAAATAATAAAGAAAAAAGAAAAGACACATTGGAGGAGA
GCATGTTGTCTTAGTGTGCACTTGCAGGCAGTGTATTTAGAGTATCTCTTTGCATTTCTCGTTTTAATGGATTAAAACACCCATTTTAAGTCTAAAATGAGTTTTGTCCC
ACTTTTTAAAACTCAAGATTTTTGACCTCTTGTTTAGATCATTTTTAGGCAAGTTTACAAAATTACCCTCACTTTCTTTTTTCCTCTTCTTTCCTCTTATGTGTTTTTTT
TTCCTCCTCCTTTGAACCAATTCCGATGAAGTTCTTTTTTCTTTTTCTTTCCTTTTTTTCTCCTTTAACTTCAATCGACTTCTTCTTCTCCCTCATTTTAGTTTGTGATG
CAGAATGATCTTGAGCGAGAGTGAGAGTGCGGGAGAGTGAGGAAATTGACAAAAAGAAAAGGTTGAGAGAGTTAGGAGAGCTAAGAGCGAAAGTAAAGGCCAATCGGCGT
TCATGAGGTTACATTTTGTAATTTCATCCGAATGTGACGTCAACAAGGGGTTAAAAAAACCTCAAATTTAAAAGGTGGGGAAACTTTCATTTTCTTCCCCATTTTTGGGT
TATCCATTCGCGGACCCAGTTTTTATCTTAGATTCGGATACTCTGTGTAGTAGTTATGTTTGTGATCTTAGAGCATTTCTTTTTGAAACTAAATAAATTGAAGTCATACT
AGTTGTTTGTTGTACAAATAGAAATGTGATAAATTGTATATTACAAGCCAAATTAAGCATAGCTCAATTGATATATGAAACAAG
Protein sequence
MYTLYRRILSDEVNCWRSVSVRYTSPEMASQSQLADLHSKLPEVQPPCLSSSKFSVGFSRRIAMAPLPLVQPLSSSSLAAEVAGRRFWNFRRSSTGNVQLRRDVAVKSHL
KLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFPIVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTIGT
VVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASKVPPEPTILNNDVGKDAEAEPGNKLPVLQSATAIAVSFA
ICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMAMILMQVFFAVVGASGNVWSVINTAPSIFVFAFVQIAVHLAIIIGLGKLLRFDLKSLLIASNA
NVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
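
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below uses the standard genetic code, on the assumption that this nuclear gene is translated with it; the `translate` helper is hypothetical and written in plain Python rather than taken from any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first seven codons and the last five codons of the CDS listed above.
print(translate("ATGTATACTTTATATCGCCGA"))
print(translate("TTGAAATACATGTGA"))
```

Passing the whole CDS, with its line breaks, to `translate` and comparing the result with the protein sequence joined into one string is a quick consistency check on the record.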