; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G015290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G015290
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr09:23388009..23402507
RNA-Seq ExpressionLsi09G015290
SyntenyLsi09G015290
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK02988.1 MuDR family transposase isoform 1 [Cucumis melo var. makuwa]0.0e+0076.3Show/hide
Query:  RYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFS
        RYTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM            
Subjt:  RYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFS

Query:  FCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWV
                      VVEQ +EMGRQSAFTLEGQ         +     + V  KDG    IG   Y      ++  N +   +  ++S           +
Subjt:  FCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWV

Query:  QSM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET
        ++M +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VET
Subjt:  QSM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET

Query:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
        II +DQIGMDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
Subjt:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ

Query:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
        LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
Subjt:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG

Query:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD
        SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVAD
Subjt:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD

Query:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI
        FQNGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHI
Subjt:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI

Query:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ
        TSNFG QFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQ
Subjt:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ

Query:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN
        LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHN
Subjt:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN

Query:  KKTCKDS
        KKTCKDS
Subjt:  KKTCKDS

XP_004138593.1 uncharacterized protein LOC101203713 isoform X1 [Cucumis sativus]0.0e+0075.8Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINES+TDDNVQMT                              +IISQELNNV KK MDDSDPAIESSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHVKDGKWRIGFA---LYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ
                     VVEQ +EMGRQSAFTLEGQ         +     + VK+    + +     Y      ++  N +   +  ++S         S   
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHVKDGKWRIGFA---LYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ

Query:  SMVATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETII
          +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VE II
Subjt:  SMVATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETII

Query:  AIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI
         +DQIGMDIS+E+PL+CVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI
Subjt:  AIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI

Query:  CIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV
        CIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV
Subjt:  CIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV

Query:  ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQ
        ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGED IFPAAFAVVDAETEENWHWFLLELKSAVKRSE ITFVADFQ
Subjt:  ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQ

Query:  NGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITS
        NGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHITS
Subjt:  NGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITS

Query:  NFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLT
        NFGQQFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQW+TKLTP NEEKLQKEISIARSFQVSLSHG+ FEVRGESV  VDVDNWDCSCK WQLT
Subjt:  NFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLT

Query:  GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKK
        GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKK
Subjt:  GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKK

Query:  TCKDS
        TCKDS
Subjt:  TCKDS

XP_008458283.1 PREDICTED: uncharacterized protein LOC103497748 isoform X1 [Cucumis melo]0.0e+0076.27Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ
                     VVEQ +EMGRQSAFTLEGQ         +     + V  KDG    IG   Y      ++  N +   +  ++S           ++
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ

Query:  SM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETI
        +M +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VETI
Subjt:  SM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETI

Query:  IAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL
        I +DQIGMDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL
Subjt:  IAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL

Query:  ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS
        ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS
Subjt:  ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS

Query:  VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADF
        VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVADF
Subjt:  VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADF

Query:  QNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHIT
        QNGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHIT
Subjt:  QNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHIT

Query:  SNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQL
        SNFG QFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQL
Subjt:  SNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQL

Query:  TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNK
        TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNK
Subjt:  TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNK

Query:  KTCKDS
        KTCKDS
Subjt:  KTCKDS

XP_008458284.1 PREDICTED: uncharacterized protein LOC103497748 isoform X2 [Cucumis melo]0.0e+0076.64Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHPGLLPLAHDHVKDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM-VATY
                     VVEQ +EMGRQSAFTLEG              KDG    IG   Y      ++  N +   +  ++S           +++M +  +
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHPGLLPLAHDHVKDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM-VATY

Query:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETIIAIDQIG
        L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VETII +DQIG
Subjt:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETIIAIDQIG

Query:  MDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN
        MDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN
Subjt:  MDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN

Query:  TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK
        TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK
Subjt:  TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK

Query:  DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQNGLKKS
        DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVADFQNGL KS
Subjt:  DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQNGLKKS

Query:  LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF
        LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHITSNFG QF
Subjt:  LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF

Query:  YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCH
        YSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQLTGLPCCH
Subjt:  YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCH

Query:  AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

XP_038906599.1 uncharacterized protein LOC120092551 isoform X1 [Benincasa hispida]0.0e+0077.73Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        Y NTPANKRVELFDERGINESTTDDNVQ+TS                             +IISQELNNV +KTMDDSDPAIESSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ--GHPGLLPLAHDHVKDGKWRIGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSMVATY
                     VVEQ +EMGRQSAFTLE Q      ++ +     +  K + G   Y           +Y++ +    +   F        ++ VA  
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ--GHPGLLPLAHDHVKDGKWRIGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSMVATY

Query:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMG-------VINGFLRYASIKINICILISHGLC-FLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET
           +I  +S       +K FL    P   + + S+        +IN FL+  SI +++ IL        L       SSRTTVSEAVVPVVEP DVAVET
Subjt:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMG-------VINGFLRYASIKINICILISHGLC-FLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET

Query:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
        +IAIDQIGMD+SNE+PL+CVPAGS+DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
Subjt:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ

Query:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
        LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
Subjt:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG

Query:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD
        SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE ITFVAD
Subjt:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD

Query:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI
        FQNGL K LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHI
Subjt:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI

Query:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ
        TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQW TKLTPANEEKLQKE SIARSFQVSLSHGSTFEVRGESVDIVDVD+WDCSCKGWQ
Subjt:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ

Query:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN
        LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN
Subjt:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN

Query:  KKTCKDS
        KKTCKDS
Subjt:  KKTCKDS

TrEMBL top hitse value%identityAlignment
A0A0A0KCW9 SWIM-type domain-containing protein0.0e+0075.8Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINES+TDDNVQMT                              +IISQELNNV KK MDDSDPAIESSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHVKDGKWRIGFA---LYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ
                     VVEQ +EMGRQSAFTLEGQ         +     + VK+    + +     Y      ++  N +   +  ++S         S   
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHVKDGKWRIGFA---LYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ

Query:  SMVATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETII
          +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VE II
Subjt:  SMVATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETII

Query:  AIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI
         +DQIGMDIS+E+PL+CVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI
Subjt:  AIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLI

Query:  CIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV
        CIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV
Subjt:  CIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSV

Query:  ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQ
        ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGED IFPAAFAVVDAETEENWHWFLLELKSAVKRSE ITFVADFQ
Subjt:  ASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQ

Query:  NGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITS
        NGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHITS
Subjt:  NGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITS

Query:  NFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLT
        NFGQQFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQW+TKLTP NEEKLQKEISIARSFQVSLSHG+ FEVRGESV  VDVDNWDCSCK WQLT
Subjt:  NFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLT

Query:  GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKK
        GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKK
Subjt:  GLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKK

Query:  TCKDS
        TCKDS
Subjt:  TCKDS

A0A1S3C712 uncharacterized protein LOC103497748 isoform X20.0e+0076.64Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHPGLLPLAHDHVKDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM-VATY
                     VVEQ +EMGRQSAFTLEG              KDG    IG   Y      ++  N +   +  ++S           +++M +  +
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHPGLLPLAHDHVKDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM-VATY

Query:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETIIAIDQIG
        L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VETII +DQIG
Subjt:  LGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETIIAIDQIG

Query:  MDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN
        MDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN
Subjt:  MDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMN

Query:  TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK
        TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK
Subjt:  TNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTK

Query:  DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQNGLKKS
        DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVADFQNGL KS
Subjt:  DDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADFQNGLKKS

Query:  LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF
        LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHITSNFG QF
Subjt:  LGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF

Query:  YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCH
        YSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQLTGLPCCH
Subjt:  YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCH

Query:  AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS
        AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNKKTCKDS
Subjt:  AIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNKKTCKDS

A0A1S3C8R6 uncharacterized protein LOC103497748 isoform X10.0e+0076.27Show/hide
Query:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF
        YTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM             
Subjt:  YTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSF

Query:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ
                     VVEQ +EMGRQSAFTLEGQ         +     + V  KDG    IG   Y      ++  N +   +  ++S           ++
Subjt:  CFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQ

Query:  SM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETI
        +M +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VETI
Subjt:  SM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETI

Query:  IAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL
        I +DQIGMDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL
Subjt:  IAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQL

Query:  ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS
        ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS
Subjt:  ICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGS

Query:  VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADF
        VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVADF
Subjt:  VASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVADF

Query:  QNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHIT
        QNGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHIT
Subjt:  QNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHIT

Query:  SNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQL
        SNFG QFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQL
Subjt:  SNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQL

Query:  TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNK
        TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHNK
Subjt:  TGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHNK

Query:  KTCKDS
        KTCKDS
Subjt:  KTCKDS

A0A5D3BVG7 MuDR family transposase isoform 10.0e+0076.3Show/hide
Query:  RYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFS
        RYTNTPANKRVELFDERGINESTTDDNVQMTS                             +I SQELNNV KK MDDSDPAI+SSKM            
Subjt:  RYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFS

Query:  FCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWV
                      VVEQ +EMGRQSAFTLEGQ         +     + V  KDG    IG   Y      ++  N +   +  ++S           +
Subjt:  FCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQGHP-----GLLPLAHDHV--KDGKWR-IGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWV

Query:  QSM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET
        ++M +  +L  N   +    K   +K+ +  L   +   +F +         +++  +                   SSRTTVSEAVVPVVEPVDV VET
Subjt:  QSM-VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVET

Query:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
        II +DQIGMDIS+E+PL+CVPAGS DEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ
Subjt:  IIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQ

Query:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
        LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG
Subjt:  LICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPG

Query:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD
        SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFF TATAVDGED IFPAAFAVVDAETEENW WFLLELKSAVKRSE ITFVAD
Subjt:  SVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFVAD

Query:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI
        FQNGL KSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALA+KLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYNHI
Subjt:  FQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHI

Query:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ
        TSNFG QFYSSISEAHELPITQMIDVLRGKMMETIY RRVESDQWMTKLTP NEEKLQKEISIARSFQVSLSHGS FEVRGESV IVDVDNWDCSCK WQ
Subjt:  TSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQ

Query:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN
        LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGEST+A+VTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCK LGHN
Subjt:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLGHN

Query:  KKTCKDS
        KKTCKDS
Subjt:  KKTCKDS

A0A6J1CAN4 uncharacterized protein LOC111009921 isoform X10.0e+0074.48Show/hide
Query:  LERRYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAH
        LE RYTNT ANKR+ELF ERG +E TTDDN++MTS L                              SQEL NV KK MDD+DPAIE SKM         
Subjt:  LERRYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAH

Query:  AFSFCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ--GHPGLLPLAHDHVKDGKWRIGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM
                         VVEQ VE+GRQSA  LEGQ      ++ +     +  K + G   Y           +Y++ +    +   F        ++ 
Subjt:  AFSFCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ--GHPGLLPLAHDHVKDGKWRIGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSM

Query:  VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYA-----SIKINICILISHGLC-FLFTRAFKMSSRTTVSEAVVPVVEPVDVAV
        VA     +I  +S       +K FL    P     + S+       R       SI +++ IL        L       SSRTTVSEAVVPVV PVDVAV
Subjt:  VATYLGFNILRVSLTPKYCRVKQFLILLYPKINEVIFSMGVINGFLRYA-----SIKINICILISHGLC-FLFTRAFKMSSRTTVSEAVVPVVEPVDVAV

Query:  ETIIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLST
        ETIIAIDQIGMD+S+E+PL+CVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYS+AHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLST
Subjt:  ETIIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQQWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLST

Query:  TQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN
        TQLICIKKMNT+HSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFC+KIKETN
Subjt:  TQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN

Query:  PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFV
        PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQG F TATA DGEDGIFPAAFAVVDAET+ENWHWFLL+LKSAV+ SE ITFV
Subjt:  PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSE-ITFV

Query:  ADFQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYN
        ADFQNGL KSL EIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCA SI+GISPDAYNWIIQS+PEHWANAFFGGARYN
Subjt:  ADFQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYN

Query:  HITSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKG
        HITSNFGQQFY S+SEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTP+NEEKLQKEISIARSFQV LSHGSTFEVRGESVDIVDVD+WDCSCKG
Subjt:  HITSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKG

Query:  WQLTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLG
        WQLTGLPCCHAIAV +CIGRSPYDYC RYFTVE+YRLTYAESIHP+PNVDR ILGEST+ LVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLG
Subjt:  WQLTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVVKRQLQCSKCKGLG

Query:  HNKKTCKDS
        HNKKTCKDS
Subjt:  HNKKTCKDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48240.1 novel plant snare 124.7e-0429.71Show/hide
Query:  LERRYTNTPANKRVELFDE-RGIN-ESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYN
        L + Y NT  NK+VELFD   G++ E T ++NVQM S +                              +QEL +   K MD++D AIE SK        
Subjt:  LERRYTNTPANKRVELFDE-RGIN-ESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYN

Query:  AHAFSFCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ
                           VV Q +E+G Q+A  L+GQ
Subjt:  AHAFSFCFGFVLFCFCSFTVVEQPVEMGRQSAFTLEGQ

AT1G49920.1 MuDR family transposase1.6e-3624.27Show/hide
Query:  FNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPK
        F   +E ++A+   SI        ++ +     V+C+   C W I ASR     L  I + +  H C               +   I+  ++V P     
Subjt:  FNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPK

Query:  DIADDIKREYGIQLNY-------SQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFT-TKDDSSFHRLFVSFHASISGFQQGCRPL
        ++    ++++G  L+             AK  A ++  G + +++  +P     +  +N         S T   + +SF  LF +F  SI GFQ  CRPL
Subjt:  DIADDIKREYGIQLNY-------SQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFT-TKDDSSFHRLFVSFHASISGFQQGCRPL

Query:  LFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSLGEIFDK-----SYHSYCLRHLAEKL
        + +D+  L  KY+     A+A D  +  FP AFAV    + ++W WFL  ++  V +R  I  ++     +   + E   +     +YH +CL HL  KL
Subjt:  LFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSLGEIFDK-----SYHSYCLRHLAEKL

Query:  NNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSN-----------------------FGQ
         +   G F +    F++++   A  +S+ E+F      I+  +P+A+ W+ Q  P  WA A   G RY  +  +                       FG 
Subjt:  NNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSN-----------------------FGQ

Query:  QFYSSISEAHELPITQMI--DVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVD----IVDVDNWDCSCKGWQ
        Q   + +E+ +L    +   DV    +ME +     +SD W+  +TP   +          ++QVS++      + G+S D    IV +++  C+C  +Q
Subjt:  QFYSSISEAHELPITQMI--DVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVD----IVDVDNWDCSCKGWQ

Query:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALV-TVTPPPTRRPPGRPKMKQNE
            PC HA+AV + +  +P  Y    +TVE Y  TY+    P+P +           L+  V  PP  +  G+ K K  E
Subjt:  LTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALV-TVTPPPTRRPPGRPKMKQNE

AT1G64255.1 MuDR family transposase1.3e-2722.93Show/hide
Query:  KCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI----KREYGIQLNYSQAWRAKEIAREQL
        +C    C W + A+R+    L+ I K    H+C     +        +      ++++ +  Y P     ++    K++ G +L  S    AKE A +++
Subjt:  KCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPKDIADDI----KREYGIQLNYSQAWRAKEIAREQL

Query:  QGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDA
         G + +++   P     +  +N          F   + +SF  +F +F  SI GFQ  CRPL+ +D+  LN +YQ     A+ VD  +  FP AFAV   
Subjt:  QGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTATAVDGEDGIFPAAFAVVDA

Query:  ETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSLGEIFDK-----SYHSYCLRHLAEKLNNDLKGQFSHEARRFMIN-DFYAAALASKLEDFQRCAA
         + + W WFL  ++  V +R  +  ++     +   + E   +     +YH + L H           QFS     F +      A   S+ ++F     
Subjt:  ETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSLGEIFDK-----SYHSYCLRHLAEKLNNDLKGQFSHEARRFMIN-DFYAAALASKLEDFQRCAA

Query:  SIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF-----YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQ-KE
         I+  +P+A  W+ Q     WA A   G RY  +  N    F     +          +  + D LR K  ++    R   +       P  ++  + + 
Subjt:  SIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF-----YSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQ-KE

Query:  ISIARSFQVSLSHGSTFEV-----RGESVDIVDVDNWDCSCKGWQLTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGES
          +  S+ V+    + F+V     +GE   IV + +  C+C  +Q    PC HA+AV + +  +P  Y    +T+E  + TYA     +P +        
Subjt:  ISIARSFQVSLSHGSTFEV-----RGESVDIVDVDNWDCSCKGWQLTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGES

Query:  TEALVTVTPPPTRRPPGRP
           L+    PP+  PP  P
Subjt:  TEALVTVTPPPTRRPPGRP

AT1G64260.1 MuDR family transposase1.9e-3424.28Show/hide
Query:  FNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPK
        F    E ++A+  + I        ++ +    T +C    C W + A+R+    L+ I K    H+C   + +        +  + I+  +++ P     
Subjt:  FNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYKPK

Query:  DIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPL
        ++    K + G +L  S+    K    +++ G   +++  +P        +N          F   D +SF  +F SF  SI GFQ  CRPL+ +D+  L
Subjt:  DIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETN----PGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPL

Query:  NSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSL---GEIFDK--SYHSYCLRHLAEKLNNDLKGQF
        N KYQ     A+ VD  +  FP AFAV    + ++W WF  +++  V +R ++  ++     +   +   G ++ +  ++H +CL H        L+ QF
Subjt:  NSKYQGFFFTATAVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAV-KRSEITFVADFQNGLKKSL---GEIFDK--SYHSYCLRHLAEKLNNDLKGQF

Query:  SHEARRFMINDFY-AAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF-----YSSISEAHELPITQMIDVLRG--
            R + +      A   ++ E+F      I+  +P+A+ W+ Q     WA A   G RY  I  +    F     +   + A    +  M D LR   
Subjt:  SHEARRFMINDFY-AAALASKLEDFQRCAASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQF-----YSSISEAHELPITQMIDVLRG--

Query:  -KMMETIY----RRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCHAIAVIECIGRSPYDYC
         K + +IY    R  V ++ +M KL     + +   I+        +S  S  E       IV ++   C+C+ +Q    PC HA+AV E +  +P  Y 
Subjt:  -KMMETIY----RRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGSTFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCHAIAVIECIGRSPYDYC

Query:  PRYFTVESYRLTYAESIHPIPNV
           +TVE Y  TYA +  P+P+V
Subjt:  PRYFTVESYRLTYAESIHPIPNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTGGAACGCAGGTATACGAATACTCCTGCTAATAAGAGGGTAGAGCTGTTTGATGAGAGAGGCATTAATGAATCTACGACTGATGATAATGTTCAGATGACATC
ATGTTTGCATTGCAATCCCAGGCTTGAGGACGTGACAATTGAAATATGTGTTGAAGTAATAAATATTAAATACGGTTTATTGTGGAGAGCAATAATAAGTCAAGAATTGA
ACAATGTTGAGAAGAAGACAATGGATGATAGTGATCCTGCGATTGAAAGCTCTAAAATGGTGACCAGTTTTTTTTATAATGCACATGCTTTCTCATTTTGTTTTGGTTTT
GTTTTGTTTTGTTTTTGTTCTTTTACCGTAGTTGAACAACCAGTTGAAATGGGAAGGCAATCTGCTTTTACATTGGAAGGCCAAGGTCACCCTGGATTACTTCCCTTAGC
TCATGATCATGTTAAGGATGGTAAATGGCGTATCGGGTTTGCTCTTTATCAGTTTTCTAGCCGATGGAGGTCCTTTTTTAATTCCTATTCCTTACATTTGAGGATTCTTT
ATTCTCCTATTCCATTCTATAGGGCTAAGGGGTCATGGGTTCAATCCATGGTGGCCACCTACCTAGGATTTAATATCCTACGGGTTTCCTTGACGCCCAAATATTGTAGA
GTCAAACAGTTTCTTATACTTCTATACCCTAAAATTAATGAAGTTATTTTTTCTATGGGTGTAATTAACGGTTTCTTAAGGTATGCATCTATAAAGATAAACATTTGCAT
TTTAATATCTCATGGATTGTGCTTTTTATTCACCCGAGCATTTAAAATGTCAAGTCGAACAACCGTTTCTGAGGCAGTAGTTCCTGTAGTTGAGCCTGTAGATGTTGCAG
TTGAGACTATCATCGCCATTGATCAGATTGGTATGGACATATCCAATGAAATCCCTTTAATGTGTGTTCCTGCAGGGTCCAGTGATGAAAAACATCGAAAAGCTGCACAA
CAATGGGAAAATGCCATTATTGGTGTAGACCAAAGATTTAACAGCTTTAGTGAATTCCGAGAAGCCTTGCATAAGTACTCAATTGCGCATGGTTTTGCTTACCGGTATAA
GAAAAATGACAGCCATCGTGTCACCGTTAAATGCAAATACCAGGGTTGTCCATGGCGAATATATGCTTCAAGGCTGTCTACTACCCAACTGATTTGTATTAAGAAAATGA
ACACAAATCATTCATGTGAAGGGGCTGCTGCTAAAGCTGGGTATCGGGCTACTAGGGGATGGGTGGGAAATATCATTAAGGAAAAATTAAAAGTCTCCCCAAACTACAAA
CCAAAAGATATTGCAGATGACATAAAACGTGAATATGGAATTCAACTGAACTATTCACAGGCATGGCGTGCGAAAGAGATAGCAAGAGAGCAGCTTCAAGGCTCCTACAA
GGAAGCATATAATCAGTTGCCATACTTTTGTGAGAAGATCAAAGAAACTAACCCTGGAAGTGTTGCTTCATTCACCACTAAAGATGACTCGAGCTTTCATCGTCTATTTG
TATCATTTCATGCATCAATTTCTGGTTTCCAGCAAGGTTGTCGACCTCTCCTTTTCCTTGATAGTACTCCATTGAACTCAAAGTATCAGGGTTTTTTTTTTACTGCCACA
GCTGTAGATGGGGAAGATGGCATATTTCCAGCAGCTTTCGCTGTTGTAGATGCCGAGACAGAGGAAAATTGGCATTGGTTTTTACTGGAACTGAAATCTGCAGTGAAAAG
GTCAGAGATTACATTTGTAGCAGATTTTCAGAATGGACTGAAGAAGTCTTTGGGTGAAATATTTGACAAGTCATACCATAGCTACTGTTTGCGCCATCTAGCTGAAAAGC
TTAACAACGATTTAAAGGGCCAATTTTCTCATGAGGCCAGGCGATTTATGATTAATGATTTCTATGCTGCTGCTCTTGCATCTAAACTTGAGGATTTCCAGCGCTGTGCT
GCAAGCATAAGAGGAATTTCACCTGATGCTTACAATTGGATCATACAAAGTCAACCAGAGCATTGGGCAAATGCATTCTTTGGAGGAGCAAGGTATAACCATATTACATC
AAATTTTGGACAGCAGTTCTACAGTTCCATATCTGAAGCACATGAATTGCCTATAACACAGATGATTGATGTTTTACGGGGAAAAATGATGGAAACAATATATAGACGCA
GGGTTGAATCAGACCAGTGGATGACAAAATTAACACCAGCCAATGAGGAAAAGTTACAAAAAGAAATATCAATTGCGCGGTCATTTCAGGTTTCACTATCCCATGGAAGC
ACATTTGAGGTTCGTGGGGAATCTGTCGACATTGTTGATGTTGATAATTGGGATTGTAGCTGTAAGGGATGGCAGCTTACTGGTTTGCCTTGTTGCCATGCTATTGCTGT
CATTGAATGCATTGGTAGGAGCCCATATGATTATTGCCCCAGATATTTCACAGTTGAGAGTTACCGCCTAACTTATGCTGAGTCAATACATCCTATTCCAAATGTGGACA
GACTGATCCTAGGTGAATCCACTGAGGCACTAGTCACAGTAACCCCTCCACCTACTAGAAGACCACCAGGTCGTCCAAAGATGAAGCAGAATGAATCATTAGAAGTAGTT
AAACGTCAGCTCCAATGTAGCAAGTGTAAAGGCCTTGGCCACAATAAGAAGACATGCAAAGATTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCTGGAACGCAGGTATACGAATACTCCTGCTAATAAGAGGGTAGAGCTGTTTGATGAGAGAGGCATTAATGAATCTACGACTGATGATAATGTTCAGATGACATC
ATGTTTGCATTGCAATCCCAGGCTTGAGGACGTGACAATTGAAATATGTGTTGAAGTAATAAATATTAAATACGGTTTATTGTGGAGAGCAATAATAAGTCAAGAATTGA
ACAATGTTGAGAAGAAGACAATGGATGATAGTGATCCTGCGATTGAAAGCTCTAAAATGGTGACCAGTTTTTTTTATAATGCACATGCTTTCTCATTTTGTTTTGGTTTT
GTTTTGTTTTGTTTTTGTTCTTTTACCGTAGTTGAACAACCAGTTGAAATGGGAAGGCAATCTGCTTTTACATTGGAAGGCCAAGGTCACCCTGGATTACTTCCCTTAGC
TCATGATCATGTTAAGGATGGTAAATGGCGTATCGGGTTTGCTCTTTATCAGTTTTCTAGCCGATGGAGGTCCTTTTTTAATTCCTATTCCTTACATTTGAGGATTCTTT
ATTCTCCTATTCCATTCTATAGGGCTAAGGGGTCATGGGTTCAATCCATGGTGGCCACCTACCTAGGATTTAATATCCTACGGGTTTCCTTGACGCCCAAATATTGTAGA
GTCAAACAGTTTCTTATACTTCTATACCCTAAAATTAATGAAGTTATTTTTTCTATGGGTGTAATTAACGGTTTCTTAAGGTATGCATCTATAAAGATAAACATTTGCAT
TTTAATATCTCATGGATTGTGCTTTTTATTCACCCGAGCATTTAAAATGTCAAGTCGAACAACCGTTTCTGAGGCAGTAGTTCCTGTAGTTGAGCCTGTAGATGTTGCAG
TTGAGACTATCATCGCCATTGATCAGATTGGTATGGACATATCCAATGAAATCCCTTTAATGTGTGTTCCTGCAGGGTCCAGTGATGAAAAACATCGAAAAGCTGCACAA
CAATGGGAAAATGCCATTATTGGTGTAGACCAAAGATTTAACAGCTTTAGTGAATTCCGAGAAGCCTTGCATAAGTACTCAATTGCGCATGGTTTTGCTTACCGGTATAA
GAAAAATGACAGCCATCGTGTCACCGTTAAATGCAAATACCAGGGTTGTCCATGGCGAATATATGCTTCAAGGCTGTCTACTACCCAACTGATTTGTATTAAGAAAATGA
ACACAAATCATTCATGTGAAGGGGCTGCTGCTAAAGCTGGGTATCGGGCTACTAGGGGATGGGTGGGAAATATCATTAAGGAAAAATTAAAAGTCTCCCCAAACTACAAA
CCAAAAGATATTGCAGATGACATAAAACGTGAATATGGAATTCAACTGAACTATTCACAGGCATGGCGTGCGAAAGAGATAGCAAGAGAGCAGCTTCAAGGCTCCTACAA
GGAAGCATATAATCAGTTGCCATACTTTTGTGAGAAGATCAAAGAAACTAACCCTGGAAGTGTTGCTTCATTCACCACTAAAGATGACTCGAGCTTTCATCGTCTATTTG
TATCATTTCATGCATCAATTTCTGGTTTCCAGCAAGGTTGTCGACCTCTCCTTTTCCTTGATAGTACTCCATTGAACTCAAAGTATCAGGGTTTTTTTTTTACTGCCACA
GCTGTAGATGGGGAAGATGGCATATTTCCAGCAGCTTTCGCTGTTGTAGATGCCGAGACAGAGGAAAATTGGCATTGGTTTTTACTGGAACTGAAATCTGCAGTGAAAAG
GTCAGAGATTACATTTGTAGCAGATTTTCAGAATGGACTGAAGAAGTCTTTGGGTGAAATATTTGACAAGTCATACCATAGCTACTGTTTGCGCCATCTAGCTGAAAAGC
TTAACAACGATTTAAAGGGCCAATTTTCTCATGAGGCCAGGCGATTTATGATTAATGATTTCTATGCTGCTGCTCTTGCATCTAAACTTGAGGATTTCCAGCGCTGTGCT
GCAAGCATAAGAGGAATTTCACCTGATGCTTACAATTGGATCATACAAAGTCAACCAGAGCATTGGGCAAATGCATTCTTTGGAGGAGCAAGGTATAACCATATTACATC
AAATTTTGGACAGCAGTTCTACAGTTCCATATCTGAAGCACATGAATTGCCTATAACACAGATGATTGATGTTTTACGGGGAAAAATGATGGAAACAATATATAGACGCA
GGGTTGAATCAGACCAGTGGATGACAAAATTAACACCAGCCAATGAGGAAAAGTTACAAAAAGAAATATCAATTGCGCGGTCATTTCAGGTTTCACTATCCCATGGAAGC
ACATTTGAGGTTCGTGGGGAATCTGTCGACATTGTTGATGTTGATAATTGGGATTGTAGCTGTAAGGGATGGCAGCTTACTGGTTTGCCTTGTTGCCATGCTATTGCTGT
CATTGAATGCATTGGTAGGAGCCCATATGATTATTGCCCCAGATATTTCACAGTTGAGAGTTACCGCCTAACTTATGCTGAGTCAATACATCCTATTCCAAATGTGGACA
GACTGATCCTAGGTGAATCCACTGAGGCACTAGTCACAGTAACCCCTCCACCTACTAGAAGACCACCAGGTCGTCCAAAGATGAAGCAGAATGAATCATTAGAAGTAGTT
AAACGTCAGCTCCAATGTAGCAAGTGTAAAGGCCTTGGCCACAATAAGAAGACATGCAAAGATTCTTAGCTTCATTTTAGGAATTCCAAATAGAAGTTTGTGGAAGTAAG
ATATGCCAGAGTCAGAATGGAAGTAATTCAACTGGTTTAAGGTATATATTCTCGATCAAAAGATCATGAGCCTATGAGCATGGGATCGCTGGGAATGGCGTGGATATGGA
GGAGAGAATGGAAACTTGTCAATATGGGCGACAAGTGTGAAGAAATTTGCTAAGGTATCTACTTGAAGCCCCGCAACCAGCGAAAAATTATTTGAGGAGAGGTTGGCATA
GTGTGGTGGATGGTTTTATATCGACAAACGTAGAAGAGAGAACTTAATTATTATACTTTACTCATACATCCAAGTATTTGAATGATGACCACATTAAATAATTTTTTGGT
TGATGAATTCAAATACATGCGAGGATGAAACTTCTTTATTGTGATTATTTTATCAACAGAATAATCTTGTTATCTTTATGAAGGACTTTGTAACATGCCAAATTTCTTTC
TTGAACAAATTCAATGAGTGGCCAAATATTCCACTCTTAAGTTGTACCAAGCTTGTTCTATTGTCTCTTTCTCCTGACACAGGCGATTTTGATGGCACTTTACTTTTCAA
TAAAGCAGTACAAATCTCAATGAGTGTCCACTGTTTCAATTTCTCTCAAGTGTCTTTCCTTTGCGAGGTCTTGAAGTGGTTGGACATTTTTCTTGAGCTGTC
Protein sequenceShow/hide protein sequence
MDLERRYTNTPANKRVELFDERGINESTTDDNVQMTSCLHCNPRLEDVTIEICVEVINIKYGLLWRAIISQELNNVEKKTMDDSDPAIESSKMVTSFFYNAHAFSFCFGF
VLFCFCSFTVVEQPVEMGRQSAFTLEGQGHPGLLPLAHDHVKDGKWRIGFALYQFSSRWRSFFNSYSLHLRILYSPIPFYRAKGSWVQSMVATYLGFNILRVSLTPKYCR
VKQFLILLYPKINEVIFSMGVINGFLRYASIKINICILISHGLCFLFTRAFKMSSRTTVSEAVVPVVEPVDVAVETIIAIDQIGMDISNEIPLMCVPAGSSDEKHRKAAQ
QWENAIIGVDQRFNSFSEFREALHKYSIAHGFAYRYKKNDSHRVTVKCKYQGCPWRIYASRLSTTQLICIKKMNTNHSCEGAAAKAGYRATRGWVGNIIKEKLKVSPNYK
PKDIADDIKREYGIQLNYSQAWRAKEIAREQLQGSYKEAYNQLPYFCEKIKETNPGSVASFTTKDDSSFHRLFVSFHASISGFQQGCRPLLFLDSTPLNSKYQGFFFTAT
AVDGEDGIFPAAFAVVDAETEENWHWFLLELKSAVKRSEITFVADFQNGLKKSLGEIFDKSYHSYCLRHLAEKLNNDLKGQFSHEARRFMINDFYAAALASKLEDFQRCA
ASIRGISPDAYNWIIQSQPEHWANAFFGGARYNHITSNFGQQFYSSISEAHELPITQMIDVLRGKMMETIYRRRVESDQWMTKLTPANEEKLQKEISIARSFQVSLSHGS
TFEVRGESVDIVDVDNWDCSCKGWQLTGLPCCHAIAVIECIGRSPYDYCPRYFTVESYRLTYAESIHPIPNVDRLILGESTEALVTVTPPPTRRPPGRPKMKQNESLEVV
KRQLQCSKCKGLGHNKKTCKDS