; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G01830 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G01830
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionARM repeat superfamily protein
Genome locationChr5:2492445..2496261
RNA-Seq ExpressionCSPI05G01830
SyntenyCSPI05G01830
Gene Ontology termsGO:0016180 - snRNA processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032039 - integrator complex (cellular component)
InterPro domainsIPR016024 - Armadillo-type fold
IPR033060 - Integrator complex subunit 7


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464722.1 PREDICTED: uncharacterized protein LOC103502541 isoform X1 [Cucumis melo]0.0e+0090.06Show/hide
Query:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPT
Subjt:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF K
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFC
Subjt:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
        TGFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSS
Subjt:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD
        ELKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVD
Subjt:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD

Query:  SEGCFWNLLPLNSGPLLTIHQLSSAG
        SEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  SEGCFWNLLPLNSGPLLTIHQLSSAG

XP_008464724.1 PREDICTED: uncharacterized protein LOC103502541 isoform X3 [Cucumis melo]0.0e+0090.06Show/hide
Query:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPT
Subjt:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF K
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFC
Subjt:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
        TGFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSS
Subjt:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD
        ELKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVD
Subjt:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD

Query:  SEGCFWNLLPLNSGPLLTIHQLSSAG
        SEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  SEGCFWNLLPLNSGPLLTIHQLSSAG

XP_008464725.1 PREDICTED: uncharacterized protein LOC103502541 isoform X4 [Cucumis melo]0.0e+0090.59Show/hide
Query:  IALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTS
        +ALPGTQAGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPTS
Subjt:  IALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTS

Query:  SHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLP
        SHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DLP
Subjt:  SHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLP

Query:  QSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKV
        QSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF KV
Subjt:  QSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKV

Query:  KLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFG
        KLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIFG
Subjt:  KLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFG

Query:  HLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQS
        HLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQS
Subjt:  HLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQS

Query:  FCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCT
        FCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFCT
Subjt:  FCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCT

Query:  GFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSG
        GFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSSG
Subjt:  GFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSG

Query:  VDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQE
        VDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQE
Subjt:  VDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQE

Query:  LKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDS
        LKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVDS
Subjt:  LKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDS

Query:  EGCFWNLLPLNSGPLLTIHQLSSAG
        EGCFWNLLPLNSGPL TIHQL SAG
Subjt:  EGCFWNLLPLNSGPLLTIHQLSSAG

XP_011654518.1 uncharacterized protein LOC101204851 isoform X1 [Cucumis sativus]0.0e+0098.81Show/hide
Query:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
Subjt:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICN DGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY DKLLETYQCLCSSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
Subjt:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
        TGFAFH PDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
Subjt:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC
        ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES+KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC
Subjt:  ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC

Query:  FWNLLPLNSGPLLTIHQLSSAG
        FWNLLPLNSGPLLTIHQLSSAG
Subjt:  FWNLLPLNSGPLLTIHQLSSAG

XP_011654519.1 uncharacterized protein LOC101204851 isoform X2 [Cucumis sativus]0.0e+0098.54Show/hide
Query:  SKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVK
        S S+  ++ +VQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVK
Subjt:  SKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVK

Query:  AVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLE
        AVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLE
Subjt:  AVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLE

Query:  KICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSC
        KICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSC
Subjt:  KICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSC

Query:  RIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLL
        RIAEVFRICN DGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLL
Subjt:  RIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLL

Query:  PNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSIS
        PNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY DKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSIS
Subjt:  PNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSIS

Query:  TDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNL
        TDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFH PDLATTLMTKNVDDFRTKLHAILIQNL
Subjt:  TDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNL

Query:  ISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPK
        ISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPK
Subjt:  ISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPK

Query:  CFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES
        CFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES
Subjt:  CFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES

Query:  SKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        +KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
Subjt:  SKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

TrEMBL top hitse value%identityAlignment
A0A0A0KMS4 Uncharacterized protein0.0e+0099.47Show/hide
Query:  MFCCVALLFDYFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK
        MFCCVALLFDYFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK
Subjt:  MFCCVALLFDYFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK

Query:  GAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS
        GAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS
Subjt:  GAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS

Query:  FSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAF
        FSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAF
Subjt:  FSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAF

Query:  ILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVV
        ILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICN DGFPHFTFCEDLTENEIFMLECAKKLLVV
Subjt:  ILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVV

Query:  GDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYR
        GDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY 
Subjt:  GDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYR

Query:  DKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGT
        DKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGT
Subjt:  DKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGT

Query:  TFIGMDTKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDM
        TFIGMDTKSSNVISALALNCSLLAFCTGFAFH PDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDM
Subjt:  TFIGMDTKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDM

Query:  GYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQ
        GYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQ
Subjt:  GYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQ

Query:  LKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNC
        LKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES+KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNC
Subjt:  LKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNC

Query:  LFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        LFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
Subjt:  LFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

A0A1S3CM44 uncharacterized protein LOC103502541 isoform X40.0e+0090.59Show/hide
Query:  IALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTS
        +ALPGTQAGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPTS
Subjt:  IALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTS

Query:  SHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLP
        SHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DLP
Subjt:  SHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLP

Query:  QSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKV
        QSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF KV
Subjt:  QSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKV

Query:  KLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFG
        KLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIFG
Subjt:  KLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFG

Query:  HLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQS
        HLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQS
Subjt:  HLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQS

Query:  FCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCT
        FCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFCT
Subjt:  FCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCT

Query:  GFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSG
        GFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSSG
Subjt:  GFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSG

Query:  VDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQE
        VDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQE
Subjt:  VDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQE

Query:  LKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDS
        LKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVDS
Subjt:  LKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDS

Query:  EGCFWNLLPLNSGPLLTIHQLSSAG
        EGCFWNLLPLNSGPL TIHQL SAG
Subjt:  EGCFWNLLPLNSGPLLTIHQLSSAG

A0A1S3CM96 uncharacterized protein LOC103502541 isoform X30.0e+0090.06Show/hide
Query:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPT
Subjt:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF K
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFC
Subjt:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
        TGFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSS
Subjt:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD
        ELKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVD
Subjt:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD

Query:  SEGCFWNLLPLNSGPLLTIHQLSSAG
        SEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  SEGCFWNLLPLNSGPLLTIHQLSSAG

A0A1S3CMM3 uncharacterized protein LOC103502541 isoform X10.0e+0090.06Show/hide
Query:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMESVVKILIDALDEHMLPT
Subjt:  SIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVISLIMDQIASLAKMF+DL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+ +LGQ +SIT EIF K
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AGRHAACHGSWF+ATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL SSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS NVISALALNCSLLAFC
Subjt:  SFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS
        TGFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTLCRYAISEFI MQSKSS
Subjt:  TGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSVQI KMYCILYCG SFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD
        ELKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH+PVGCYRIKWYSCCVD
Subjt:  ELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVD

Query:  SEGCFWNLLPLNSGPLLTIHQLSSAG
        SEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  SEGCFWNLLPLNSGPLLTIHQLSSAG

A0A1S3CNQ5 uncharacterized protein LOC103502541 isoform X20.0e+0088.85Show/hide
Query:  YFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMES
        Y +A   ++  L    S+A    +AGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFMKGA QFVNMES
Subjt:  YFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMES

Query:  VVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVIS
        VVKILIDALDEHMLPTSSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCS SLLP+RVIS
Subjt:  VVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVIS

Query:  LIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISI
        LIMDQIASLAKMF+DLPQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+FILYGFVAIS+
Subjt:  LIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISI

Query:  AHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAG
         +LGQ +SIT EIF KVKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICN +GFPHFTFCEDLTENEIF LECAKKLL VGDEWPAY AG
Subjt:  AHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAG

Query:  RHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCL
        RHAACHGSWF+ATLIFGHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY  KL ETYQCL
Subjt:  RHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCL

Query:  CSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSS
         SSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIGTTFIGMDTKS 
Subjt:  CSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSS

Query:  NVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTL
        NVISALALNCSLLAFCTGFAFH PDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD+GYEVRGILTL
Subjt:  NVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTL

Query:  CRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSV
        CRYAISEFI MQSKSSGVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCLQLKNIT NMSV
Subjt:  CRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSV

Query:  QIPKMYCILYCGLSFQELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSH
        QI KMYCILYCG SFQELKH    NG+NHQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQGFSNCLFDVSH
Subjt:  QIPKMYCILYCGLSFQELKH----NGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSH

Query:  FPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        +PVGCYRIKWYSCCVDSEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  FPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G20060.1 ARM repeat superfamily protein3.1e-13132.67Show/hide
Query:  LAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVV
        LA V ++  +    +IA    +  ++L  D+  E  LV  L SL+KLAS+S  ++SE  + +  FL   K+ H R   LRCL F+  +G    +  E  +
Subjt:  LAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVV

Query:  KILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLI
          +   L +  L +     AL++ QKI+ Y        DA++   L+   ENA+ S +     LA  VLV +  ++    E  S   S + LP +++ LI
Subjt:  KILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLI

Query:  MDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHED--AFDGQQRDVDFEVN--EKNDISLRFAFI--LYGFV
        MD++A L ++  DL ++ Y V  E+++LL +L L+V + S+L +L+LEK+ L    I+ +++     DG    +   +N  +K  + +R  F+  ++ F+
Subjt:  MDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHED--AFDGQQRDVDFEVN--EKNDISLRFAFI--LYGFV

Query:  AISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPA
         + + +L  D ++ SEI+ KVK +   V        HT +I++LLL+   +        F + +  G    +   D+    I  L+C+ ++L+  + WPA
Subjt:  AISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPA

Query:  YKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLET
        Y+AG +AA  G+W ++ +IF  L + V+SD+   WLKSL   + AE K Q LL P+   +L  WL+    L       +++   +G     +    L E 
Subjt:  YKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLET

Query:  YQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMD
        Y  L SS+ +L    +     FCFQ WFL L+ ++L TV  +++ L  ++  +           TG   +S+ +  ++S++L++L++EFD++ T FI +D
Subjt:  YQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMD

Query:  TKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTK--LHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEV
          SS++I+ ++L+CS+LAF  G     P  +     + +  F ++  L + L+++L+ RL  VD    + L  L   T    NC HL SR ++L +  +V
Subjt:  TKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTK--LHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEV

Query:  RGILTLCRYAISEFIHMQSKSSGVDKGTFL-QVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFA---TTDVRKLDEISIPYGFHLSLNLCLQ
        + +L++CR A+S    +Q++S  + K   + ++ +     LS  +M+W++IPF +PK FF +RPC+G ELFA    +  R  D +S+  GF LSL+LCLQ
Subjt:  RGILTLCRYAISEFIHMQSKSSGVDKGTFL-QVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFA---TTDVRKLDEISIPYGFHLSLNLCLQ

Query:  LKNITQ-NMSVQIPKMYCILYCGLSFQELKHNGQNHQ---VCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG
        LKNI Q  + V++ K+YC+LY  L++     +G+N++       W ++D++EM NKL H+  +S K     G+   +        V   VQFEP+E+GQG
Subjt:  LKNITQ-NMSVQIPKMYCILYCGLSFQELKHNGQNHQ---VCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG

Query:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLS
        FS+CL DVS FPVG Y+IKW SCCVD  G +WNLLPLN  P+ T+ + S
Subjt:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTGCTGTGTAGCATTATTATTTGATTATTTTCTTGCTGTGGTGAACATTTGGCAGATTCTTTGGGGGATTACCTCTATTGCTTTACCCGGCACTCAGGCTGGACT
TGAGCTTGCCTCAGATACTAGTGATGAGAGTTTTTTGGTTGCAATGTTATTTTCACTATCCAAACTAGCTTCCAAGTCGATATTTATTAGTTCTGAGCAGGTGCAATTTC
TCTGCTCATTTCTTAGCCACAAAAAGTCTGTGCATGTTCGAGAAAAATCTTTAAGGTGTCTGTGTTTTATTTTCATGAAAGGAGCATTTCAGTTTGTTAATATGGAATCT
GTGGTTAAAATTTTAATTGATGCACTAGATGAACACATGCTTCCAACTTCTTCACATTGTGATGCTCTACGACTGTTGCAAAAGATTCTTTTCTATGTGCGGCCAAACAC
TTCATTTTTGGATGCAAACAAATACTCTAACCTGGTTAAAGCTGTGGAGAATGCAGCCCAATCTCCAGTGAAGTTAAAGAGGCTCCTTGCTTTCCAAGTGTTGGTGCATT
TATCCTTGCAGCTTTCCGGAAAAATGGAAGTAGAATCAGGAGTTTGTTCATTTTCTTTGTTGCCAGCACGGGTTATTTCACTAATCATGGATCAAATTGCCTCATTGGCG
AAGATGTTCGTAGATCTTCCTCAGTCAAATTATGAAGTGTTTCTAGAAATTAAAGAGCTGCTAAACCTTCTCCTGCTCATTGTAAGAGAACAGTCAGATTTGTGGATTTT
GCTTTTGGAAAAAATATGTTTAACTGCTGCATTAATTATGAAGATGCATGAAGATGCCTTTGATGGTCAGCAAAGAGACGTAGACTTTGAAGTAAACGAGAAAAATGATA
TCAGCTTGAGATTTGCATTCATTCTATATGGTTTTGTGGCAATCTCCATTGCTCATCTAGGTCAAGATGTCTCTATCACGTCTGAAATATTTGGCAAGGTGAAACTATTG
GTCAACAGTGTATGCAAAAGCTGTTTGTTTAGTAGCCATACTTGCATAATCTATTCCTTGCTATTGAACTGTAAATTCGTTTTGAGTTGTAGGATAGCTGAGGTTTTTAG
GATTTGCAATAAGGATGGGTTTCCACATTTCACTTTTTGTGAAGATTTGACTGAAAACGAGATTTTTATGCTCGAGTGTGCTAAGAAGTTACTAGTAGTTGGGGATGAAT
GGCCTGCTTACAAGGCTGGGAGACATGCAGCATGTCATGGATCATGGTTTTCTGCTACCTTGATTTTTGGCCATTTAATTTCAAAGGTTCGCTCTGATGTCTTCCATTAC
TGGTTGAAATCTTTGTTTCAGTTTGCTCTTGCGGAAAGAAAAATCCAATTTCTACTGTTACCAAACTATGGTTTTCGCTTGGCAATCTGGTTAGAGAAGGAAGCAAGTCT
AAACATGTTTCACATTGAAGAACAAATAAACCATCACCACACTGGGAGCATCACCGAGGGCATTTACCGTGACAAGCTTTTGGAGACCTACCAGTGTCTTTGCTCTTCAG
TTGAGGTCTTAAAAGCAGCTGCTGTCCCACCGGTTCAGTCATTTTGTTTCCAGAGATGGTTTTTGTCTTTAAGAGCTAAGATTTTAGGAACTGTGGGGAGCATACTGAAG
CAATTGCTAAACGTTTCATGTAGTATTAGTACTGACTATGGTAAGCTTGGGACAAACGATACCGGTATTTTCCTTGAATCTGTGAACGAATTTGGTAAATTATCTTTAAA
GTTAGAGAGGCTGTCCCGTGAATTTGATCTCATTGGAACGACTTTTATTGGGATGGACACCAAGAGTTCGAACGTTATTTCAGCCCTTGCACTGAATTGCTCTTTGTTAG
CCTTTTGTACTGGTTTTGCCTTTCATGATCCTGACTTGGCTACAACTCTTATGACCAAAAATGTGGATGATTTCAGAACTAAATTACATGCAATACTCATTCAAAATCTG
ATTAGCAGGCTGCAGCTGGTAGACGATGAAACTAGTAAAATGCTTGCACAGCTTTTTGAGGTCACTGGACCACCAAATAACTGCTCGCATTTGGTTTCAAGAGGCAAAAT
ATTAGACATGGGGTACGAAGTAAGAGGCATCTTAACTCTCTGTAGGTATGCCATTTCTGAGTTTATCCATATGCAAAGCAAATCAAGCGGAGTGGATAAGGGGACATTTC
TCCAGGTTATCGAGGATGGCATGCAGTTTTTATCAAATATTGTTATGCAGTGGATACGCATTCCATTTCGAGTGCCTAAGTGTTTCTTTTGTGTAAGGCCTTGCATTGGC
TGTGAACTCTTTGCCACTACTGATGTGCGTAAACTGGATGAAATATCTATCCCATATGGCTTCCATCTGTCATTAAATCTTTGTCTTCAACTCAAAAACATTACTCAAAA
TATGTCAGTTCAAATCCCCAAGATGTACTGCATTTTGTATTGTGGTTTGTCCTTCCAGGAACTAAAGCACAATGGGCAAAATCATCAGGTTTGTGAAGCTTGGGAAAACG
ACGACGTTGTAGAAATGCATAACAAACTGTTGCATTATGTGACTGAGTCGAGCAAAAACGAGGCTTATATAGGCAAGTGCAGAACATCAAGTGTTCGTAAAACTGATAGG
GACGTTGAAGTGTTTGTGCAGTTCGAGCCAGATGAGAAAGGGCAAGGATTCTCAAATTGTTTGTTTGATGTATCTCATTTTCCTGTAGGTTGTTATAGAATCAAATGGTA
TAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAACCTCCTTCCTTTGAATTCTGGACCATTACTTACTATCCATCAACTTTCATCAGCTGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTGCTGTGTAGCATTATTATTTGATTATTTTCTTGCTGTGGTGAACATTTGGCAGATTCTTTGGGGGATTACCTCTATTGCTTTACCCGGCACTCAGGCTGGACT
TGAGCTTGCCTCAGATACTAGTGATGAGAGTTTTTTGGTTGCAATGTTATTTTCACTATCCAAACTAGCTTCCAAGTCGATATTTATTAGTTCTGAGCAGGTGCAATTTC
TCTGCTCATTTCTTAGCCACAAAAAGTCTGTGCATGTTCGAGAAAAATCTTTAAGGTGTCTGTGTTTTATTTTCATGAAAGGAGCATTTCAGTTTGTTAATATGGAATCT
GTGGTTAAAATTTTAATTGATGCACTAGATGAACACATGCTTCCAACTTCTTCACATTGTGATGCTCTACGACTGTTGCAAAAGATTCTTTTCTATGTGCGGCCAAACAC
TTCATTTTTGGATGCAAACAAATACTCTAACCTGGTTAAAGCTGTGGAGAATGCAGCCCAATCTCCAGTGAAGTTAAAGAGGCTCCTTGCTTTCCAAGTGTTGGTGCATT
TATCCTTGCAGCTTTCCGGAAAAATGGAAGTAGAATCAGGAGTTTGTTCATTTTCTTTGTTGCCAGCACGGGTTATTTCACTAATCATGGATCAAATTGCCTCATTGGCG
AAGATGTTCGTAGATCTTCCTCAGTCAAATTATGAAGTGTTTCTAGAAATTAAAGAGCTGCTAAACCTTCTCCTGCTCATTGTAAGAGAACAGTCAGATTTGTGGATTTT
GCTTTTGGAAAAAATATGTTTAACTGCTGCATTAATTATGAAGATGCATGAAGATGCCTTTGATGGTCAGCAAAGAGACGTAGACTTTGAAGTAAACGAGAAAAATGATA
TCAGCTTGAGATTTGCATTCATTCTATATGGTTTTGTGGCAATCTCCATTGCTCATCTAGGTCAAGATGTCTCTATCACGTCTGAAATATTTGGCAAGGTGAAACTATTG
GTCAACAGTGTATGCAAAAGCTGTTTGTTTAGTAGCCATACTTGCATAATCTATTCCTTGCTATTGAACTGTAAATTCGTTTTGAGTTGTAGGATAGCTGAGGTTTTTAG
GATTTGCAATAAGGATGGGTTTCCACATTTCACTTTTTGTGAAGATTTGACTGAAAACGAGATTTTTATGCTCGAGTGTGCTAAGAAGTTACTAGTAGTTGGGGATGAAT
GGCCTGCTTACAAGGCTGGGAGACATGCAGCATGTCATGGATCATGGTTTTCTGCTACCTTGATTTTTGGCCATTTAATTTCAAAGGTTCGCTCTGATGTCTTCCATTAC
TGGTTGAAATCTTTGTTTCAGTTTGCTCTTGCGGAAAGAAAAATCCAATTTCTACTGTTACCAAACTATGGTTTTCGCTTGGCAATCTGGTTAGAGAAGGAAGCAAGTCT
AAACATGTTTCACATTGAAGAACAAATAAACCATCACCACACTGGGAGCATCACCGAGGGCATTTACCGTGACAAGCTTTTGGAGACCTACCAGTGTCTTTGCTCTTCAG
TTGAGGTCTTAAAAGCAGCTGCTGTCCCACCGGTTCAGTCATTTTGTTTCCAGAGATGGTTTTTGTCTTTAAGAGCTAAGATTTTAGGAACTGTGGGGAGCATACTGAAG
CAATTGCTAAACGTTTCATGTAGTATTAGTACTGACTATGGTAAGCTTGGGACAAACGATACCGGTATTTTCCTTGAATCTGTGAACGAATTTGGTAAATTATCTTTAAA
GTTAGAGAGGCTGTCCCGTGAATTTGATCTCATTGGAACGACTTTTATTGGGATGGACACCAAGAGTTCGAACGTTATTTCAGCCCTTGCACTGAATTGCTCTTTGTTAG
CCTTTTGTACTGGTTTTGCCTTTCATGATCCTGACTTGGCTACAACTCTTATGACCAAAAATGTGGATGATTTCAGAACTAAATTACATGCAATACTCATTCAAAATCTG
ATTAGCAGGCTGCAGCTGGTAGACGATGAAACTAGTAAAATGCTTGCACAGCTTTTTGAGGTCACTGGACCACCAAATAACTGCTCGCATTTGGTTTCAAGAGGCAAAAT
ATTAGACATGGGGTACGAAGTAAGAGGCATCTTAACTCTCTGTAGGTATGCCATTTCTGAGTTTATCCATATGCAAAGCAAATCAAGCGGAGTGGATAAGGGGACATTTC
TCCAGGTTATCGAGGATGGCATGCAGTTTTTATCAAATATTGTTATGCAGTGGATACGCATTCCATTTCGAGTGCCTAAGTGTTTCTTTTGTGTAAGGCCTTGCATTGGC
TGTGAACTCTTTGCCACTACTGATGTGCGTAAACTGGATGAAATATCTATCCCATATGGCTTCCATCTGTCATTAAATCTTTGTCTTCAACTCAAAAACATTACTCAAAA
TATGTCAGTTCAAATCCCCAAGATGTACTGCATTTTGTATTGTGGTTTGTCCTTCCAGGAACTAAAGCACAATGGGCAAAATCATCAGGTTTGTGAAGCTTGGGAAAACG
ACGACGTTGTAGAAATGCATAACAAACTGTTGCATTATGTGACTGAGTCGAGCAAAAACGAGGCTTATATAGGCAAGTGCAGAACATCAAGTGTTCGTAAAACTGATAGG
GACGTTGAAGTGTTTGTGCAGTTCGAGCCAGATGAGAAAGGGCAAGGATTCTCAAATTGTTTGTTTGATGTATCTCATTTTCCTGTAGGTTGTTATAGAATCAAATGGTA
TAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAACCTCCTTCCTTTGAATTCTGGACCATTACTTACTATCCATCAACTTTCATCAGCTGGGTGATTTTTCATCAAAC
AATGAAGTTTTACAGGCGATGGTAGTCTTTTCTCCCGTAGGTAAGAGAGGTTTGTATATAGAAAACATATATCAGTGCATCTTAGATTATATTAATCTTTGTGTATCTTA
TTCCGTGAAGAACTATTTTTTCTTGTTAAAATATTTTTATTTTCTTTTATGCGC
Protein sequenceShow/hide protein sequence
MFCCVALLFDYFLAVVNIWQILWGITSIALPGTQAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMES
VVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLA
KMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKVKLL
VNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNKDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHY
WLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYRDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQRWFLSLRAKILGTVGSILK
QLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHDPDLATTLMTKNVDDFRTKLHAILIQNL
ISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILTLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIG
CELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDR
DVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG