; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G011830 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G011830
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionARM repeat superfamily protein
Genome locationGy14Chr5:13275260..13286868
RNA-Seq ExpressionCsGy5G011830
SyntenyCsGy5G011830
Gene Ontology termsGO:0016180 - snRNA processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032039 - integrator complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR033060 - Integrator complex subunit 7


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464722.1 PREDICTED: uncharacterized protein LOC103502541 isoform X1 [Cucumis melo]0.091.15Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKAGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGA QFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        S SLLP+RVISLIMDQIASLAKMF+DLPQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAIS+ +LGQ +SIT EIF KVKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICNN+GFPHFTFCEDLTENEIF LECAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
        VGDEWPAY AGRHAACHGSWF+ATLIFGHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
          KL ETYQCL SSVEVLKAAAVPPVQSFCFQ WFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKS NVISALALNCSLLAFCTGFAFHVPDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYEVRGIL+LCRYAISEFI MQSKSSGVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN----HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ
        QLKNIT NMSVQI KMYCILYCG SFQELKHNG+N    HQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQ
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN----HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ

Query:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        GFSNCLFDVSH+PVGCYRIKWYSCCVDSEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

XP_011654518.1 uncharacterized protein LOC101204851 isoform X1 [Cucumis sativus]0.099.48Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYL+FSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
        VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
        CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQ WFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        MGYEVRGIL+LCRYAISEFIHMQSKSSGVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
        QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES+KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN

Query:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
Subjt:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

XP_011654519.1 uncharacterized protein LOC101204851 isoform X2 [Cucumis sativus]0.096.34Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYL+FSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYK                                    VQFLCSFLSHKKSVHVREKSLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
        VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
        CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQ WFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        MGYEVRGIL+LCRYAISEFIHMQSKSSGVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
        QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES+KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN

Query:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
Subjt:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

XP_038892419.1 uncharacterized protein LOC120081531 isoform X1 [Benincasa hispida]0.084.81Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPG+AVEAI QIG RLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDK+IRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +S+RSKQYQGILSKAR+QNHHELL+RVKVVLNGGDPEAR LALILLGCWAHFAKDSAQIRYL+FSS++SSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTS+ IRMAGARVFAKLGCSHS+AK AYKAGLELASD+S+E FLVAMLFSLSKLASKSIFISSEQVQ LCS LSHKKSV V+E SLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGA QF NMESVV+ L+DALDEHMLPTSSHCDALRLL+KILFYVRPN SFLDAN+YS LVKAVENAA+SP KLK LLA  +LV LSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLP++VISLIMDQIASL K+ VDL +SN EVF EIK LLNLLLLIV E SDLWI+LLEKICL A L+M MH+D FD QQRD  FE ++KN IS RFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAISI HLG  VSITSEIF KVKLLVN+VCKSCLFSSHTCIIYSLLLNCKF+LSCRI E FR CNNDGFP FTFCEDLTE EIF LECAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
         GDEWPAYKAGRHAACHGSWF+ATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQ LLLP YG  LAIWLEKE  LNMF IEEQIN HH GSI    Y
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
         DKLLE YQCLCSS E LKA+AVPPVQ+FCFQ WFLSLRAK+LGTVGSILK L N+S  ISTDYGKLG NDT    ++VN+F KLSL LERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKSSN+ISALALNCSLLAFCTGFAFHVP+LATTLMT+NVDDFRTK HAILIQNLI RL LVDDETS+MLAQLFE+TGP NNC HLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYEVRGIL+LCRYA+SEFI +QSKS+ VD+GTFLQVIEDG+ FLSNI+ QWI IPFRVPKCFFCVRPCIG +LFATT+ RKLDEISIP GFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN---HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG
        QLKNI  NMSVQI KMYCILYC LSFQELKHNG+N   HQV EAWENDD+VEMHNKLLHY+TESSKNE YIGKCRTS+V KT+R VE FV FEPDEKGQG
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN---HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG

Query:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        FSNCL DVS+FPVGCYRIKWYSCCVD++GCFWNLLPLNSGPL TIHQL SAG
Subjt:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

XP_038892420.1 uncharacterized protein LOC120081531 isoform X2 [Benincasa hispida]0.082.47Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPG+AVEAI QIG RLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDK+IRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +S+RSKQYQGILSKAR+QNHHELL+RVKVVLNGGDPEAR LALILLGCWAHFAKDSAQIRYL+FSS++SSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTS+ IRMAGARVFAKLGCSHS+AK AYKAGLELASD+S+E FLVAMLFSLSKLASKSIFISSEQVQ LCS LSHKKSV V+E SLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGA QF NMESVV+ L+DALDEHMLPTSSHCDALRLL+KILFYVRPN SFLDAN+YS LVKAVENAA+SP KLK LLA  +LV LSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLP++VISLIMDQIASL K+ VDL +SN EVF EIK                                    D FD QQRD  FE ++KN IS RFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAISI HLG  VSITSEIF KVKLLVN+VCKSCLFSSHTCIIYSLLLNCKF+LSCRI E FR CNNDGFP FTFCEDLTE EIF LECAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
         GDEWPAYKAGRHAACHGSWF+ATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQ LLLP YG  LAIWLEKE  LNMF IEEQIN HH GSI    Y
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
         DKLLE YQCLCSS E LKA+AVPPVQ+FCFQ WFLSLRAK+LGTVGSILK L N+S  ISTDYGKLG NDT    ++VN+F KLSL LERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKSSN+ISALALNCSLLAFCTGFAFHVP+LATTLMT+NVDDFRTK HAILIQNLI RL LVDDETS+MLAQLFE+TGP NNC HLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYEVRGIL+LCRYA+SEFI +QSKS+ VD+GTFLQVIEDG+ FLSNI+ QWI IPFRVPKCFFCVRPCIG +LFATT+ RKLDEISIP GFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN---HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG
        QLKNI  NMSVQI KMYCILYC LSFQELKHNG+N   HQV EAWENDD+VEMHNKLLHY+TESSKNE YIGKCRTS+V KT+R VE FV FEPDEKGQG
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN---HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQG

Query:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        FSNCL DVS+FPVGCYRIKWYSCCVD++GCFWNLLPLNSGPL TIHQL SAG
Subjt:  FSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

TrEMBL top hitse value%identityAlignment
A0A0A0KMS4 Uncharacterized protein0.098.92Show/hide
Query:  SMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
        S+A    +AGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT
Subjt:  SMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPT

Query:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
        SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL
Subjt:  SSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDL

Query:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
        PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK
Subjt:  PQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGK

Query:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
        VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF
Subjt:  VKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIF

Query:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQ
        GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQ
Subjt:  GHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQ

Query:  SFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
        SFCFQ WFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC
Subjt:  SFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFC

Query:  TGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILSLCRYAISEFIHMQSKSS
        TGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGIL+LCRYAISEFIHMQSKSS
Subjt:  TGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILDMGYEVRGILSLCRYAISEFIHMQSKSS

Query:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
        GVDKGTFLQVIE GMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ
Subjt:  GVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQ

Query:  ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC
        ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTES+KNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC
Subjt:  ELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGC

Query:  FWNLLPLNSGPLLTIHQLSSAG
        FWNLLPLNSGPLLTIHQLSSAG
Subjt:  FWNLLPLNSGPLLTIHQLSSAG

A0A1S3CMM3 uncharacterized protein LOC103502541 isoform X10.091.15Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER+SAACAMEWSIELEKALR KKPGRAVEAIRQIGCRLQQWSREPEPN+AVYNMFDLVTWED+LFSNTILLRLADAFK DDKHIRLAVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        +SSRSKQYQGILSKARVQN HELLTRVKVVL+GGDPEA+ LALI+LGCWAHFAKDSAQIRYL+F SLFSSHLSEVKASIFAAACI QLADDFAQVFL IL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSL IRMAGARVFAKLGCSHSMAKTAYKAGLELASDTS+E FL+AMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSV VR+ SLRCLCFIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGA QFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKI+FYVR N SFLDAN+YSNLVKAVENAA+SPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        S SLLP+RVISLIMDQIASLAKMF+DLPQSN E F EIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRF+
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGFVAIS+ +LGQ +SIT EIF KVKLLVNSVC+SCLFSSHTCIIYSLLLNCKF+LSCRIAE FRICNN+GFPHFTFCEDLTENEIF LECAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
        VGDEWPAY AGRHAACHGSWF+ATLIFGHLISKVRSD+FHYWLKSLFQFALAERKIQFLLLP+YG  L IWLEKEA LNMFHI+E INHH+TGSITEGIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
          KL ETYQCL SSVEVLKAAAVPPVQSFCFQ WFLSLRAKILGTVGSILK LLNV CSISTDYGKLGTN+TGI LESVNEF KLSLKLERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDTKS NVISALALNCSLLAFCTGFAFHVPDLATTLMT+NVDDFRTKL AILIQNL SRL+LVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYEVRGIL+LCRYAISEFI MQSKSSGVDK TFLQVIEDGMQFLSNIVM WI IPFRVPK FF VRPCIGCELFATTDV KLDEISIPYGFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN----HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ
        QLKNIT NMSVQI KMYCILYCG SFQELKHNG+N    HQV EAWENDD+VEMHNKLLHYVTESSKNEAYIGKC TS V KTDR +EVFVQFEPDEKGQ
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQN----HQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ

Query:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        GFSNCLFDVSH+PVGCYRIKWYSCCVDSEGCFWNLLPLNSGPL TIHQL SAG
Subjt:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

A0A6J1DXI9 uncharacterized protein LOC1110244000.075.46Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AACAMEWSIELEKALR KK GRA EAI QIG RLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLA+AFKFDDKHIR AVVRVFLSELYSR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        + +RSKQYQGILSKARVQNHHELLTRVKVVL+GGDPE+R LAL+L GCWAHFAKDS QIRYL+ SSL S H+SEVKASIFAAACI +LADDFAQVFL +L
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMT + +L IRMAGARV  KLGCSHSMAK AYKAGLEL SD+ +E FLVAMLFSLSKLAS SIFISSEQVQ LCSFLS+KKSV V+E SLRCL FIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KGA QF NM   +++L++ALDE MLPT+ HCD LRLLQKILF V PN SFLDAN+YS LV AVE+AA+SP+KLK LLA   LV+LSL+LSG+MEVESG  
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLP++VISLIMDQIASL KM VDL QS  EVF EIK LLNLLLLIVRE SDLWILLL++ICLT  L M ++ED  D QQ D++FE ++KNDISLRFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYG VAI I ++GQ VSIT EIF KVKL+VNSVCKS LFS HTC+ YSLLLNCKF+LSCRI E F   N D FP FTFCEDLTENEI  LECA KLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
         GDEWP YKAGRHAACHGSWF+ATLIFGHL+ KV+SD+FH WLKSLFQFALAERKI  LLLP YG  LA WLEKE  L+MF  EE IN H  GSITE IY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
         DKLLE +QCLCSS E LKAAAV PV++FCFQ WFLSLRA++LG V SILK L N+S   S+DY KLGT DT    E++ EF KLSL +ERLS E DLI 
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        T+FIG+DTKSSNVISALALNCSLLAFCTGFAFHVP+LATTLMT+NV+DFRT LHA LIQNL+ +L LVD ETSK+L QLFE+TG PNNC  L SR ++LD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYE+R I +LC YA+SE + +QSKS+G ++GT LQVI++GMQFLSNI+ +W+ IPFRVPK FFCVRPC+G +LFA+TD RK D ISIP+GFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNH----QVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ
        QL+NI  N SVQI KMYCILYCGLSFQE +H G+N+    Q CEAWENDD+V M NKL HYVTE SKNEA +GK  TSS R T+R VEVFVQFEPDEKGQ
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNH----QVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQ

Query:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        GFSNCL DVS FPVGCYRIKWYSCCVDSEG  W+LLPLN GPL TIHQL   G
Subjt:  GFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

A0A6J1FP76 uncharacterized protein LOC1114472550.077.2Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AACAMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVV+VFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        + ++SKQYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY++ SS+ SSH+SEVKASIFAAACI QLADDFAQVFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSL I+MAGARVFAKLGCSHSMAKTAYKAGLELAS++S+E FLVAMLFSLSKLASKS+FISSEQV+FLCSFLS KKS  V+E SLRCL FIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KG   F NMESVV+IL+DALDE ML T+SHCD LRLL+KI+FYVRPN SFLDAN+YS LVKAVE+AAQS V L  L A ++LV LSLQLSGKMEVESGV 
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLP +VISLIMDQI SL    VDL Q N EVF EIK LLNLLLLIVRE SDLW  LLEKIC T  LIM MHE  FD QQ D+D E ++KNDISLRFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGF+AI + HLGQ V ITSEIF KVKLLV SVC+  LFSSH     SLLLNCKF+LSCRI E FR CNNDGFP FTFCE LTENEIF L+CAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
         GDEWPAYKAGRHAACHGSWF+ATLIFGHL SKVRS  F +WLKSLFQFALAERKIQ LLLP YG  L  WLE+E  LN+F  EEQI HHH GSI+EGIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
         DKLLE YQCLCSS E LK++   PVQ+FCFQ WFLSLRAK+LGT+GSI+K LLNV  S         TNDT    E+V EF KLSL  ERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDT++ NVISALALNCSLLAFCTGFAF VP+LAT+L+T+NVDDFRT L ++LI+NLI RL  VD ETSK L +LF  TG PNNC HL+ R KILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        MGYEVRGI +LCRYA+SE I  QSKS+G+D+GT L+V+EDGMQFLSNI+MQWI IPFRVPKCFFCVRPCIG EL+ATTD RKLD ISIP+GFHLSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
        QLKNI  N SV+I +MYCILYCGLSFQE KHN Q  Q  EAWE+DD+VEM NKLLHYVTESSKNE  I + +TSS  +T+R V+ FV+FEP+EKGQGFSN
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN

Query:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        CL DVS FPVG YRIKWYSCCVDSEGCFW+LLPL+ GPL T+HQL SAG
Subjt:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

A0A6J1JBG3 uncharacterized protein LOC1114852600.077.28Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR
        MER++AA AMEWSIELEKALR KKPGRAVEAI QIG RLQQWSREPEPN+AVYNMFDLVTWEDRLFSNTILLRLADAFK DDKHIR+AVVRVFLSEL SR
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSR

Query:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL
        + ++S+QYQG+LSKARVQNHHELLTRVKVVL GGDPEAR LALILLGCWAHFA+ SAQIRY++  SL SSH+SEVKASIFAAACI QLADDFA+VFLAIL
Subjt:  NSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAIL

Query:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM
        VNIMTSTTSL ++MAGARVFAKLGCSHSMAKTAYKAGLELASD+S+E FLVAMLFSLSKLASKS+FISSEQV+ LCSFLS KKSV V+E SLRCL FIFM
Subjt:  VNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFM

Query:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC
        KG   F NMESVV+IL+DALDE ML T+SHCD LRLL+KI+FY+RPN SFLDAN+Y  LVKAVE+AAQS V L  LLA ++LV LSLQLSGKMEVESGV 
Subjt:  KGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVC

Query:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA
        SFSLLP +VISLIMDQI SL    VDL Q N EVF EIK L NLLLLIVRE SDLW  L EKIC T  LIM MHE  FD QQ DVD E ++KNDISLRFA
Subjt:  SFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFA

Query:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV
        FILYGF+AI + HLGQ V ITSEIF KVKLLV SVC+  LFSSH     SLLLNCKF+LSCRI E FRI NNDGFP FTFCE LTENEIF L+CAKKLL 
Subjt:  FILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLV

Query:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY
         GDEWPAYKAGRHAACHGSWF+ATLIFGHL SKVRSD F +WLKSLFQFA+AERKIQ LLLP YG  L  WLE+E  LN+F  EEQI HHH GSI+ GIY
Subjt:  VGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIY

Query:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG
         DKLLE YQCLCSS E LK++   PVQ+FCFQ WFLSLRAK+LGTVGSI+K LLNV  S S DYGKL TNDT    E+V EF KLSL  ERLS EFDLIG
Subjt:  CDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLERLSREFDLIG

Query:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD
        TTFIGMDT++  VISALALNCSLLAFCTGFAF VP+LAT+L+T+NVDDFRT L ++L++NLI RL  VD ETSK L +LF+ TG PNNCSHL+SR KILD
Subjt:  TTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNCSHLVSRGKILD

Query:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL
        +GYEVRGI +LCRYA+SE I  QSKS+G+D+GT  QV+EDGMQFLSNI MQWI IPFRVPKCFFCVRPCIG EL+A TD RKLD ISIP+GF LSLNLCL
Subjt:  MGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPYGFHLSLNLCL

Query:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
        QLKNI  NM V+I +MYCILYCGLSFQE KHN Q  Q  EAWE+DD+VEM NKLLHYVTESSKNEA I K +TSS  +T+R V+ FV+FEP+EKGQGFSN
Subjt:  QLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN

Query:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG
        CL DVS FPVG YRIKWYSCCVDSEGCFW+LLPL+ GP  TIHQL SAG
Subjt:  CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG

SwissProt top hitse value%identityAlignment
Q1RMS6 Integrator complex subunit 71.5e-0423.58Show/hide
Query:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL
        N+  L+LAD F+  +  +RL V++V            ++Q +  L K  + N  E + R+  V++  DP AR + L +LG  A    +     + +  SL
Subjt:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL

Query:  FSSHLSEVKASIFAAACICQLADDFA-QVFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF
         S    EV+A++FAAA     + DFA  +   I   I    T + +++    +   +     +A +A +   +L +       ++  L + + LA+ S+ 
Subjt:  FSSHLSEVKASIFAAACICQLADDFA-QVFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF

Query:  ISSEQVQFLCSFLSHKKSVHVREKSLRCL
         + +Q+Q L  +L +     V+  +++ L
Subjt:  ISSEQVQFLCSFLSHKKSVHVREKSLRCL

Q54PL2 Integrator complex subunit 7 homolog4.3e-0723.4Show/hide
Query:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY
        L  N+++ RL+D F+     ++  +++VF    SE++                 +V N  E+L R+  V+   DP AR L+L +LG   H   D   I +
Subjt:  LFSNTILLRLADAFKFDDKHIRLAVVRVF---LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRY

Query:  LVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKL
         + + + S    E++A+IF    +C+++  F+   +  +  ++ +  +  I ++   R+F  +  SHS+A  + +  + L        F+  +L +L+ L
Subjt:  LVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTI-RMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKL

Query:  ASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCL
        + K I    + ++FL ++      V V+  +L+CL
Subjt:  ASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCL

Q5ZL91 Integrator complex subunit 71.8e-0524.89Show/hide
Query:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL
        N+  L+LAD F+  +  +RL V++V            ++Q +  L K  + N  E + RV  V++  DP AR + L +LG  A    +     + +  SL
Subjt:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL

Query:  FSSHLSEVKASIFAAACICQLADDFAQ-VFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF
         S    EV+A+IFAAA     + DFA  +   I   I    T + +++    +   +    S+A ++ +   +L +       ++  L + + LA+ S+ 
Subjt:  FSSHLSEVKASIFAAACICQLADDFAQ-VFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF

Query:  ISSEQVQFLCSFLSHKKSVHVREKSLRCL
           +QVQ L  +L +     V+  +++ L
Subjt:  ISSEQVQFLCSFLSHKKSVHVREKSLRCL

Q7TQK1 Integrator complex subunit 71.2e-0423.83Show/hide
Query:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL
        N+  L+LAD F+  +  +RL V++V            ++Q +  L K  + N  E + RV  V++  DP AR + L +LG  A    +     + +  SL
Subjt:  NTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSL

Query:  FSSHLSEVKASIFAAACICQLADDFA-QVFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF
         S    EV+A++FAAA     + DFA  +   I   I    T + +++    +   +     +A +A +   +L +       ++  L + + LA+ S+ 
Subjt:  FSSHLSEVKASIFAAACICQLADDFA-QVFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIF

Query:  ISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK
         + +Q+Q L  +L +     V+  +++ L  +  K
Subjt:  ISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK

Q8JGR7 Integrator complex subunit 74.7e-0626.27Show/hide
Query:  NTILLRLADAFKFDDKHIRLAVVRVF-LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSS
        N+  L+LAD F+  +  +RL V++V  LSE +                 ++ N  E + RV  V++  DP AR + L +LG  A    +     + +  S
Subjt:  NTILLRLADAFKFDDKHIRLAVVRVF-LSELYSRNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSS

Query:  LFSSHLSEVKASIFAAACICQLADDFAQ-VFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSI
        L S    EV+A+IFAAA     + DFA  +   I   I    T + +++    +   +    S+A  + +   EL S     S L+  L + ++LA+ S+
Subjt:  LFSSHLSEVKASIFAAACICQLADDFAQ-VFLAILVNIMTSTTSLTIRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSI

Query:  FISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK
            EQ+  L  +L       V+  S++ L  +  K
Subjt:  FISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMK

Arabidopsis top hitse value%identityAlignment
AT4G20060.1 ARM repeat superfamily protein1.6e-20338.11Show/hide
Query:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS
        ME+ SAACAMEWSI+LEK+LR K   +AVEAI + G +L+QWS+EPE  +AVYN+F LV  ED+LFSNTILLRL DAF   DK I+LAVVRVF+S    S
Subjt:  MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSEL-YS

Query:  RNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI
        R  + ++     LSK RV NH ELLTRVK V + GD E++ LALIL GCW  FA + A +RYLVFSS+ S H  E ++++FAAAC C++ADDFA V L +
Subjt:  RNSSRSKQYQGILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAI

Query:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCF
        L N M     +T   R+A  RVFAK+GCSH++A  A+K  ++L  D+  E  LV  L SL+KLAS+S  ++SE  + +  FL   K+ H R   LRCL F
Subjt:  LVNIMTSTTSLT--IRMAGARVFAKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCF

Query:  IFMKGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVES
        +  +G    +  E  +  +   L +  L +     AL++ QKI+ Y        DA++   L+   ENA+ S +     LA  VLV +  ++    E  S
Subjt:  IFMKGAFQFVNMESVVKILIDALDEHMLPTSSHCDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVES

Query:  GVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHED--AFDGQQRDVDFEVN--EKN
           S + LP +++ LIMD++A L ++  DL ++ Y V  E+++LL +L L+V + S+L +L+LEK+ L    I+ +++     DG    +   +N  +K 
Subjt:  GVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKELLNLLLLIVREQSDLWILLLEKICLTAALIMKMHED--AFDGQQRDVDFEVN--EKN

Query:  DISLRFAFI--LYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFM
         + +R  F+  ++ F+ + + +L  D ++ SEI+ KVK +   V        HT +I++LLL+   +        F + ++ G    +   D+    I  
Subjt:  DISLRFAFI--LYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYSLLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFM

Query:  LECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHH
        L+C+ ++L+  + WPAY+AG +AA  G+W ++ +IF  L + V+SD+   WLKSL   + AE K Q LL P+   +L  WL+    L       +++   
Subjt:  LECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLLLPNYGFRLAIWLEKEASLNMFHIEEQINHHH

Query:  TGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLER
        +G   E  +C  L E Y  L SS+ +L    +     FCFQ+WFL L+ ++L TV  +++ L  ++  +           TG   +S+ +  ++S++L++
Subjt:  TGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTNDTGIFLESVNEFGKLSLKLER

Query:  LSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTK--LHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNC
        L++EFD++ T FI +D  SS++I+ ++L+CS+LAF  G    +P  +     + +  F ++  L + L+++L+ RL  VD    + L  L   T    NC
Subjt:  LSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTK--LHAILIQNLISRLQLVDDETSKMLAQLFEVTGPPNNC

Query:  SHLVSRGKILDMGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFL-QVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFA---TTDVRKLDE
         HL SR ++L +  +V+ +LS+CR A+S    +Q++S  + K   + ++ +     LS  +M+W++IPF +PK FF +RPC+G ELFA    +  R  D 
Subjt:  SHLVSRGKILDMGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFL-QVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFA---TTDVRKLDE

Query:  ISIPYGFHLSLNLCLQLKNITQ-NMSVQIPKMYCILYCGLSFQELKHNGQNHQ---VCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDR
        +S+  GF LSL+LCLQLKNI Q  + V++ K+YC+LY  L++     +G+N++       W ++D++EM NKL H+  +S K     G+   +       
Subjt:  ISIPYGFHLSLNLCLQLKNITQ-NMSVQIPKMYCILYCGLSFQELKHNGQNHQ---VCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDR

Query:  DVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLS
         V   VQFEP+E+GQGFS+CL DVS FPVG Y+IKW SCCVD  G +WNLLPLN  P+ T+ + S
Subjt:  DVEVFVQFEPDEKGQGFSNCLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGAAGTTCTGCAGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACGTCAGATTGGCTG
TCGACTTCAGCAATGGAGTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCCTTTAAGTTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCAACAGCTCACGAAGTAAACAATACCAAGGG
ATTCTTTCAAAGGCAAGGGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTTGATTCTATTGGG
ATGTTGGGCACATTTTGCAAAAGACAGTGCCCAGATACGTTATTTGGTATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATATTTGCTGCAGCAT
GCATTTGTCAGTTAGCAGATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGAGCTCGAGTTTTT
GCAAAACTGGGATGCTCACATTCAATGGCCAAAACTGCTTATAAGGCTGGACTTGAGCTTGCCTCAGATACTAGTGATGAGAGTTTTTTGGTTGCAATGTTATTTTCACT
ATCCAAACTAGCTTCCAAGTCGATATTTATTAGTTCTGAGCAGGTGCAATTTCTCTGCTCATTTCTTAGCCACAAAAAGTCTGTGCATGTTCGAGAAAAATCTTTAAGGT
GTCTGTGTTTTATTTTCATGAAAGGAGCATTTCAGTTTGTTAATATGGAATCTGTGGTTAAAATTTTAATTGATGCACTAGATGAACACATGCTTCCAACTTCTTCACAT
TGTGATGCTCTACGACTGTTGCAAAAGATTCTTTTCTATGTGCGGCCAAACACTTCATTTTTGGATGCAAACAAATACTCTAACCTGGTTAAAGCTGTGGAGAATGCAGC
CCAATCTCCAGTGAAGTTAAAGAGGCTCCTTGCTTTCCAAGTGTTGGTGCATTTATCCTTGCAGCTTTCCGGAAAAATGGAAGTAGAATCAGGAGTTTGTTCATTTTCTT
TGTTGCCAGCACGGGTTATTTCACTAATCATGGATCAAATTGCCTCATTGGCGAAGATGTTCGTAGATCTTCCTCAGTCAAATTATGAAGTGTTTCTAGAAATTAAAGAG
CTGCTAAACCTTCTCCTGCTCATTGTAAGAGAACAGTCAGATTTGTGGATTTTGCTTTTGGAAAAAATATGTTTAACTGCTGCATTAATTATGAAGATGCATGAAGATGC
CTTTGATGGTCAGCAAAGAGACGTAGACTTTGAAGTAAACGAGAAAAATGATATCAGCTTGAGATTTGCATTCATTCTATATGGTTTTGTGGCAATCTCCATTGCTCATC
TAGGTCAAGATGTCTCTATCACGTCTGAAATATTTGGCAAGGTGAAACTATTGGTCAACAGTGTATGCAAAAGCTGTTTGTTTAGTAGCCATACTTGCATAATCTATTCC
TTGCTATTGAACTGTAAATTCGTTTTGAGTTGTAGGATAGCTGAGGTTTTTAGGATTTGCAATAACGATGGGTTTCCACATTTCACTTTTTGTGAAGATTTGACTGAAAA
CGAGATTTTTATGCTCGAGTGTGCTAAGAAGTTACTAGTAGTTGGGGATGAATGGCCTGCTTACAAGGCTGGGAGACATGCAGCATGTCATGGATCATGGTTTTCTGCTA
CCTTGATTTTTGGCCATTTAATTTCAAAGGTTCGCTCTGATGTCTTCCATTACTGGTTGAAATCTTTGTTTCAGTTTGCTCTTGCGGAAAGAAAAATCCAATTTCTACTG
TTACCAAACTATGGTTTTCGCTTGGCAATCTGGTTAGAGAAGGAAGCAAGTCTAAACATGTTTCACATTGAAGAACAAATAAACCATCACCACACTGGGAGCATCACCGA
GGGCATTTACTGTGACAAGCTTTTGGAGACCTACCAGTGTCTTTGCTCTTCAGTTGAGGTCTTAAAAGCAGCTGCTGTCCCACCGGTTCAGTCATTTTGTTTCCAGAGCT
GGTTTTTGTCTTTAAGAGCTAAGATTTTAGGAACTGTGGGGAGCATACTGAAGCAATTGCTAAACGTTTCATGTAGTATTAGTACTGACTATGGTAAGCTTGGGACAAAC
GATACCGGTATTTTCCTTGAATCTGTGAACGAATTTGGTAAATTATCTTTAAAGTTAGAGAGGCTGTCCCGTGAATTTGATCTCATTGGAACGACTTTTATTGGGATGGA
CACCAAGAGTTCGAACGTTATTTCAGCCCTTGCACTGAATTGCTCTTTGTTAGCCTTTTGTACTGGTTTTGCCTTTCATGTTCCTGACTTGGCTACAACTCTTATGACCA
AAAATGTGGATGATTTCAGAACTAAATTACATGCAATACTCATTCAAAATCTGATTAGCAGGCTGCAGCTGGTAGACGATGAAACTAGTAAAATGCTTGCACAGCTTTTT
GAGGTCACTGGACCACCAAATAACTGCTCGCATTTGGTTTCAAGAGGCAAAATATTAGACATGGGGTACGAAGTAAGAGGCATCTTATCTCTCTGTAGGTATGCCATTTC
TGAGTTTATCCATATGCAAAGCAAATCAAGCGGAGTGGATAAGGGGACATTTCTCCAGGTTATCGAGGATGGCATGCAGTTTTTATCAAATATTGTTATGCAGTGGATAC
GCATTCCATTTCGAGTGCCTAAGTGTTTCTTTTGTGTAAGGCCTTGCATTGGCTGTGAACTCTTTGCCACTACTGATGTGCGTAAACTGGATGAAATATCTATCCCATAT
GGCTTCCATCTGTCATTAAATCTTTGTCTTCAACTCAAAAACATTACTCAAAATATGTCAGTTCAAATCCCCAAGATGTACTGCATTTTGTATTGTGGTTTGTCCTTCCA
GGAACTAAAGCACAATGGGCAAAATCATCAGGTTTGTGAAGCTTGGGAAAACGACGACGTTGTAGAAATGCATAACAAACTGTTGCATTATGTGACTGAGTCGAGCAAAA
ACGAGGCTTATATAGGCAAGTGCAGAACATCAAGTGTTCGTAAAACTGATAGGGACGTTGAAGTGTTTGTGCAGTTCGAGCCAGATGAGAAAGGGCAAGGATTCTCAAAT
TGTTTGTTTGATGTATCTCATTTTCCTGTAGGTTGTTATAGAATCAAATGGTATAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAACCTCCTTCCTTTGAATTCTGG
ACCATTACTTACTATCCATCAACTTTCATCAGCTGGGTGA
mRNA sequenceShow/hide mRNA sequence
CCGAACCTAAGCAGCAAAACCACTTGAGAGCTCAAATACTACCAAGCTCTCTACTACTCAACCTTCTTCTTCTTTCTTTCTCTTGATTCCCTAACGTTAAGTTTGTAAAG
AAACCAAGGAATACACTTTCTATAATCCCCTGATTCAAGATTGTACTCCATTTTACCTCCAATTCCACAGGGTTTCTGCTTCCCCACTTGTGTTTTAGGGTTCCACAGAA
GATGGCATTGGATGAGAATATTGTGTTGCGCGATGTCACTAATGCTGGCATTGTCATCAGCGACCGCACTGTCAAACACCAGAGGCGCTATATTAAGTAATTGGGTTTGA
AGATTTGGATGCCGTGTGCTGTGGTAGGGACTATTTGGTGATGGTTTTGTAAAGGGGTTCTATGGTTTCAGAGTTTATGTGTGTCTGTGACTAGCTATCATTTTTGGTCA
TTTGCTACAAATATGGAGAGAAGTTCTGCAGCTTGCGCTATGGAATGGAGTATTGAGCTGGAGAAGGCTCTCCGTTTGAAGAAACCAGGTCGGGCTGTTGAAGCTATACG
TCAGATTGGCTGTCGACTTCAGCAATGGAGTAGAGAGCCAGAACCCAATGTAGCTGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGGCTATTTTCCAACACTA
TTCTCCTACGGCTTGCTGATGCCTTTAAGTTTGATGATAAGCATATTAGACTTGCAGTTGTTAGAGTTTTCTTATCCGAGCTATATAGCCGCAACAGCTCACGAAGTAAA
CAATACCAAGGGATTCTTTCAAAGGCAAGGGTGCAAAATCACCATGAATTACTTACTCGAGTCAAGGTTGTTCTTAATGGAGGGGATCCTGAGGCTAGAGGTCTAGCTTT
GATTCTATTGGGATGTTGGGCACATTTTGCAAAAGACAGTGCCCAGATACGTTATTTGGTATTTTCTAGTCTGTTTTCTTCTCATCTTTCGGAGGTAAAAGCGTCAATAT
TTGCTGCAGCATGCATTTGTCAGTTAGCAGATGACTTTGCGCAAGTTTTCTTAGCGATTTTGGTTAATATAATGACTTCTACTACATCCTTGACCATCAGAATGGCTGGA
GCTCGAGTTTTTGCAAAACTGGGATGCTCACATTCAATGGCCAAAACTGCTTATAAGGCTGGACTTGAGCTTGCCTCAGATACTAGTGATGAGAGTTTTTTGGTTGCAAT
GTTATTTTCACTATCCAAACTAGCTTCCAAGTCGATATTTATTAGTTCTGAGCAGGTGCAATTTCTCTGCTCATTTCTTAGCCACAAAAAGTCTGTGCATGTTCGAGAAA
AATCTTTAAGGTGTCTGTGTTTTATTTTCATGAAAGGAGCATTTCAGTTTGTTAATATGGAATCTGTGGTTAAAATTTTAATTGATGCACTAGATGAACACATGCTTCCA
ACTTCTTCACATTGTGATGCTCTACGACTGTTGCAAAAGATTCTTTTCTATGTGCGGCCAAACACTTCATTTTTGGATGCAAACAAATACTCTAACCTGGTTAAAGCTGT
GGAGAATGCAGCCCAATCTCCAGTGAAGTTAAAGAGGCTCCTTGCTTTCCAAGTGTTGGTGCATTTATCCTTGCAGCTTTCCGGAAAAATGGAAGTAGAATCAGGAGTTT
GTTCATTTTCTTTGTTGCCAGCACGGGTTATTTCACTAATCATGGATCAAATTGCCTCATTGGCGAAGATGTTCGTAGATCTTCCTCAGTCAAATTATGAAGTGTTTCTA
GAAATTAAAGAGCTGCTAAACCTTCTCCTGCTCATTGTAAGAGAACAGTCAGATTTGTGGATTTTGCTTTTGGAAAAAATATGTTTAACTGCTGCATTAATTATGAAGAT
GCATGAAGATGCCTTTGATGGTCAGCAAAGAGACGTAGACTTTGAAGTAAACGAGAAAAATGATATCAGCTTGAGATTTGCATTCATTCTATATGGTTTTGTGGCAATCT
CCATTGCTCATCTAGGTCAAGATGTCTCTATCACGTCTGAAATATTTGGCAAGGTGAAACTATTGGTCAACAGTGTATGCAAAAGCTGTTTGTTTAGTAGCCATACTTGC
ATAATCTATTCCTTGCTATTGAACTGTAAATTCGTTTTGAGTTGTAGGATAGCTGAGGTTTTTAGGATTTGCAATAACGATGGGTTTCCACATTTCACTTTTTGTGAAGA
TTTGACTGAAAACGAGATTTTTATGCTCGAGTGTGCTAAGAAGTTACTAGTAGTTGGGGATGAATGGCCTGCTTACAAGGCTGGGAGACATGCAGCATGTCATGGATCAT
GGTTTTCTGCTACCTTGATTTTTGGCCATTTAATTTCAAAGGTTCGCTCTGATGTCTTCCATTACTGGTTGAAATCTTTGTTTCAGTTTGCTCTTGCGGAAAGAAAAATC
CAATTTCTACTGTTACCAAACTATGGTTTTCGCTTGGCAATCTGGTTAGAGAAGGAAGCAAGTCTAAACATGTTTCACATTGAAGAACAAATAAACCATCACCACACTGG
GAGCATCACCGAGGGCATTTACTGTGACAAGCTTTTGGAGACCTACCAGTGTCTTTGCTCTTCAGTTGAGGTCTTAAAAGCAGCTGCTGTCCCACCGGTTCAGTCATTTT
GTTTCCAGAGCTGGTTTTTGTCTTTAAGAGCTAAGATTTTAGGAACTGTGGGGAGCATACTGAAGCAATTGCTAAACGTTTCATGTAGTATTAGTACTGACTATGGTAAG
CTTGGGACAAACGATACCGGTATTTTCCTTGAATCTGTGAACGAATTTGGTAAATTATCTTTAAAGTTAGAGAGGCTGTCCCGTGAATTTGATCTCATTGGAACGACTTT
TATTGGGATGGACACCAAGAGTTCGAACGTTATTTCAGCCCTTGCACTGAATTGCTCTTTGTTAGCCTTTTGTACTGGTTTTGCCTTTCATGTTCCTGACTTGGCTACAA
CTCTTATGACCAAAAATGTGGATGATTTCAGAACTAAATTACATGCAATACTCATTCAAAATCTGATTAGCAGGCTGCAGCTGGTAGACGATGAAACTAGTAAAATGCTT
GCACAGCTTTTTGAGGTCACTGGACCACCAAATAACTGCTCGCATTTGGTTTCAAGAGGCAAAATATTAGACATGGGGTACGAAGTAAGAGGCATCTTATCTCTCTGTAG
GTATGCCATTTCTGAGTTTATCCATATGCAAAGCAAATCAAGCGGAGTGGATAAGGGGACATTTCTCCAGGTTATCGAGGATGGCATGCAGTTTTTATCAAATATTGTTA
TGCAGTGGATACGCATTCCATTTCGAGTGCCTAAGTGTTTCTTTTGTGTAAGGCCTTGCATTGGCTGTGAACTCTTTGCCACTACTGATGTGCGTAAACTGGATGAAATA
TCTATCCCATATGGCTTCCATCTGTCATTAAATCTTTGTCTTCAACTCAAAAACATTACTCAAAATATGTCAGTTCAAATCCCCAAGATGTACTGCATTTTGTATTGTGG
TTTGTCCTTCCAGGAACTAAAGCACAATGGGCAAAATCATCAGGTTTGTGAAGCTTGGGAAAACGACGACGTTGTAGAAATGCATAACAAACTGTTGCATTATGTGACTG
AGTCGAGCAAAAACGAGGCTTATATAGGCAAGTGCAGAACATCAAGTGTTCGTAAAACTGATAGGGACGTTGAAGTGTTTGTGCAGTTCGAGCCAGATGAGAAAGGGCAA
GGATTCTCAAATTGTTTGTTTGATGTATCTCATTTTCCTGTAGGTTGTTATAGAATCAAATGGTATAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAACCTCCTTCC
TTTGAATTCTGGACCATTACTTACTATCCATCAACTTTCATCAGCTGGGTGATTTTTCATCAAACAATGAAGTTTTACAGGCGATGGTAGTCTTTTCTCCCGTAGGTAAG
AGAGGTTTGTATATAGAAAACATATATCAGTGCATCTTAGATTATATTAATCTTTGTGTATCTTATTCCGTGAAGAACTATTTTTTCTTGTTAAAATATTTTTATTTTCT
TTTATGCGCTCGCTCAGTGAATAGCTCTTCTGTATATCATGACCATGGTTGAACTGTGAGTTTTATTTGATATCTTAACAAGTCTCTTATG
Protein sequenceShow/hide protein sequence
MERSSAACAMEWSIELEKALRLKKPGRAVEAIRQIGCRLQQWSREPEPNVAVYNMFDLVTWEDRLFSNTILLRLADAFKFDDKHIRLAVVRVFLSELYSRNSSRSKQYQG
ILSKARVQNHHELLTRVKVVLNGGDPEARGLALILLGCWAHFAKDSAQIRYLVFSSLFSSHLSEVKASIFAAACICQLADDFAQVFLAILVNIMTSTTSLTIRMAGARVF
AKLGCSHSMAKTAYKAGLELASDTSDESFLVAMLFSLSKLASKSIFISSEQVQFLCSFLSHKKSVHVREKSLRCLCFIFMKGAFQFVNMESVVKILIDALDEHMLPTSSH
CDALRLLQKILFYVRPNTSFLDANKYSNLVKAVENAAQSPVKLKRLLAFQVLVHLSLQLSGKMEVESGVCSFSLLPARVISLIMDQIASLAKMFVDLPQSNYEVFLEIKE
LLNLLLLIVREQSDLWILLLEKICLTAALIMKMHEDAFDGQQRDVDFEVNEKNDISLRFAFILYGFVAISIAHLGQDVSITSEIFGKVKLLVNSVCKSCLFSSHTCIIYS
LLLNCKFVLSCRIAEVFRICNNDGFPHFTFCEDLTENEIFMLECAKKLLVVGDEWPAYKAGRHAACHGSWFSATLIFGHLISKVRSDVFHYWLKSLFQFALAERKIQFLL
LPNYGFRLAIWLEKEASLNMFHIEEQINHHHTGSITEGIYCDKLLETYQCLCSSVEVLKAAAVPPVQSFCFQSWFLSLRAKILGTVGSILKQLLNVSCSISTDYGKLGTN
DTGIFLESVNEFGKLSLKLERLSREFDLIGTTFIGMDTKSSNVISALALNCSLLAFCTGFAFHVPDLATTLMTKNVDDFRTKLHAILIQNLISRLQLVDDETSKMLAQLF
EVTGPPNNCSHLVSRGKILDMGYEVRGILSLCRYAISEFIHMQSKSSGVDKGTFLQVIEDGMQFLSNIVMQWIRIPFRVPKCFFCVRPCIGCELFATTDVRKLDEISIPY
GFHLSLNLCLQLKNITQNMSVQIPKMYCILYCGLSFQELKHNGQNHQVCEAWENDDVVEMHNKLLHYVTESSKNEAYIGKCRTSSVRKTDRDVEVFVQFEPDEKGQGFSN
CLFDVSHFPVGCYRIKWYSCCVDSEGCFWNLLPLNSGPLLTIHQLSSAG