; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001564 (gene) of Snake gourd v1 genome

Gene IDTan0001564
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionARM repeat superfamily protein
Genome locationLG04:20922757..20933980
RNA-Seq ExpressionTan0001564
SyntenyTan0001564
Gene Ontology termsGO:0016180 - snRNA processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032039 - integrator complex (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR033060 - Integrator complex subunit 7


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031480.1 Integrator complex subunit 7-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0079.9Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKKP RAVEAILQIGSRLQQWSREPEP++AVYNMFDLVTWEDRLFSNTILLRLADAFK  DKHIR+AVV+VFLSEL SR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D+ ++K+YQG+LSKARVQNHHELLTRVKVVL GGDPEARALALILLGCWAHFA+ SAQI Y+ILSS+LSSH+ EVKASIFAA C S+LADDFAQVFLA+L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T+LAI+MAGARVFAKLGCSH+MAK AYK GLELAS+SSEEDFLVAMLFSLSKLASKS+FISSEQV+ LCSFLS +KSVRVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +G C FTNMES+VRILV ALDE ML TTSHCD LRLLRKI+FYV PNPSFLDANEYSKLVKA+ESAA+S V L SL AVR++VDLSL+ SGKMEVESG  
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLP QVISLIMDQI SL   LVDLS+ NSEVFQEIKGLLNLLLLIVREHSDLW  LLEK+C  VELIMNM+E V + QQ +   EGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AIC+GHLGQV  IT+EIF+KVK LV SVC+  LFSSH     SLLLNCKFILSCRITED RICNNDGFPRFTFCE LTEN IFTL+CAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHL SKV S FF  WLKSLFQF+L ERKIQLLLLPQYGSGL+NWLE+ETILN FS EEQI  H AGSI+E +Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL---SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK
        YDKLLEAYQC+CSSGE LK++  + VQA CFQRWFL LRAK+LGT+GSI+KLLLN+   +++TAAIHETV+EF KLSLT ERLS EFDLIGTTFIGMD  
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL---SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK

Query:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL
        + +VISALALNCSLLAFCTGFAF VPNLA +L+ ENV    T L   LI+NLIGRLWLVD ETSK+LT+LF   G PNNC HLLSR+KILDV YEVR I 
Subjt:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL

Query:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM
        TLCRYAVSE +  Q+K NG++E T LQV++DG++FL NILMQWISIPFRVP+CFFCVRPCIGSE++ATTDARKPDGISIPFGFHLSLNLCLQL+NI PN 
Subjt:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM

Query:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV
        SV+IT+MYCILYCGLSFQEP+H    NEQKQQ +EAWE +D+VEMQNKLL+YVTES+KNEACISK +TSS CRTERVV+AFV+FEP+EKGQGFS CLLDV
Subjt:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV

Query:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        S FPVG YR+KWYSCCVDSEGCFWSLLPL+ GPLFT+HQLPSAG
Subjt:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

XP_022157776.1 uncharacterized protein LOC111024400 [Momordica charantia]0.0e+0079.53Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKK  RA EAILQIGSRLQQWSREPEP+VAVYNMFDLVTWEDRLFSNTILLRLA+AFKF DKHIR AVVRVFLSELYSR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        DK R+K+YQGILSKARVQNHHELLTRVKVVL+GGDPE+RALAL+L GCWAHFAKDS QI YLILSSLLS H+ EVKASIFAA C  ELADDFAQVFL ML
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMT S  LAIRMAGARV  KLGCSH+MAK AYK GLEL SDS EEDFLVAMLFSLSKLAS SIFISSEQVQLLCSFLS +KSVRVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +GACQFTNM   +R+LV ALDE MLPTT HCD LRLL+KILF VLPNPSFLDANEYSKLV A+ESAARSP+KLKSLLAV  +V+LSLK SG+MEVESGG 
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLPS+VISLIMDQI SLGKMLVDLS+S SEVFQEIKGLLNLLLLIVREHSDLWILLL+++CL V+L M++YED  +SQQ++  FEGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYG  AICIG++GQV SIT EIF+KVK +VNSVCKS LFS HTC+TYSLLLNCKFILSCRITED    N D FP FTFCE LTEN I TLECA K+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        +GDEWP YKAGRHAACHGSW AATLIFGHL+ KV SD FHCWLKSLFQF+L ERKI LLLLPQYGSGL NWLEKE IL+ FS EE INQHQAGSITEA+Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG
        YDKLLEA+QC+CSSGETLKAAA+S V+A CFQRWFL LRA+VLG V SILKLL N+S             +T AIHET+KEF KLSL +ERLS E DLI 
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        T+FIG+D KSS+VISALALNCSLLAFCTGFAFHVPNLA TLM ENV    T LH ELIQNL+G+LWLVDSETSK L QLFEI G PNNC  L SRS++LD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YE+RDI  LC YAVSE VGLQ+K NG NE T LQVI++G++FL NIL +W+SIPFRVPK FFCVRPC+GS++FA+TDARKPDGISIPFGFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QLRNI PN SVQITKMYCILYCGLSFQEPRH G+NNE KQQ  EAWE++DMV MQNKL +YVTE +KNEA + K  TSS   TERVVE FVQFEPDEKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS FPVGCYR+KWYSCCVDSEG  WSLLPLN GPLFTIHQLP  G
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

XP_023551506.1 uncharacterized protein LOC111809292 [Cucurbita pepo subsp. pepo]0.0e+0079.27Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKKP RAVEAIL+IGSRLQQWSREPEP++AVYNMFDLVTWEDRLFSNTILLRLADAFK  DKHIR+AVV+VFLSEL SR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D+ ++K+YQG+LSKARVQNHHELLTRVKVVL GGDPEARALALILLGCWAHFA+ SAQI Y+ILSS+LSSH+ EVKASIFAA C S+LADDFAQVFLA+L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T+LAI+MAGARVFAKLGCSH+MAK AYK GLELASD SEEDFLVAMLFSLSKLASKS+FISSEQV+LLCSFLS +KSVRVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +G C FT+MES+VRILV ALDE ML TTSHCD LRLLRKI+FYV PNPSFLDANEYSKLVKA+ESAA+S VKLKSLLAVR++VDLSL+ SGKMEVESG  
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLP QVISLIMDQI SL   LVDLS+ NSEVFQEIKGLLNLLLLIVREHSDLW  LLEK+C  VELIMNM+E V + QQ +   EGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AIC+GHLGQV  IT+EIF+KVK LV SVC+  LFSS     YSLLLNCKFILSCRITED RICNNDGFPRFTFCE LTEN IFTL+CAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHL SKV S FF  WLKSLFQF+L ERKIQLLLLPQYGSGL+NWLE+ET+LN FS EEQI  H AGSI+E +Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG
        YDKLLEAYQC+CSSGE LK++  + VQA CFQRWFL LRAK+LGTVGSI+KLLLN+            +++TAAIHETVKEF KLSLT ERLS EFDLIG
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        TTFIGMD ++ +VISALALNCSLLAFCTGFAF VPNL  +L+ ENV    T L   LI+NLIGRLWLVD ETSK+LT+LF+  G PNNCSHLLSR+KILD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YEVR I TLCRYAVSE +  Q+K NG++E T LQV++DG++FL NILMQW+SIPFRVPKCFFCVRPCIGSE++ATTDARK DGISIPFGFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QL+NI PN SV+IT+MYCILYCGLSFQE  H    NEQKQQ +EAWE +D+VEMQNKLL+YVTES+KNEACISK +TSS CRTERVV+AFV+FE +EKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS FPVG YR+KWYSCCVDSEGC WSLLPL+ GPLFTIHQLPSAG
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

XP_038892419.1 uncharacterized protein LOC120081531 isoform X1 [Benincasa hispida]0.0e+0082.05Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERN+AA AMEWSIELEKALRSKKP +AVEAILQIGSRLQQWSREPEP+VAVYNMFDLVTWEDRLFSNTILLRLADAFKF DK+IRLAVVRVFLSELYSR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D AR+K+YQGILSKAR+QNHHELL+RVKVVLNGGDPEARALALILLGCWAHFAKDSAQI YLI SS+ SSHL EVKASIFAA C S+LADDFAQVFL +L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T++AIRMAGARVFAKLGCSH++AK AYK GLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCS LS +KSVRVQETSLRCLC+IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +GACQFTNMES+VR LV ALDEHMLPT+SHCDALRLLRKILFYV PNPSFLDANEYSKLVKA+E+AARSP KLK LLAV ++VDLSL+ SGKMEVESG C
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLPSQVISLIMDQI SLGK+ VDL+RSNSEVFQEIKGLLNLLLLIV EHSDLWI+LLEK+CL  EL+MNM++DV +SQQ +E FEGD+KN    RFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AI IGHLG V SIT+EIF+KVK LVN+VCKS LFSSHTCI YSLLLNCKFILSCRI ED R CNNDGFP FTFCE LTE  IFTLECAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHLISKV SD FH WLKSLFQF+L ERKIQLLLLPQYGSGL  WLEKE ILN FSIEEQINQH AGSI    Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG
         DKLLEAYQC+CSSGE LKA+A+  VQA CFQRWFL LRAKVLGTVGSILKLL N+S            ++TAAIH+TV +F KLSLTLERLS EFDLIG
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        TTFIGMD KSS++ISALALNCSLLAFCTGFAFHVPNLA TLM ENV    TK H  LIQNLIGRLWLVD ETS+ L QLFEI G PNNC HL+SR KILD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YEVR ILTLCRYAVSE + LQ+K N ++E TFLQVI+DG+ FL NIL QWISIPFRVPKCFFCVRPCIGS++FATT+ARK D ISIP GFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QL+NIA NMSVQITKMYCILYC LSFQE +HNGK N QK Q +EAWE++D+VEM NKLL+Y+TES+KNE  I KCRTS+ C+TERVVEAFV FEPDEKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS+FPVGCYR+KWYSCCVD++GCFW+LLPLNSGPLFTIHQLPSAG
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

XP_038892420.1 uncharacterized protein LOC120081531 isoform X2 [Benincasa hispida]0.0e+0079.62Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERN+AA AMEWSIELEKALRSKKP +AVEAILQIGSRLQQWSREPEP+VAVYNMFDLVTWEDRLFSNTILLRLADAFKF DK+IRLAVVRVFLSELYSR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D AR+K+YQGILSKAR+QNHHELL+RVKVVLNGGDPEARALALILLGCWAHFAKDSAQI YLI SS+ SSHL EVKASIFAA C S+LADDFAQVFL +L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T++AIRMAGARVFAKLGCSH++AK AYK GLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCS LS +KSVRVQETSLRCLC+IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +GACQFTNMES+VR LV ALDEHMLPT+SHCDALRLLRKILFYV PNPSFLDANEYSKLVKA+E+AARSP KLK LLAV ++VDLSL+ SGKMEVESG C
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLPSQVISLIMDQI SLGK+ VDL+RSNSEVFQEIK                                    DV +SQQ +E FEGD+KN    RFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AI IGHLG V SIT+EIF+KVK LVN+VCKS LFSSHTCI YSLLLNCKFILSCRI ED R CNNDGFP FTFCE LTE  IFTLECAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHLISKV SD FH WLKSLFQF+L ERKIQLLLLPQYGSGL  WLEKE ILN FSIEEQINQH AGSI    Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG
         DKLLEAYQC+CSSGE LKA+A+  VQA CFQRWFL LRAKVLGTVGSILKLL N+S            ++TAAIH+TV +F KLSLTLERLS EFDLIG
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        TTFIGMD KSS++ISALALNCSLLAFCTGFAFHVPNLA TLM ENV    TK H  LIQNLIGRLWLVD ETS+ L QLFEI G PNNC HL+SR KILD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YEVR ILTLCRYAVSE + LQ+K N ++E TFLQVI+DG+ FL NIL QWISIPFRVPKCFFCVRPCIGS++FATT+ARK D ISIP GFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QL+NIA NMSVQITKMYCILYC LSFQE +HNGK N QK Q +EAWE++D+VEM NKLL+Y+TES+KNE  I KCRTS+ C+TERVVEAFV FEPDEKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS+FPVGCYR+KWYSCCVD++GCFW+LLPLNSGPLFTIHQLPSAG
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

TrEMBL top hitse value%identityAlignment
A0A1S3CMM3 uncharacterized protein LOC103502541 isoform X10.0e+0077.54Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERN+AA AMEWSIELEKALR KKP RAVEAI QIG RLQQWSREPEP++AVYNMFDLVTWED+LFSNTILLRLADAFK  DKHIRLAVVRVFLSELYSR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D +R+K+YQGILSKARVQN HELLTRVKVVL+GGDPEA+ALALI+LGCWAHFAKDSAQI YLI  SL SSHL EVKASIFAA C S+LADDFAQVFL +L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T+LAIRMAGARVFAKLGCSH+MAK AYK GLELASD+SEE FL+AMLFSLSKLASKSIFISSEQVQ LCSFLS +KSVRV++TSLRCLC+IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +GACQF NMES+V+IL+ ALDEHMLPT+SHCDALRLL+KI+FYV  NPSFLDANEYS LVKA+E+AARSPVKLK LLA +V+V LSL+ SGKMEVESG C
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        S SLLPS+VISLIMDQI SL KM +DL +SNSE FQEIK LLNLLLLIVRE SDLWILLLEK+CL   LIM M+ED  + QQ +  FE +EKND  LRF+
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AI +G+LGQV SIT EIF+KVK LVNSVC+S LFSSHTCI YSLLLNCKFILSCRI ED RICNN+GFP FTFCE LTEN IFTLECAKK+L+
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
         GDEWPAY AGRHAACHGSW AATLIFGHLISKV SD FH WLKSLFQF+L ERKIQ LLLP YGSGL  WLEKE ILN F I+E IN H  GSITE +Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG
        Y KL E YQC+ SS E LKAAA+  VQ+ CFQRWFL LRAK+LGTVGSILK LLN+            ++NT  I E+V EF KLSL LERLS EFDLIG
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        TTFIGMD KS +VISALALNCSLLAFCTGFAFHVP+LA TLM ENV    TKL   LIQNL  RL LVD ETSK L QLFE+ G PNNCSHL+SR KILD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        + YEVR ILTLCRYA+SE + +Q+K +G+++ETFLQVI+DG++FL NI+M WISIPFRVPK FF VRPCIG E+FATTD  K D ISIP+GFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QL+NI PNMSVQITKMYCILYCG SFQE +HNGKNN +  Q +EAWE++D+VEM NKLL+YVTES+KNEA I KC TS  C+T+RV+E FVQFEPDEKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CL DVSH+PVGCYR+KWYSCCVDSEGCFW+LLPLNSGPLFTIHQL SAG
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

A0A1S3CNQ5 uncharacterized protein LOC103502541 isoform X20.0e+0075.64Show/hide
Query:  LAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFMQGACQFTNM
        L  RMAGARVFAKLGCSH+MAK AYK GLELASD+SEE FL+AMLFSLSKLASKSIFISSEQVQ LCSFLS +KSVRV++TSLRCLC+IFM+GACQF NM
Subjt:  LAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFMQGACQFTNM

Query:  ESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGCSFSLLPSQV
        ES+V+IL+ ALDEHMLPT+SHCDALRLL+KI+FYV  NPSFLDANEYS LVKA+E+AARSPVKLK LLA +V+V LSL+ SGKMEVESG CS SLLPS+V
Subjt:  ESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGCSFSLLPSQV

Query:  ISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFAFILYGFAAI
        ISLIMDQI SL KM +DL +SNSE FQEIK LLNLLLLIVRE SDLWILLLEK+CL   LIM M+ED  + QQ +  FE +EKND  LRF+FILYGF AI
Subjt:  ISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFAFILYGFAAI

Query:  CIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLKNGDEWPAYK
         +G+LGQV SIT EIF+KVK LVNSVC+S LFSSHTCI YSLLLNCKFILSCRI ED RICNN+GFP FTFCE LTEN IFTLECAKK+L+ GDEWPAY 
Subjt:  CIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLKNGDEWPAYK

Query:  AGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMYYDKLLEAYQ
        AGRHAACHGSW AATLIFGHLISKV SD FH WLKSLFQF+L ERKIQ LLLP YGSGL  WLEKE ILN F I+E IN H  GSITE +YY KL E YQ
Subjt:  AGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMYYDKLLEAYQ

Query:  CVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK
        C+ SS E LKAAA+  VQ+ CFQRWFL LRAK+LGTVGSILK LLN+            ++NT  I E+V EF KLSL LERLS EFDLIGTTFIGMD K
Subjt:  CVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK

Query:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL
        S +VISALALNCSLLAFCTGFAFHVP+LA TLM ENV    TKL   LIQNL  RL LVD ETSK L QLFE+ G PNNCSHL+SR KILD+ YEVR IL
Subjt:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL

Query:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM
        TLCRYA+SE + +Q+K +G+++ETFLQVI+DG++FL NI+M WISIPFRVPK FF VRPCIG E+FATTD  K D ISIP+GFHLSLNLCLQL+NI PNM
Subjt:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM

Query:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV
        SVQITKMYCILYCG SFQE +HNGKNN +  Q +EAWE++D+VEM NKLL+YVTES+KNEA I KC TS  C+T+RV+E FVQFEPDEKGQGFS CL DV
Subjt:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV

Query:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        SH+PVGCYR+KWYSCCVDSEGCFW+LLPLNSGPLFTIHQL SAG
Subjt:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

A0A6J1DXI9 uncharacterized protein LOC1110244000.0e+0079.53Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKK  RA EAILQIGSRLQQWSREPEP+VAVYNMFDLVTWEDRLFSNTILLRLA+AFKF DKHIR AVVRVFLSELYSR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        DK R+K+YQGILSKARVQNHHELLTRVKVVL+GGDPE+RALAL+L GCWAHFAKDS QI YLILSSLLS H+ EVKASIFAA C  ELADDFAQVFL ML
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMT S  LAIRMAGARV  KLGCSH+MAK AYK GLEL SDS EEDFLVAMLFSLSKLAS SIFISSEQVQLLCSFLS +KSVRVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +GACQFTNM   +R+LV ALDE MLPTT HCD LRLL+KILF VLPNPSFLDANEYSKLV A+ESAARSP+KLKSLLAV  +V+LSLK SG+MEVESGG 
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLPS+VISLIMDQI SLGKMLVDLS+S SEVFQEIKGLLNLLLLIVREHSDLWILLL+++CL V+L M++YED  +SQQ++  FEGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYG  AICIG++GQV SIT EIF+KVK +VNSVCKS LFS HTC+TYSLLLNCKFILSCRITED    N D FP FTFCE LTEN I TLECA K+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        +GDEWP YKAGRHAACHGSW AATLIFGHL+ KV SD FHCWLKSLFQF+L ERKI LLLLPQYGSGL NWLEKE IL+ FS EE INQHQAGSITEA+Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG
        YDKLLEA+QC+CSSGETLKAAA+S V+A CFQRWFL LRA+VLG V SILKLL N+S             +T AIHET+KEF KLSL +ERLS E DLI 
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLS------------SNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        T+FIG+D KSS+VISALALNCSLLAFCTGFAFHVPNLA TLM ENV    T LH ELIQNL+G+LWLVDSETSK L QLFEI G PNNC  L SRS++LD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YE+RDI  LC YAVSE VGLQ+K NG NE T LQVI++G++FL NIL +W+SIPFRVPK FFCVRPC+GS++FA+TDARKPDGISIPFGFHLSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QLRNI PN SVQITKMYCILYCGLSFQEPRH G+NNE KQQ  EAWE++DMV MQNKL +YVTE +KNEA + K  TSS   TERVVE FVQFEPDEKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS FPVGCYR+KWYSCCVDSEG  WSLLPLN GPLFTIHQLP  G
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

A0A6J1FP76 uncharacterized protein LOC1114472550.0e+0079.2Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKKP RAVEAILQIGSRLQQWSREPEP++AVYNMFDLVTWEDRLFSNTILLRLADAFK  DKHIR+AVV+VFLSEL SR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D+ ++K+YQG+LSKARVQNHHELLTRVKVVL GGDPEARALALILLGCWAHFA+ SAQI Y+ILSS+LSSH+ EVKASIFAA C S+LADDFAQVFLA+L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T+LAI+MAGARVFAKLGCSH+MAK AYK GLELAS+SSEEDFLVAMLFSLSKLASKS+FISSEQV+ LCSFLS +KS RVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +G C FTNMES+VRILV ALDE ML TTSHCD LRLLRKI+FYV PNPSFLDANEYSKLVKA+ESAA+S V L SL AVR++VDLSL+ SGKMEVESG  
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLP QVISLIMDQI SL   LVDLS+ NSEVFQEIKGLLNLLLLIVREHSDLW  LLEK+C  VELIMNM+E V + QQ +   EGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AIC+GHLGQV  IT+EIF+KVK LV SVC+  LFSSH     SLLLNCKFILSCRITED R CNNDGFPRFTFCE LTEN IFTL+CAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHL SKV S FF  WLKSLFQF+L ERKIQLLLLPQYGSGL+NWLE+ETILN FS EEQI  H AGSI+E +Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL---SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK
        YDKLLEAYQC+CSSGE LK++  + VQA CFQRWFL LRAK+LGT+GSI+KLLLN+   +++TAAIHETV+EF KLSLT ERLS EFDLIGTTFIGMD +
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL---SSNTAAIHETVKEFCKLSLTLERLSCEFDLIGTTFIGMDIK

Query:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL
        + +VISALALNCSLLAFCTGFAF VPNLA +L+ ENV    T L   LI+NLIGRLW VD ETSK+LT+LF   G PNNC HLL R+KILD+ YEVR I 
Subjt:  SSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILDVRYEVRDIL

Query:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM
        TLCRYAVSE +  Q+K NG++E T L+V++DG++FL NILMQWISIPFRVPKCFFCVRPCIGSE++ATTDARK DGISIPFGFHLSLNLCLQL+NI PN 
Subjt:  TLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQLRNIAPNM

Query:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV
        SV+IT+MYCILYCGLSFQEP+H    NEQKQQ +EAWE +D+VEMQNKLL+YVTES+KNE CIS+ +TSS CRTERVV+AFV+FEP+EKGQGFS CLLDV
Subjt:  SVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDV

Query:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        S FPVG YR+KWYSCCVDSEGCFWSLLPL+ GPLFT+HQLPSAG
Subjt:  SHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

A0A6J1JBG3 uncharacterized protein LOC1114852600.0e+0078.49Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR
        MERNAAA AMEWSIELEKALRSKKP RAVEAILQIGSRLQQWSREPEP++AVYNMFDLVTWEDRLFSNTILLRLADAFK  DKHIR+AVVRVFLSEL SR
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSR

Query:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML
        D+ ++++YQG+LSKARVQNHHELLTRVKVVL GGDPEARALALILLGCWAHFA+ SAQI Y+IL SLLSSH+ EVKASIFAA C S+LADDFA+VFLA+L
Subjt:  DKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAML

Query:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM
        VNIMTS+T+LA++MAGARVFAKLGCSH+MAK AYK GLELASDSSEEDFLVAMLFSLSKLASKS+FISSEQV+LLCSFLS +KSVRVQETSLRCL +IFM
Subjt:  VNIMTSSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFM

Query:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC
        +G C FTNMES+VRILV ALDE ML TTSHCD LRLLRKI+FY+ PNPSFLDANEY KLVKA+ESAA+S V L SLLAVR++VDLSL+ SGKMEVESG  
Subjt:  QGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGC

Query:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA
        SFSLLP QVISLIMDQI SL   LVDLS+ NSEVFQEIKGL NLLLLIVREHSDLW  L EK+C  VELIMNM+E V + QQ +   EGD+KND  LRFA
Subjt:  SFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFA

Query:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK
        FILYGF AIC+GHLGQV  IT+EIF+KVK LV SVC+  LFSSH     SLLLNCKFILSCRITED RI NNDGFPRFTFCE LTEN IFTL+CAKK+LK
Subjt:  FILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLK

Query:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY
        NGDEWPAYKAGRHAACHGSW AATLIFGHL SKV SDFF  WLKSLFQF++ ERKIQLLLLPQYGSGL+NWLE+ETILN FS EEQI  H AGSI+  +Y
Subjt:  NGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMY

Query:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG
        YDKLLEAYQC+CSSGE LK++  + VQA CFQRWFL LRAK+LGTVGSI+KLLLN+            +++TAAIHETV+EF KLSLT ERLS EFDLIG
Subjt:  YDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNL------------SSNTAAIHETVKEFCKLSLTLERLSCEFDLIG

Query:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD
        TTFIGMD ++  VISALALNCSLLAFCTGFAF VPNLA +L+ ENV    T L   L++NLIGRLW VD ETSK+LT+LF+  G PNNCSHLLSR+KILD
Subjt:  TTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLLSRSKILD

Query:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL
        V YEVR I TLCRYAVSE +  Q+K NG++E T  QV++DG++FL NI MQWISIPFRVPKCFFCVRPCIGSE++A TDARK DGISIPFGF LSLNLCL
Subjt:  VRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCL

Query:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ
        QL+NI PNM V+IT+MYCILYCGLSFQE +H    NEQKQQ +EAWE +D+VEMQNKLL+YVTES+KNEACISK +TSSFCRTERVV+AFV+FEP+EKGQ
Subjt:  QLRNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQ

Query:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG
        GFS CLLDVS FPVG YR+KWYSCCVDSEGCFWSLLPL+ GP FTIHQLPSAG
Subjt:  GFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG

SwissProt top hitse value%identityAlignment
Q1RMS6 Integrator complex subunit 71.5e-0726.13Show/hide
Query:  NTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSL
        N+  L+LAD F+ G+  +RL V++V            T++ +  L K  + N  E + R+  V++  DP ARA+ L +LG  A    +     + I  SL
Subjt:  NTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSL

Query:  LSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIF
         S   +EV+A++FAA  FS  + DFA      +  ++   +T + +++    +   +     +A  A ++  +L +       ++  L + + LA+ S+ 
Subjt:  LSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIF

Query:  ISSEQVQLLCSFL--SPQKSVR
         + +Q+QLL  +L   P+K+V+
Subjt:  ISSEQVQLLCSFL--SPQKSVR

Q5ZL91 Integrator complex subunit 71.6e-0926.09Show/hide
Query:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILS
        +EL+K LRS K     EA+++     Q++   P P                +  N+  L+LAD F+ G+  +RL V++V            T++ +  L 
Subjt:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILS

Query:  KARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAI
        K  + N  E + RV  V++  DP ARA+ L +LG  A    +     + I  SL S   +EV+A+IFAA  FS  + DFA      +  ++   +T + +
Subjt:  KARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAI

Query:  RMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR
        ++    +   +    ++A  + ++  +L +       ++  L + + LA+ S+    +QVQLL  +L   P+K+V+
Subjt:  RMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR

Q7TQK1 Integrator complex subunit 71.6e-0925.72Show/hide
Query:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILS
        +EL+K LRS K     EA+++     Q++   P P                +  N+  L+LAD F+ G+  +RL V++V            T++ +  L 
Subjt:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILS

Query:  KARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAI
        K  + N  E + RV  V++  DP ARA+ L +LG  A    +     + I  SL S   +EV+A++FAA  FS  + DFA      +  ++   +T + +
Subjt:  KARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAI

Query:  RMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR
        ++    +   +     +A  A ++  +L +       ++  L + + LA+ S+  + +Q+QLL  +L   P+K+V+
Subjt:  RMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR

Q8JGR7 Integrator complex subunit 72.0e-0926.71Show/hide
Query:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVF-LSELYSRDKARTKKYQGIL
        +EL+K LRS K     EA++      Q++   P P                +  N+  L+LAD F+ G+  +RL V++V  LSE +              
Subjt:  IELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVF-LSELYSRDKARTKKYQGIL

Query:  SKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLA
           ++ N  E + RV  V++  DP ARA+ L +LG  A    +     + I  SL S   +EV+A+IFAA  FS  + DFA      +  ++    T + 
Subjt:  SKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLA

Query:  IRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR
        +++    +   +    ++A  + ++  EL S       L+  L + ++LA+ S+    EQ+ LL  +L   P+K+V+
Subjt:  IRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFL--SPQKSVR

Q9NVH2 Integrator complex subunit 76.6e-0826.13Show/hide
Query:  NTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSL
        N+  L+LAD F+ G+  +RL V++V            T++ +  L K  + N  E + R+  V++  DP ARA+ L +LG  A    +     + I  SL
Subjt:  NTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSL

Query:  LSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIF
         S   +EV+A++FAA  FS  + DFA      +  ++   +T + +++    +   +     +A  A ++  +L +       ++  L + + LA+ S+ 
Subjt:  LSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTS-STTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIF

Query:  ISSEQVQLLCSFL--SPQKSVR
         + +Q+QLL  +L   P+K+V+
Subjt:  ISSEQVQLLCSFL--SPQKSVR

Arabidopsis top hitse value%identityAlignment
AT4G20060.1 ARM repeat superfamily protein1.5e-20939.21Show/hide
Query:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSEL-YS
        ME+ +AA AMEWSI+LEK+LRSK   +AVEAIL+ G +L+QWS+EPE  +AVYN+F LV  ED+LFSNTILLRL DAF  GDK I+LAVVRVF+S    S
Subjt:  MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSEL-YS

Query:  RDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAM
        R K   +     LSK RV NH ELLTRVK V + GD E++ALALIL GCW  FA + A + YL+ SS++S H LE ++++FAA CF E+ADDFA V L M
Subjt:  RDKARTKKYQGILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAM

Query:  LVNIMT-SSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYI
        L +++     T   R+A  RVFAK+GCSH +A RA+K+ ++L  DS +ED LV  L SL+KLAS+S  ++SE  +++  FL   K+   +   LRCL ++
Subjt:  LVNIMT-SSTTLAIRMAGARVFAKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYI

Query:  FMQGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESG
          +G C     E  +  +   L +  L +     AL++ +KI+ Y L      DA+E  +L+   E+A+ S +   S LA+ V+V +  +     E  S 
Subjt:  FMQGACQFTNMESMVRILVIALDEHMLPTTSHCDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESG

Query:  GCSFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEG----DEKND
          S + LP Q++ LIMD++  LG++  DL R+   V  E++ LL +L L+V +HS+L +L+LEK+ L +  I+++ + +  +  + E   G     +K  
Subjt:  GCSFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKGLLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEG----DEKND

Query:  TGLRFAFI--LYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTL
          +R  F+  ++ F  + + +L   D++ +EI+ KVK +   V        HT + ++LLL+   +    + +D+    N G    +    +    I +L
Subjt:  TGLRFAFI--LYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYSLLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTL

Query:  ECAKKMLKNGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQA
        +C+ ++L   + WPAY+AG +AA  G+W+ + +IF  L + V SD   CWLKSL   S  E K QLLL P     LVNWL+    L       ++++  +
Subjt:  ECAKKMLKNGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLLLPQYGSGLVNWLEKETILNAFSIEEQINQHQA

Query:  GSITEAMYYDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLSS---NTAAIHETVKEFC-------KLSLTLERLSC
        G     +    L EAY  + SS   L    I+     CFQ WFL L+ +VL TV  +++ L  L+    N   + E +   C       ++S+ L++L+ 
Subjt:  GSITEAMYYDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLSS---NTAAIHETVKEFC-------KLSLTLERLSC

Query:  EFDLIGTTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLA-ATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLL
        EFD++ T FI +D  SSS+I+ ++L+CS+LAF  G    +P  +    ++   + SG  L   L+++L+ RLW VD    +KL  L        NC HL 
Subjt:  EFDLIGTTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLA-ATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHLL

Query:  SRSKILDVRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLN-ILMQWISIPFRVPKCFFCVRPCIGSEVFA---TTDARKPDGISIP
        SR+++L V  +V+ +L++CR A+S   GLQN+   +++E  +  I    R LL+  +M+W+ IPF +PK FF +RPC+G+E+FA    +  R PD +S+ 
Subjt:  SRSKILDVRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLN-ILMQWISIPFRVPKCFFCVRPCIGSEVFA---TTDARKPDGISIP

Query:  FGFHLSLNLCLQLRNIAP-NMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVE
         GF LSL+LCLQL+NI    + V++ K+YC+LY  L++  P  +G+NN + Q  +  W   D++EM NKL ++  +S K         +  F   +  V 
Subjt:  FGFHLSLNLCLQLRNIAP-NMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVE

Query:  AFVQFEPDEKGQGFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTI
          VQFEP+E+GQGFS CLLDVS FPVG Y++KW SCCVD  G +W+LLPLN  P+FT+
Subjt:  AFVQFEPDEKGQGFSKCLLDVSHFPVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGGAATGCTGCGGCTCGTGCGATGGAATGGAGTATTGAGCTGGAGAAGGCTCTTCGTTCCAAGAAACCAGATCGAGCTGTTGAAGCTATACTTCAGATTGGCTC
TAGACTTCAGCAGTGGAGTAGAGAGCCAGAACCAGATGTAGCAGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGACTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCGTTTAAGTTTGGTGACAAGCATATTAGACTTGCAGTTGTTAGGGTATTCTTATCCGAGCTCTACAGCCGTGATAAGGCAAGAACTAAGAAATACCAAGGG
ATTCTTTCAAAGGCCAGGGTGCAAAACCACCATGAACTGCTGACTCGAGTAAAGGTTGTTCTCAATGGAGGGGATCCTGAGGCTAGAGCACTAGCTTTGATTCTATTAGG
ATGTTGGGCACACTTTGCAAAGGACAGTGCCCAGATATGTTATTTGATACTTTCTAGTCTGTTATCTTCTCATCTTCTGGAGGTAAAAGCATCTATATTTGCTGCAACAT
GCTTCAGTGAGTTAGCAGATGACTTTGCACAAGTTTTCTTAGCGATGTTGGTTAATATAATGACTTCTTCTACAACTTTGGCCATCAGAATGGCTGGAGCTCGAGTGTTT
GCAAAATTGGGATGCTCACATGCAATGGCCAAAAGAGCCTACAAGGTTGGACTCGAGCTTGCCTCAGACTCTAGTGAAGAGGATTTTTTGGTTGCAATGTTATTTTCACT
ATCCAAACTTGCTTCCAAGTCAATATTTATTAGTTCTGAGCAGGTTCAATTGCTTTGCTCATTTCTTAGCCCCCAAAAGTCTGTGCGTGTGCAAGAAACATCTTTAAGAT
GTTTGTGTTATATTTTCATGCAAGGAGCATGCCAGTTTACTAATATGGAATCTATGGTCAGAATTTTAGTCATTGCACTAGATGAACACATGCTTCCAACTACTTCACAT
TGTGATGCTCTACGACTATTGCGAAAGATTCTTTTCTATGTGCTGCCAAATCCCTCTTTTTTGGATGCAAATGAATACTCTAAACTGGTAAAAGCTTTGGAGAGTGCAGC
CCGATCTCCAGTGAAGTTAAAGAGCCTCCTTGCTGTCCGTGTAATGGTGGATTTATCATTAAAACATTCTGGAAAAATGGAAGTAGAATCAGGAGGTTGTTCATTTTCTT
TGTTGCCATCACAGGTTATTTCACTAATCATGGATCAAATTGAATCATTGGGAAAGATGTTAGTAGATCTTTCTCGGTCAAATTCTGAAGTGTTTCAAGAAATTAAAGGG
CTGTTAAACCTTCTCCTGCTCATTGTCAGAGAACATTCGGATCTGTGGATTTTGCTTTTGGAGAAATTATGCTTAGCTGTTGAATTAATTATGAATATGTATGAAGATGT
CTGCAATAGTCAACAATCAGAGGAGAAATTTGAAGGAGATGAAAAGAATGATACTGGCTTGAGATTTGCATTCATCCTATATGGATTTGCGGCAATCTGCATTGGTCATC
TAGGTCAAGTTGACTCCATCACCACTGAAATATTTAACAAGGTGAAACCATTGGTAAACAGTGTATGCAAAAGTCGTTTGTTCAGTAGCCATACTTGCATAACCTATTCC
TTGCTATTGAACTGTAAATTCATTTTGAGTTGTAGGATAACTGAGGATTCTAGGATTTGCAATAATGATGGGTTTCCACGTTTCACCTTTTGTGAATATTTGACTGAAAA
CATGATTTTTACGCTCGAGTGTGCGAAGAAGATGCTAAAAAATGGGGATGAATGGCCCGCTTACAAAGCTGGGAGACATGCAGCATGTCACGGATCATGGCTTGCTGCTA
CCTTAATTTTTGGCCATTTAATTTCAAAGGTTCACTCTGATTTCTTCCATTGCTGGCTGAAATCTTTGTTTCAGTTTTCTCTTGGGGAAAGAAAAATCCAGTTACTACTG
TTACCACAATATGGTTCTGGCTTGGTAAACTGGTTGGAGAAGGAAACAATTCTAAACGCATTTTCCATTGAAGAACAAATAAACCAACATCAGGCTGGGAGTATCACTGA
GGCCATGTACTATGACAAGCTTTTGGAGGCCTACCAGTGTGTTTGCTCTTCAGGTGAGACCTTAAAGGCTGCTGCCATCTCACTGGTTCAAGCATCTTGTTTCCAGAGAT
GGTTTTTGTTTTTAAGAGCTAAGGTATTAGGAACTGTGGGGAGCATACTGAAGCTGTTGCTCAATCTTTCGAGCAATACTGCTGCTATCCACGAAACTGTGAAAGAATTT
TGTAAATTATCTTTAACATTAGAGAGATTGTCCTGTGAATTTGACCTGATTGGAACAACTTTCATTGGAATGGACATCAAGAGTTCGAGCGTTATTTCAGCCCTTGCACT
GAATTGCTCTTTGTTGGCCTTTTGTACTGGTTTTGCCTTTCATGTTCCAAACTTGGCTGCAACTCTTATGATTGAAAATGTGGCTGGTTCCGGGACTAAATTACATGTAG
AACTCATTCAAAATCTAATTGGTAGGCTGTGGTTGGTAGACAGTGAAACGAGTAAAAAACTTACACAGCTTTTCGAGATTCCTGGAAGACCAAATAACTGTTCGCATTTG
CTTTCAAGAAGCAAGATATTAGATGTGAGGTATGAAGTAAGAGATATCTTGACTCTGTGTAGATATGCTGTCTCTGAAGCTGTCGGTTTGCAAAACAAGGTGAACGGACT
GAATGAGGAGACATTTCTCCAGGTCATCCAGGATGGCATACGGTTTTTATTGAACATTCTTATGCAGTGGATAAGCATTCCATTCCGAGTGCCCAAGTGCTTCTTTTGTG
TAAGGCCTTGCATTGGCTCTGAAGTCTTTGCCACTACAGATGCTCGTAAACCAGATGGAATATCTATCCCATTCGGCTTCCACCTATCACTAAATCTTTGTCTTCAACTG
AGAAACATTGCGCCGAATATGTCAGTTCAAATAACCAAGATGTACTGCATTTTGTATTGTGGATTATCCTTTCAGGAACCAAGGCACAATGGCAAGAACAATGAGCAAAA
ACAGCAAGGACATGAAGCTTGGGAAAGCAATGACATGGTAGAAATGCAAAACAAACTGTTATATTACGTGACCGAGTCGACCAAAAATGAGGCTTGTATCAGCAAGTGCA
GAACATCGAGCTTTTGCAGAACCGAGAGGGTAGTCGAAGCATTTGTGCAGTTTGAACCAGATGAAAAAGGGCAAGGATTCTCAAAGTGCTTGCTTGATGTGTCTCATTTT
CCTGTAGGTTGTTATAGAATGAAATGGTATAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAGCCTACTCCCTTTGAATTCTGGACCATTATTTACTATCCACCAACT
TCCATCAGCTGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGGAATGCTGCGGCTCGTGCGATGGAATGGAGTATTGAGCTGGAGAAGGCTCTTCGTTCCAAGAAACCAGATCGAGCTGTTGAAGCTATACTTCAGATTGGCTC
TAGACTTCAGCAGTGGAGTAGAGAGCCAGAACCAGATGTAGCAGTATATAATATGTTTGACCTTGTTACTTGGGAGGATAGACTATTTTCCAACACTATTCTCCTACGGC
TTGCTGATGCGTTTAAGTTTGGTGACAAGCATATTAGACTTGCAGTTGTTAGGGTATTCTTATCCGAGCTCTACAGCCGTGATAAGGCAAGAACTAAGAAATACCAAGGG
ATTCTTTCAAAGGCCAGGGTGCAAAACCACCATGAACTGCTGACTCGAGTAAAGGTTGTTCTCAATGGAGGGGATCCTGAGGCTAGAGCACTAGCTTTGATTCTATTAGG
ATGTTGGGCACACTTTGCAAAGGACAGTGCCCAGATATGTTATTTGATACTTTCTAGTCTGTTATCTTCTCATCTTCTGGAGGTAAAAGCATCTATATTTGCTGCAACAT
GCTTCAGTGAGTTAGCAGATGACTTTGCACAAGTTTTCTTAGCGATGTTGGTTAATATAATGACTTCTTCTACAACTTTGGCCATCAGAATGGCTGGAGCTCGAGTGTTT
GCAAAATTGGGATGCTCACATGCAATGGCCAAAAGAGCCTACAAGGTTGGACTCGAGCTTGCCTCAGACTCTAGTGAAGAGGATTTTTTGGTTGCAATGTTATTTTCACT
ATCCAAACTTGCTTCCAAGTCAATATTTATTAGTTCTGAGCAGGTTCAATTGCTTTGCTCATTTCTTAGCCCCCAAAAGTCTGTGCGTGTGCAAGAAACATCTTTAAGAT
GTTTGTGTTATATTTTCATGCAAGGAGCATGCCAGTTTACTAATATGGAATCTATGGTCAGAATTTTAGTCATTGCACTAGATGAACACATGCTTCCAACTACTTCACAT
TGTGATGCTCTACGACTATTGCGAAAGATTCTTTTCTATGTGCTGCCAAATCCCTCTTTTTTGGATGCAAATGAATACTCTAAACTGGTAAAAGCTTTGGAGAGTGCAGC
CCGATCTCCAGTGAAGTTAAAGAGCCTCCTTGCTGTCCGTGTAATGGTGGATTTATCATTAAAACATTCTGGAAAAATGGAAGTAGAATCAGGAGGTTGTTCATTTTCTT
TGTTGCCATCACAGGTTATTTCACTAATCATGGATCAAATTGAATCATTGGGAAAGATGTTAGTAGATCTTTCTCGGTCAAATTCTGAAGTGTTTCAAGAAATTAAAGGG
CTGTTAAACCTTCTCCTGCTCATTGTCAGAGAACATTCGGATCTGTGGATTTTGCTTTTGGAGAAATTATGCTTAGCTGTTGAATTAATTATGAATATGTATGAAGATGT
CTGCAATAGTCAACAATCAGAGGAGAAATTTGAAGGAGATGAAAAGAATGATACTGGCTTGAGATTTGCATTCATCCTATATGGATTTGCGGCAATCTGCATTGGTCATC
TAGGTCAAGTTGACTCCATCACCACTGAAATATTTAACAAGGTGAAACCATTGGTAAACAGTGTATGCAAAAGTCGTTTGTTCAGTAGCCATACTTGCATAACCTATTCC
TTGCTATTGAACTGTAAATTCATTTTGAGTTGTAGGATAACTGAGGATTCTAGGATTTGCAATAATGATGGGTTTCCACGTTTCACCTTTTGTGAATATTTGACTGAAAA
CATGATTTTTACGCTCGAGTGTGCGAAGAAGATGCTAAAAAATGGGGATGAATGGCCCGCTTACAAAGCTGGGAGACATGCAGCATGTCACGGATCATGGCTTGCTGCTA
CCTTAATTTTTGGCCATTTAATTTCAAAGGTTCACTCTGATTTCTTCCATTGCTGGCTGAAATCTTTGTTTCAGTTTTCTCTTGGGGAAAGAAAAATCCAGTTACTACTG
TTACCACAATATGGTTCTGGCTTGGTAAACTGGTTGGAGAAGGAAACAATTCTAAACGCATTTTCCATTGAAGAACAAATAAACCAACATCAGGCTGGGAGTATCACTGA
GGCCATGTACTATGACAAGCTTTTGGAGGCCTACCAGTGTGTTTGCTCTTCAGGTGAGACCTTAAAGGCTGCTGCCATCTCACTGGTTCAAGCATCTTGTTTCCAGAGAT
GGTTTTTGTTTTTAAGAGCTAAGGTATTAGGAACTGTGGGGAGCATACTGAAGCTGTTGCTCAATCTTTCGAGCAATACTGCTGCTATCCACGAAACTGTGAAAGAATTT
TGTAAATTATCTTTAACATTAGAGAGATTGTCCTGTGAATTTGACCTGATTGGAACAACTTTCATTGGAATGGACATCAAGAGTTCGAGCGTTATTTCAGCCCTTGCACT
GAATTGCTCTTTGTTGGCCTTTTGTACTGGTTTTGCCTTTCATGTTCCAAACTTGGCTGCAACTCTTATGATTGAAAATGTGGCTGGTTCCGGGACTAAATTACATGTAG
AACTCATTCAAAATCTAATTGGTAGGCTGTGGTTGGTAGACAGTGAAACGAGTAAAAAACTTACACAGCTTTTCGAGATTCCTGGAAGACCAAATAACTGTTCGCATTTG
CTTTCAAGAAGCAAGATATTAGATGTGAGGTATGAAGTAAGAGATATCTTGACTCTGTGTAGATATGCTGTCTCTGAAGCTGTCGGTTTGCAAAACAAGGTGAACGGACT
GAATGAGGAGACATTTCTCCAGGTCATCCAGGATGGCATACGGTTTTTATTGAACATTCTTATGCAGTGGATAAGCATTCCATTCCGAGTGCCCAAGTGCTTCTTTTGTG
TAAGGCCTTGCATTGGCTCTGAAGTCTTTGCCACTACAGATGCTCGTAAACCAGATGGAATATCTATCCCATTCGGCTTCCACCTATCACTAAATCTTTGTCTTCAACTG
AGAAACATTGCGCCGAATATGTCAGTTCAAATAACCAAGATGTACTGCATTTTGTATTGTGGATTATCCTTTCAGGAACCAAGGCACAATGGCAAGAACAATGAGCAAAA
ACAGCAAGGACATGAAGCTTGGGAAAGCAATGACATGGTAGAAATGCAAAACAAACTGTTATATTACGTGACCGAGTCGACCAAAAATGAGGCTTGTATCAGCAAGTGCA
GAACATCGAGCTTTTGCAGAACCGAGAGGGTAGTCGAAGCATTTGTGCAGTTTGAACCAGATGAAAAAGGGCAAGGATTCTCAAAGTGCTTGCTTGATGTGTCTCATTTT
CCTGTAGGTTGTTATAGAATGAAATGGTATAGCTGTTGTGTTGATAGTGAGGGGTGTTTTTGGAGCCTACTCCCTTTGAATTCTGGACCATTATTTACTATCCACCAACT
TCCATCAGCTGGGTGA
Protein sequenceShow/hide protein sequence
MERNAAARAMEWSIELEKALRSKKPDRAVEAILQIGSRLQQWSREPEPDVAVYNMFDLVTWEDRLFSNTILLRLADAFKFGDKHIRLAVVRVFLSELYSRDKARTKKYQG
ILSKARVQNHHELLTRVKVVLNGGDPEARALALILLGCWAHFAKDSAQICYLILSSLLSSHLLEVKASIFAATCFSELADDFAQVFLAMLVNIMTSSTTLAIRMAGARVF
AKLGCSHAMAKRAYKVGLELASDSSEEDFLVAMLFSLSKLASKSIFISSEQVQLLCSFLSPQKSVRVQETSLRCLCYIFMQGACQFTNMESMVRILVIALDEHMLPTTSH
CDALRLLRKILFYVLPNPSFLDANEYSKLVKALESAARSPVKLKSLLAVRVMVDLSLKHSGKMEVESGGCSFSLLPSQVISLIMDQIESLGKMLVDLSRSNSEVFQEIKG
LLNLLLLIVREHSDLWILLLEKLCLAVELIMNMYEDVCNSQQSEEKFEGDEKNDTGLRFAFILYGFAAICIGHLGQVDSITTEIFNKVKPLVNSVCKSRLFSSHTCITYS
LLLNCKFILSCRITEDSRICNNDGFPRFTFCEYLTENMIFTLECAKKMLKNGDEWPAYKAGRHAACHGSWLAATLIFGHLISKVHSDFFHCWLKSLFQFSLGERKIQLLL
LPQYGSGLVNWLEKETILNAFSIEEQINQHQAGSITEAMYYDKLLEAYQCVCSSGETLKAAAISLVQASCFQRWFLFLRAKVLGTVGSILKLLLNLSSNTAAIHETVKEF
CKLSLTLERLSCEFDLIGTTFIGMDIKSSSVISALALNCSLLAFCTGFAFHVPNLAATLMIENVAGSGTKLHVELIQNLIGRLWLVDSETSKKLTQLFEIPGRPNNCSHL
LSRSKILDVRYEVRDILTLCRYAVSEAVGLQNKVNGLNEETFLQVIQDGIRFLLNILMQWISIPFRVPKCFFCVRPCIGSEVFATTDARKPDGISIPFGFHLSLNLCLQL
RNIAPNMSVQITKMYCILYCGLSFQEPRHNGKNNEQKQQGHEAWESNDMVEMQNKLLYYVTESTKNEACISKCRTSSFCRTERVVEAFVQFEPDEKGQGFSKCLLDVSHF
PVGCYRMKWYSCCVDSEGCFWSLLPLNSGPLFTIHQLPSAG