; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021084 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021084
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold290:1101849..1104161
RNA-Seq ExpressionMS021084
SyntenyMS021084
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146104.1 uncharacterized protein LOC101206874 [Cucumis sativus]2.0e-23582.28Show/hide
Query:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL
        S+   +    YRY+MIDLFL E  FNDE+DV S KLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQL
Subjt:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL

Query:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV
        LQM FEKR R AGILIAKRSYIME FFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV
Subjt:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV

Query:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS
        +NFI++VPEFWSSNEFAESLKDGEIL LDT+FFVKYFVDLMLKDD KDVWE INE+L  ESFSSLC+HLL+TLEEADFC FLKMLCKLL PRIETKD G+
Subjt:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS

Query:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE
        SSF+ E+IL++YGD ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAIV +IS+ISSN H L PLLKEC+ R+KTIE+IKWLGLQSWVL YRMSE
Subjt:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE

Query:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID
        ECQTPELWESLF DNGIGFRKSNEY LLDHSC SEDDGFEL + A A+  KR+K GKGRKRRK NFD     D+ELL FD KNDR+DLKLNTGSWLLS D
Subjt:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID

Query:  DYTVPWNA
        DYTVPWNA
Subjt:  DYTVPWNA

XP_008448632.1 PREDICTED: uncharacterized protein LOC103490747 isoform X2 [Cucumis melo]1.2e-23581.5Show/hide
Query:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL
        S+   +    YRY+MIDLF+ E  FNDE+DV SAKLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQL
Subjt:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL

Query:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV
        LQM FEKR R AGILIAKRSYIME FFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV
Subjt:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV

Query:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS
        +NFI++VPEFWSSNEFAESLKDGEIL LDT+FFVK+F+DLMLKDDSKDVWE INE+LM ESFSSLC+HLL+TLE+ADFC FLK+LCKLL PRIETKD G+
Subjt:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS

Query:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE
        SSF+ E+IL++YGD ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAI+ +ISAISSN+H L PLLKEC+ R+KTIE+IKWLGLQSWVL YR SE
Subjt:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE

Query:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID
        ECQTPELWESLF DNGIGFRKSNEY LLDHSC SEDDGFE C+ A AK  KR+KG KGRKRRKRNFD     D+ELL  D +NDR+DLKLNTGSW LS D
Subjt:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID

Query:  DYTVPWNA
        DYTVPWNA
Subjt:  DYTVPWNA

XP_022145467.1 uncharacterized protein LOC111014910 isoform X1 [Momordica charantia]3.1e-28199.59Show/hide
Query:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI
        MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI
Subjt:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI

Query:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN
        LIAKRSYIME FFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN
Subjt:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN

Query:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD
        EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD
Subjt:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD

Query:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN
        CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN
Subjt:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN

Query:  GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV
        GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA+
Subjt:  GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV

XP_022145468.1 uncharacterized protein LOC111014910 isoform X2 [Momordica charantia]2.8e-25099.54Show/hide
Query:  MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE
        MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIME FFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE
Subjt:  MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE

Query:  LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL
        LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL
Subjt:  LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL

Query:  ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL
        ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL
Subjt:  ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL

Query:  LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL
        LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL
Subjt:  LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL

Query:  LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV
        LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA+
Subjt:  LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV

XP_038891380.1 uncharacterized protein LOC120080808 [Benincasa hispida]1.2e-23283.4Show/hide
Query:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI
        MIDLFL EP FNDE+DV SAKLRISLLS+LESVL KLL SGGRSEVRLWL+N+IAS+TSISPQHQRDLF+T LR KP KWA ASQLLQM FEKR R AGI
Subjt:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI

Query:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN
        LIAKRSYIME FFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV+NFIE+VPEFWSSN
Subjt:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN

Query:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD
        EFAESLKDGEIL LDT+FFVKYFVDLMLKDD KDVWE INE+LM ESFSSL +HLL+TLEEADFC FLKMLCKLL PRIETKD G+ SF  E+ILS+YGD
Subjt:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD

Query:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD
         ESIDQILLLNAV+NQGRQ+LRLLRDED EE+ DEIKAIV +ISAISSNT SL PLL EC+ R++TIE+IKWLGLQSWVL YRMSEECQTPELWESLF D
Subjt:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD

Query:  NGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFDF----DNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA
        NGIGF+KSNEY+LLDHS  SEDDGFE C+ A AK  +R+K GKGRKRRKR+FD+    D+ELL FD KNDR+DLKLNTGSWLLS DDYTVPWNA
Subjt:  NGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFDF----DNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA

TrEMBL top hitse value%identityAlignment
A0A0A0L6D1 Uncharacterized protein9.6e-23682.28Show/hide
Query:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL
        S+   +    YRY+MIDLFL E  FNDE+DV S KLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQL
Subjt:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL

Query:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV
        LQM FEKR R AGILIAKRSYIME FFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV
Subjt:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV

Query:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS
        +NFI++VPEFWSSNEFAESLKDGEIL LDT+FFVKYFVDLMLKDD KDVWE INE+L  ESFSSLC+HLL+TLEEADFC FLKMLCKLL PRIETKD G+
Subjt:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS

Query:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE
        SSF+ E+IL++YGD ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAIV +IS+ISSN H L PLLKEC+ R+KTIE+IKWLGLQSWVL YRMSE
Subjt:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE

Query:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID
        ECQTPELWESLF DNGIGFRKSNEY LLDHSC SEDDGFEL + A A+  KR+K GKGRKRRK NFD     D+ELL FD KNDR+DLKLNTGSWLLS D
Subjt:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRK-GKGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID

Query:  DYTVPWNA
        DYTVPWNA
Subjt:  DYTVPWNA

A0A1S3BK58 uncharacterized protein LOC103490747 isoform X11.3e-23278.71Show/hide
Query:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL
        S+   +    YRY+MIDLF+ E  FNDE+DV SAKLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQL
Subjt:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL

Query:  LQMFFEKRPRGAGILIAKRSYIMENFFE------------------GNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPA
        LQM FEKR R AGILIAKRSYIME FFE                  GN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPA
Subjt:  LQMFFEKRPRGAGILIAKRSYIMENFFE------------------GNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPA

Query:  VVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFL
        VVATKPHYFLDLDV QTV+NFI++VPEFWSSNEFAESLKDGEIL LDT+FFVK+F+DLMLKDDSKDVWE INE+LM ESFSSLC+HLL+TLE+ADFC FL
Subjt:  VVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFL

Query:  KMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIE
        K+LCKLL PRIETKD G+SSF+ E+IL++YGD ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAI+ +ISAISSN+H L PLLKEC+ R+KTIE
Subjt:  KMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIE

Query:  VIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTK
        +IKWLGLQSWVL YR SEECQTPELWESLF DNGIGFRKSNEY LLDHSC SEDDGFE C+ A AK  KR+KG KGRKRRKRNFD     D+ELL  D +
Subjt:  VIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTK

Query:  NDRIDLKLNTGSWLLSIDDYTVPWNA
        NDR+DLKLNTGSW LS DDYTVPWNA
Subjt:  NDRIDLKLNTGSWLLSIDDYTVPWNA

A0A1S3BKS3 uncharacterized protein LOC103490747 isoform X25.6e-23681.5Show/hide
Query:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL
        S+   +    YRY+MIDLF+ E  FNDE+DV SAKLRISLLS LESVL KLL  GGRSEVRLWLSNTIAS+TSISPQHQRDLF+T LR KPLKWA ASQL
Subjt:  SKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQL

Query:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV
        LQM FEKR R AGILIAKRSYIME FFEGN RRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDV QTV
Subjt:  LQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTV

Query:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS
        +NFI++VPEFWSSNEFAESLKDGEIL LDT+FFVK+F+DLMLKDDSKDVWE INE+LM ESFSSLC+HLL+TLE+ADFC FLK+LCKLL PRIETKD G+
Subjt:  RNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGS

Query:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE
        SSF+ E+IL++YGD ESIDQILLLNAVINQGRQLLRLLRDED EE+ DEIKAI+ +ISAISSN+H L PLLKEC+ R+KTIE+IKWLGLQSWVL YR SE
Subjt:  SSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECN-RRKTIEVIKWLGLQSWVLQYRMSE

Query:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID
        ECQTPELWESLF DNGIGFRKSNEY LLDHSC SEDDGFE C+ A AK  KR+KG KGRKRRKRNFD     D+ELL  D +NDR+DLKLNTGSW LS D
Subjt:  ECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKG-KGRKRRKRNFD----FDNELLGFDTKNDRIDLKLNTGSWLLSID

Query:  DYTVPWNA
        DYTVPWNA
Subjt:  DYTVPWNA

A0A6J1CW06 uncharacterized protein LOC111014910 isoform X21.4e-25099.54Show/hide
Query:  MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE
        MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIME FFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE
Subjt:  MTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEE

Query:  LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL
        LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL
Subjt:  LEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLL

Query:  ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL
        ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL
Subjt:  ITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPL

Query:  LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL
        LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL
Subjt:  LKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNEL

Query:  LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV
        LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA+
Subjt:  LGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV

A0A6J1CWP0 uncharacterized protein LOC111014910 isoform X11.5e-28199.59Show/hide
Query:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI
        MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI
Subjt:  MIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAGI

Query:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN
        LIAKRSYIME FFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN
Subjt:  LIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSN

Query:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD
        EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD
Subjt:  EFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGD

Query:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN
        CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN
Subjt:  CESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADN

Query:  GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV
        GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA+
Subjt:  GIGFRKSNEYALLDHSCFSEDDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48340.1 unknown protein3.1e-14652.29Show/hide
Query:  MIDLFLAEPIFNDEKDVDS-AKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAG
        M++LFL+EP +ND+    S   + + LL++L S ++ L+  G RSE RLWL + ++++ SISP  Q ++F+  LR+KP K    SQ+L M FEKRPR  G
Subjt:  MIDLFLAEPIFNDEKDVDS-AKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAG

Query:  ILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSS
         L+AKRSYI+E FFEGN +RI +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV++T++NF+++VPEFWSS
Subjt:  ILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSS

Query:  NEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYG
        NEFAESLKDG+IL LDT+FF+  F+  M ++D  DVW+A+ E+L +ESFSSL +HLLITLEE D C FL++L     P IE+ DSG SS    ++LSRY 
Subjt:  NEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYG

Query:  DCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD
        D ESID++LLL+++INQGRQLLRL+RDE+  +E + +K  ++EI     N  S S +L+E ++ K I+VIK LGL SW + +R+SEECQTP+ WE LF +
Subjt:  DCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD

Query:  NGIGFRKSNEYALLDHSCFSE--DDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAVSTLIISPP
        NGI FR+S++++LL ++ FSE  +   +     S K  KR K K +K++KR FD D++       ++ +DL   + SWLLS D ++  W +V        
Subjt:  NGIGFRKSNEYALLDHSCFSE--DDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAVSTLIISPP

Query:  FCI
        +C+
Subjt:  FCI

AT5G48340.2 unknown protein1.2e-14553.16Show/hide
Query:  MIDLFLAEPIFNDEKDVDS-AKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAG
        M++LFL+EP +ND+    S   + + LL++L S ++ L+  G RSE RLWL + ++++ SISP  Q ++F+  LR+KP K    SQ+L M FEKRPR  G
Subjt:  MIDLFLAEPIFNDEKDVDS-AKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRPRGAG

Query:  ILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSS
         L+AKRSYI+E FFEGN +RI +WFS FA +G SDH +GAKALAQFAF NRDICWEELEW+GKHGQSPAVVATKPHY LDLDV++T++NF+++VPEFWSS
Subjt:  ILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSS

Query:  NEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYG
        NEFAESLKDG+IL LDT+FF+  F+  M ++D  DVW+A+ E+L +ESFSSL +HLLITLEE D C FL++L     P IE+ DSG SS    ++LSRY 
Subjt:  NEFAESLKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYG

Query:  DCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD
        D ESID++LLL+++INQGRQLLRL+RDE+  +E + +K  ++EI     N  S S +L+E ++ K I+VIK LGL SW + +R+SEECQTP+ WE LF +
Subjt:  DCESIDQILLLNAVINQGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFAD

Query:  NGIGFRKSNEYALLDHSCFSE--DDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA
        NGI FR+S++++LL ++ FSE  +   +     S K  KR K K +K++KR FD D++       ++ +DL   + SWLLS D ++  W +
Subjt:  NGIGFRKSNEYALLDHSCFSE--DDGFELCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACAAGCAAGCATTCTGAGTGGAGACATACGTATTATAGGTATAAAATGATTGATCTATTTCTGGCAGAGCCCATATTCAACGATGAAAAGGATGTTGACTCTGCCAAGTT
GAGAATCTCTCTGTTAAGTAGATTGGAATCTGTTTTACGGAAATTGCTGGTCTCTGGAGGACGGTCCGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGTATGACAT
CGATCAGTCCCCAGCATCAGCGGGACCTGTTTGTGACCTTCCTGAGAACGAAGCCACTGAAGTGGGCCTTAGCATCTCAACTACTACAAATGTTCTTTGAAAAGAGACCA
CGAGGGGCAGGGATCCTCATTGCCAAGAGAAGCTACATAATGGAAAATTTTTTCGAAGGAAATTCGAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACGAATGGTGC
ATCAGATCATGGGAAAGGAGCCAAGGCCCTGGCACAGTTCGCTTTTGTAAATCGGGACATTTGCTGGGAGGAGCTTGAGTGGAAGGGGAAACATGGGCAGTCGCCTGCAG
TGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCAACAAACTGTGAGGAATTTCATTGAGCATGTACCCGAGTTTTGGTCTTCCAATGAGTTTGCTGAGTCA
CTAAAAGATGGTGAAATTTTGTTACTCGATACGGAATTCTTTGTGAAATATTTTGTTGATCTGATGCTTAAAGATGATTCAAAAGATGTTTGGGAAGCCATTAATGAGTA
CCTAATGCAGGAGTCATTTTCTTCACTATGTCGACACCTCCTTATTACTCTTGAAGAGGCTGATTTCTGCTACTTTCTGAAAATGCTATGTAAGCTCCTCAGCCCTAGAA
TAGAAACCAAGGATTCCGGTAGCTCATCTTTTGTGTCTGAGATCATACTTTCTAGATATGGTGACTGTGAATCTATTGATCAGATTTTACTATTAAATGCTGTTATTAAT
CAAGGACGCCAACTTCTACGACTTTTACGTGATGAAGATGCTGAGGAAGAATGGGATGAAATCAAAGCCATTGTCTCAGAGATTTCAGCAATCTCAAGCAACACTCATAG
CTTATCCCCACTATTGAAAGAGTGTAACAGGAGAAAGACCATAGAGGTGATAAAGTGGCTAGGGCTTCAGTCTTGGGTTCTTCAGTATAGAATGTCAGAGGAATGTCAGA
CTCCTGAGTTATGGGAATCCTTGTTTGCTGATAATGGTATAGGCTTCCGGAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTCTCAGAAGATGATGGTTTTGAA
CTGTGTGATACAGCATCGGCTAAACTTATGAAGCGAAGAAAGGGAAAAGGTAGAAAGAGAAGAAAAAGGAACTTTGACTTTGACAATGAGCTGTTAGGCTTTGATACTAA
AAATGATAGGATTGATTTGAAGTTGAACACTGGAAGTTGGTTACTTTCCATTGATGACTATACTGTACCATGGAATGCTGTAAGTACTCTAATCATCAGCCCACCTTTCT
GTATC
mRNA sequenceShow/hide mRNA sequence
ACAAGCAAGCATTCTGAGTGGAGACATACGTATTATAGGTATAAAATGATTGATCTATTTCTGGCAGAGCCCATATTCAACGATGAAAAGGATGTTGACTCTGCCAAGTT
GAGAATCTCTCTGTTAAGTAGATTGGAATCTGTTTTACGGAAATTGCTGGTCTCTGGAGGACGGTCCGAGGTCCGATTATGGCTTTCTAATACTATAGCTAGTATGACAT
CGATCAGTCCCCAGCATCAGCGGGACCTGTTTGTGACCTTCCTGAGAACGAAGCCACTGAAGTGGGCCTTAGCATCTCAACTACTACAAATGTTCTTTGAAAAGAGACCA
CGAGGGGCAGGGATCCTCATTGCCAAGAGAAGCTACATAATGGAAAATTTTTTCGAAGGAAATTCGAGACGAATATCTCAGTGGTTTTCCAATTTTGCTACGAATGGTGC
ATCAGATCATGGGAAAGGAGCCAAGGCCCTGGCACAGTTCGCTTTTGTAAATCGGGACATTTGCTGGGAGGAGCTTGAGTGGAAGGGGAAACATGGGCAGTCGCCTGCAG
TGGTTGCAACAAAGCCCCATTATTTTCTTGATCTGGATGTGCAACAAACTGTGAGGAATTTCATTGAGCATGTACCCGAGTTTTGGTCTTCCAATGAGTTTGCTGAGTCA
CTAAAAGATGGTGAAATTTTGTTACTCGATACGGAATTCTTTGTGAAATATTTTGTTGATCTGATGCTTAAAGATGATTCAAAAGATGTTTGGGAAGCCATTAATGAGTA
CCTAATGCAGGAGTCATTTTCTTCACTATGTCGACACCTCCTTATTACTCTTGAAGAGGCTGATTTCTGCTACTTTCTGAAAATGCTATGTAAGCTCCTCAGCCCTAGAA
TAGAAACCAAGGATTCCGGTAGCTCATCTTTTGTGTCTGAGATCATACTTTCTAGATATGGTGACTGTGAATCTATTGATCAGATTTTACTATTAAATGCTGTTATTAAT
CAAGGACGCCAACTTCTACGACTTTTACGTGATGAAGATGCTGAGGAAGAATGGGATGAAATCAAAGCCATTGTCTCAGAGATTTCAGCAATCTCAAGCAACACTCATAG
CTTATCCCCACTATTGAAAGAGTGTAACAGGAGAAAGACCATAGAGGTGATAAAGTGGCTAGGGCTTCAGTCTTGGGTTCTTCAGTATAGAATGTCAGAGGAATGTCAGA
CTCCTGAGTTATGGGAATCCTTGTTTGCTGATAATGGTATAGGCTTCCGGAAATCTAATGAATATGCGTTGTTAGATCACAGTTGCTTCTCAGAAGATGATGGTTTTGAA
CTGTGTGATACAGCATCGGCTAAACTTATGAAGCGAAGAAAGGGAAAAGGTAGAAAGAGAAGAAAAAGGAACTTTGACTTTGACAATGAGCTGTTAGGCTTTGATACTAA
AAATGATAGGATTGATTTGAAGTTGAACACTGGAAGTTGGTTACTTTCCATTGATGACTATACTGTACCATGGAATGCTGTAAGTACTCTAATCATCAGCCCACCTTTCT
GTATC
Protein sequenceShow/hide protein sequence
TSKHSEWRHTYYRYKMIDLFLAEPIFNDEKDVDSAKLRISLLSRLESVLRKLLVSGGRSEVRLWLSNTIASMTSISPQHQRDLFVTFLRTKPLKWALASQLLQMFFEKRP
RGAGILIAKRSYIMENFFEGNSRRISQWFSNFATNGASDHGKGAKALAQFAFVNRDICWEELEWKGKHGQSPAVVATKPHYFLDLDVQQTVRNFIEHVPEFWSSNEFAES
LKDGEILLLDTEFFVKYFVDLMLKDDSKDVWEAINEYLMQESFSSLCRHLLITLEEADFCYFLKMLCKLLSPRIETKDSGSSSFVSEIILSRYGDCESIDQILLLNAVIN
QGRQLLRLLRDEDAEEEWDEIKAIVSEISAISSNTHSLSPLLKECNRRKTIEVIKWLGLQSWVLQYRMSEECQTPELWESLFADNGIGFRKSNEYALLDHSCFSEDDGFE
LCDTASAKLMKRRKGKGRKRRKRNFDFDNELLGFDTKNDRIDLKLNTGSWLLSIDDYTVPWNAVSTLIISPPFCI