; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017015 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017015
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold197:655348..657537
RNA-Seq ExpressionMS017015
SyntenyMS017015
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595440.1 hypothetical protein SDJN03_11993, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.92Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GCS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGR+DQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMALGILVITLI+NVC+E+GTGVIYV+MKE +SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNGQSDYKWSI+ IL++QC AV VGT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCPKLGK GYKKEFTLENYWI+YLVE+KQCPL+I+V+NR  RKL HSAKNKFLD CIILQT IVF SKVIRLISIF V GIFSFCDCFKSLK KL 
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+SGSEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQNCW L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AIMGACLTNLE +I+TKCSNSAIEKRE+SVR+AAYILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL YKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

XP_022154498.1 uncharacterized protein LOC111021763 [Momordica charantia]0.0e+0097.67Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCL AMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT
        TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT

Query:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA
        GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA
Subjt:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA

Query:  INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCF
        INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPL+I VENRS RK AHSAK+KFLDACIILQTAIVFISKVIRL+SIFFV GIFSFCDCFKSLKKKLCF
Subjt:  INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCF

Query:  KDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALS
        KDAISINNSGSEVD DSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLI+LLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL 
Subjt:  KDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALS

Query:  VVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK
        VVTLT+IAISLPNISRCL+KQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK
Subjt:  VVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK

Query:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL
        TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYE KYGYRNEELFAETT MISAIMGACLTNLED+IATKCSNS IEKRERSVREAAYILGKTGNIL
Subjt:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL

Query:  KLIEKATLPTLDLHQMASIDEWRLAYKLES
        KLIEKATLPTLDLHQMASIDEWRLAYKLES
Subjt:  KLIEKATLPTLDLHQMASIDEWRLAYKLES

XP_022924953.1 uncharacterized protein LOC111432343 [Cucurbita moschata]0.0e+0081.92Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GCS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMAL ILVITLI+NVC+E+GTGVIYV+MKE +SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNG SDYKWSI+ IL++QC AV VGT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCPKLGK GYKKEFTLENYWI+YLVEMKQCPL+I+V+NR  RKL HSAKNKFLD CIILQT IVF SKVIRLISIF V GIFSFCDCFKSLK KL 
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+SGSEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQNCW L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AIMGACLTNLE +I+TKCSNSAIEKRE+SVR+AAYILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL YKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

XP_022966654.1 uncharacterized protein LOC111466274 [Cucurbita maxima]0.0e+0081.64Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GCS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMALGILVITLI+NVC+E+GTGVIYV+MKE +SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNGQSDYKWSI+ IL++QC AV VGT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FR PKLGK GYKKEFTLENYWI+YLVEMKQCPL+I+V+NR  RKL HS KNKFLD CIILQT IVF SKVIRLISIF V GIFSFCDCFKSLK KLC
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+SGSEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQN W L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AI+GACLTNLE +I+TKCSNSAIEKRE+SVR+AA ILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL YKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

XP_023517573.1 uncharacterized protein LOC111781294 [Cucurbita pepo subsp. pepo]0.0e+0081.78Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV  CS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMALGILVITLI+N C+E+GTGVIYV+MKEH+SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNGQSDYKWSI+ IL++QC AV +GT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCPKLGK GYKKEFTLENYWI+YLVEMKQCPL+I+V+NR  RKL HSAKNKFLD CIILQT IVF SKVIRLISIF VRGIFSFCDCFKSLK KLC
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+S SEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQNCW L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AIMGACLTNLE +I+TKCSNSAIEKRE+SVR+AAYILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL Y  E
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

TrEMBL top hitse value%identityAlignment
A0A1S3C1R3 uncharacterized protein LOC1034958530.0e+0078.63Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GC+ DGYLND+K+SEPLPWIGIYIAAASLVCLLAMAADL+HG+ H+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF N+MALGILVITLIVNV +E+GTGV+YV+MKEH+S++ LMLVLL I S SA  VPSTKSYLEMKY VRH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
            V+ERL+ GLMKYWMMA T SPQFV+GRSATCTASGAI LL+AAILAEAI+ SYLM+RSFKFCNGQSDYKWSIT IL++QC AV VGTVAPAIRWF 
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCP L  GGYKKEF LE YWIRYLVEMK+ PL+IRV+NR  RKLAH+A+N FLDACIILQT IVFISKVIRLISIFF RGIFS CDCFKSLK KL 
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        F   IS++NSGSE D DSKLDLSRFVLYLEGEDDLVH+MV NN +A + WIQKGQKKKPK LI+LLEG I+++GFKGVAEFDNHQVPCLDSKEP NCWAL
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT+IAISLPNI R L+K LV+AVNEGL YIR +ED  DT+G+FINLKKAAE+VWLG+DL+++WL +DL KISHHKE+P E+ +Q SD AKKIY+ E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        KTTN+ LCLKLS S WPI+ILAANCMYRISESMLLKYE KY Y NE+LF E  A IS IMGACLTNLE +I+TKC+N  IEKRE+SVREAAYILGKTG I
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        L +IEK  LP LD +QM SIDEWRLAYKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

A0A5A7SMA1 Uncharacterized protein0.0e+0078.63Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GC+ DGYLND+K+SEPLPWIGIYIAAASLVCLLAMAADL+HG+ H+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF N+MALGILVITLIVNV +E+GTGV+YV+MKEH+S++ LMLVLL I S SA  VPSTKSYLEMKY VRH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
            V+ERL+ GLMKYWMMA T SPQFV+GRSATCTASGAI LL+AAILAEAI+ SYLM+RSFKFCNGQSDYKWSIT IL++QC AV VGTVAPAIRWF 
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCP L  GGYKKEF LE YWIRYLVEMK+ PL+IRV+NR  RKLAH+A+N FLDACIILQT IVFISKVIRLISIFF RGIFS CDCFKSLK KL 
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        F   IS++NSGSE D DSKLDLSRFVLYLEGEDDLVH+MV NN +A + WIQKGQKKKPK LI+LLEG I+++GFKGVAEFDNHQVPCLDSKEP NCWAL
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT+IAISLPNI R L+K LV+AVNEGL YIR +ED  DT+G+FINLKKAAE+VWLG+DL+++WL +DL KISHHKE+P E+ +Q SD AKKIY+ E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        KTTN+ LCLKLS S WPI+ILAANCMYRISESMLLKYE KY Y NE+LF E  A IS IMGACLTNLE +I+TKC+N  IEKRE+SVREAAYILGKTG I
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        L +IEK  LP LD +QM SIDEWRLAYKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

A0A6J1DKG6 uncharacterized protein LOC1110217630.0e+0097.67Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCL AMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT
        TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT

Query:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA
        GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA
Subjt:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAA

Query:  INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCF
        INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPL+I VENRS RK AHSAK+KFLDACIILQTAIVFISKVIRL+SIFFV GIFSFCDCFKSLKKKLCF
Subjt:  INFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCF

Query:  KDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALS
        KDAISINNSGSEVD DSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLI+LLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL 
Subjt:  KDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALS

Query:  VVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK
        VVTLT+IAISLPNISRCL+KQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK
Subjt:  VVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEK

Query:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL
        TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYE KYGYRNEELFAETT MISAIMGACLTNLED+IATKCSNS IEKRERSVREAAYILGKTGNIL
Subjt:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL

Query:  KLIEKATLPTLDLHQMASIDEWRLAYKLES
        KLIEKATLPTLDLHQMASIDEWRLAYKLES
Subjt:  KLIEKATLPTLDLHQMASIDEWRLAYKLES

A0A6J1EDX2 uncharacterized protein LOC1114323430.0e+0081.92Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GCS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMAL ILVITLI+NVC+E+GTGVIYV+MKE +SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNG SDYKWSI+ IL++QC AV VGT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FRCPKLGK GYKKEFTLENYWI+YLVEMKQCPL+I+V+NR  RKL HSAKNKFLD CIILQT IVF SKVIRLISIF V GIFSFCDCFKSLK KL 
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+SGSEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQNCW L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AIMGACLTNLE +I+TKCSNSAIEKRE+SVR+AAYILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL YKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

A0A6J1HUF0 uncharacterized protein LOC1114662740.0e+0081.64Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MGV GCS DGYLND+KFSEP+PWIGIYIA ASLVCLLAMAADLVHG RH+KFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK
        TIM N MPSLGSMENQEIF NIMALGILVITLI+NVC+E+GTGVIYV+MKE +SI+ LMLVLL I SFSAL VPSTKSYLEMKY +RH+LA KEC A+ K
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKEC-ADAK

Query:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
         G   VERL+EG++KYWMMA TCSPQFV+GRSATCTASGAICLL+A ILAEAI+ SYL ++SFKFCNGQSDYKWSI+ IL++QC AV VGT+APAIRWFA
Subjt:  TGPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC
        AI FR PKLGK GYKKEFTLENYWI+YLVEMKQCPL+I+V+NR  RKL HS KNKFLD CIILQT IVF SKVIRLISIF V GIFSFCDCFKSLK KLC
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLC

Query:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL
        FKD IS+N+SGSEVD DSK+DLSRFVLYLEGEDDLVHLM+ANN +AT+ WIQKGQKKKPK LI+LLEG IM+RGFKGVAEFDN QVPCLDS++PQN W L
Subjt:  FKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWAL

Query:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE
         VVTLT IA SLPN++R L+K L+ AVNEGL YIRL+EDHLDTKGDFINLKKAAEIVWLGIDLHHKWLD+D+ KISHHKESPKE+LEQLS+CAKKIYS E
Subjt:  SVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVE

Query:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI
        K TNQ LCLKLSPS WPI++LAANCMYRISESMLLKYE KYG+ +E+LF E  A+I AI+GACLTNLE +I+TKCSNSAIEKRE+SVR+AA ILGKTGNI
Subjt:  KTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNI

Query:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE
        LKL+EK TLP LD HQM SI+EWRL YKLE
Subjt:  LKLIEKATLPTLDLHQMASIDEWRLAYKLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G17250.1 unknown protein6.3e-20951.03Show/hide
Query:  GCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMCTIMG
        GC   G L+D++FS+PLP IG+Y+AAASL+C +AM +DL+HG RH+KFWFPCKFF+LN+T+LT I+V +KLS+DLNTSMP R+DQLAKLSS+V +CT+M 
Subjt:  GCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMCTIMG

Query:  NSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKTGPAV
        NSMPSLG M  Q++  N+ ALGILVIT +VN+CI+LGTG IYVF +EH  ++ LML++  I SFSA+TVP+TK +LE+KY+ R++ A K+C         
Subjt:  NSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKTGPAV

Query:  VERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAAINF
        V +L+E LMK+WMMAHT SPQFV+ RS TCT +G +C L A  LAEA+VRSY +Q RS  FCNG SDYKWS TL+L+ Q AAV +GTVAPA RW  A+NF
Subjt:  VERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAAINF

Query:  RCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSI-RVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCFKD
        RCP   K   K E  +E+YW+  L E K+ PL+   +  R  RKLAH      LD CI  Q  IV  SK +R I+++    I   C  F        FK 
Subjt:  RCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSI-RVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCFKD

Query:  AISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEG-MIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALSV
            N   S  +P ++    +FVL+LEGE+++V  M  +N  AT+  IQKG+KK+P NLI LLE    +++GF+G+ +FD+ +V  L S EP N WAL +
Subjt:  AISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEG-MIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALSV

Query:  VTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKIS-HHKESPKEILEQLSDCAKKIYSVEK
        VTLTSIA++LPNI  C +K+LV AVNE L Y+   E+ LD +G+  N +KAAE+VWLG+DL+HKWL+VDL+K+S  H ++P+E+L++L D A+K ++   
Subjt:  VTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKIS-HHKESPKEILEQLSDCAKKIYSVEK

Query:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL
          N+ LC+K  PS+WPI+  AAN MYRIS+++L+KYE++     E L  +   M+S I+  C  N   +I  KC  +A+E RE SVREAA  LG+T  IL
Subjt:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNIL

Query:  KLIEKATLPTLDLHQMASIDEWRLAYK
        +++++  +P L  H++A IDEWR  Y+
Subjt:  KLIEKATLPTLDLHQMASIDEWRLAYK

AT4G17250.2 unknown protein6.1e-18851.84Show/hide
Query:  GCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMCTIMG
        GC   G L+D++FS+PLP IG+Y+AAASL+C +AM +DL+HG RH+KFWFPCKFF+LN+T+LT I+V +KLS+DLNTSMP R+DQLAKLSS+V +CT+M 
Subjt:  GCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMCTIMG

Query:  NSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKTGPAV
        NSMPSLG M  Q++  N+ ALGILVIT +VN+CI+LGTG IYVF +EH  ++ LML++  I SFSA+TVP+TK +LE+KY+ R++ A K+C         
Subjt:  NSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKTGPAV

Query:  VERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAAINF
        V +L+E LMK+WMMAHT SPQFV+ RS TCT +G +C L A  LAEA+VRSY +Q RS  FCNG SDYKWS TL+L+ Q AAV +GTVAPA RW  A+NF
Subjt:  VERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAAINF

Query:  RCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSI-RVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCFKD
        RCP   K   K E  +E+YW+  L E K+ PL+   +  R  RKLAH      LD CI  Q  IV  SK +R I+++    I   C  F        FK 
Subjt:  RCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSI-RVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCFKD

Query:  AISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEG-MIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALSV
            N   S  +P ++    +FVL+LEGE+++V  M  +N  AT+  IQKG+KK+P NLI LLE    +++GF+G+ +FD+ +V  L S EP N WAL +
Subjt:  AISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEG-MIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALSV

Query:  VTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKIS-HHKESPKEILEQLSDCAKKIYSVEK
        VTLTSIA++LPNI  C +K+LV AVNE L Y+   E+ LD +G+  N +KAAE+VWLG+DL+HKWL+VDL+K+S  H ++P+E+L++L D A+K ++   
Subjt:  VTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKIS-HHKESPKEILEQLSDCAKKIYSVEK

Query:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLK--YENKYGYRNEELFAE
          N+ LC+K  PS+WPI+  AAN MYRIS+++L+    E   G R+   F E
Subjt:  TTNQRLCLKLSPSNWPIRILAANCMYRISESMLLK--YENKYGYRNEELFAE

AT5G47580.1 unknown protein5.5e-22153.8Show/hide
Query:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC
        MG+ GC   G LND++FS+PLP IGIY+A ASL+C LAMAADL+HG RH+KFWFPCKFF+LN+TSLT IAV IKLSVDLNTSMP R+DQLAKLSS V +C
Subjt:  MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMC

Query:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT
        T+MGNSMPSLG M+N ++  N+MALGILVIT +VN+CI+LGTG IYVF +EH  +V LML++L I  FSA TVP+TK  LE KY  R+ +A K      T
Subjt:  TIMGNSMPSLGSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKT

Query:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA
           V +++++ L K+WMMAHT SPQFV+ RS TCTASG  CLL+A  L EA+VRSY ++ RS  FC+G SDYKWS +L+L+ Q   V +GT+APAIRWF 
Subjt:  GPAVVERLREGLMKYWMMAHTCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQ-RSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFA

Query:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRV-ENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKL
        A+NFRCP  GK  Y+ EF +E+YW  +  E KQ PLS+ + + R +RK AH+AK   LD CI++Q  IVF SK+IR IS+  V  I   C          
Subjt:  AINFRCPKLGKGGYKKEFTLENYWIRYLVEMKQCPLSIRV-ENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKL

Query:  CFKDAISINNSGSE---VDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMI-MTRGFKGVAEFDNHQVPCLDSKEPQ
          + + +I +S SE    +P S+ DL+ FVL+LEGED LV +MV +N  AT+ WI+KG+KK+P NLI LLE    +++GF+G+AEFD+++V  L   EP 
Subjt:  CFKDAISINNSGSE---VDPDSKLDLSRFVLYLEGEDDLVHLMVANNCYATNQWIQKGQKKKPKNLINLLEGMI-MTRGFKGVAEFDNHQVPCLDSKEPQ

Query:  NCWALSVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKK
        NCWAL +VTLTSIA++LP+I    +K+L++AVNE L Y+   E  LDT G+ +NL+KAAE+VWLG+DL+HKW DVDL+K+S  + +P+E L +L + AKK
Subjt:  NCWALSVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLKKAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKK

Query:  IYSVEKTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILG
         +S     N  +C+K  PS+WPI+ LAAN MYRI +++LL YE++     E L  +  + IS I+  C  N+  +I+ KC  SA+E RE SVR AA  LG
Subjt:  IYSVEKTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMGACLTNLEDMIATKCSNSAIEKRERSVREAAYILG

Query:  KTGNILKLIEKATLPTLDLHQMASIDEWRLAYKLES
        +T  IL+++E+  LP L   QM +IDEWR  YK+ S
Subjt:  KTGNILKLIEKATLPTLDLHQMASIDEWRLAYKLES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTGTTTGGCTGCAGCAAAGATGGGTACCTGAATGATTCCAAGTTCAGCGAGCCTCTGCCATGGATTGGAATCTACATAGCTGCAGCCTCTTTGGTCTGCCTGTT
AGCGATGGCCGCCGATCTCGTCCACGGCGTCCGCCACAAAAAATTCTGGTTCCCTTGCAAGTTCTTCACTCTTAACTCCACTTCCCTGACCCTGATAGCTGTGGCAATCA
AACTTTCTGTGGATCTCAACACCTCCATGCCTGGCAGAGAGGATCAGCTTGCAAAGCTAAGTAGTGCTGTATTGATGTGCACCATCATGGGTAATTCTATGCCTTCTCTT
GGATCTATGGAGAATCAAGAAATATTTACGAACATTATGGCTCTTGGGATTCTTGTTATTACCCTCATAGTAAACGTCTGTATTGAGTTGGGTACTGGAGTGATCTATGT
ATTTATGAAGGAGCATATTTCCATTGTATTCTTGATGCTGGTTTTGCTTGCCATCCAGAGTTTTTCTGCTTTGACTGTTCCATCTACTAAAAGCTACCTGGAAATGAAGT
ACAGAGTAAGACACCAATTAGCTTTTAAAGAGTGTGCAGATGCAAAAACAGGCCCAGCAGTAGTTGAACGACTCAGGGAGGGTCTGATGAAATATTGGATGATGGCTCAT
ACCTGCAGCCCCCAGTTTGTGATTGGTCGATCCGCTACTTGTACTGCTTCTGGGGCTATCTGTCTTCTTACCGCTGCAATTTTAGCAGAGGCCATTGTTAGATCTTACTT
GATGCAGAGATCATTTAAGTTCTGCAATGGCCAGTCTGATTATAAATGGTCCATCACTTTGATTCTCATAGTACAATGTGCTGCAGTGGGAGTTGGGACTGTTGCCCCAG
CAATCAGATGGTTTGCGGCCATAAATTTCCGGTGTCCAAAGCTGGGAAAAGGGGGCTACAAGAAGGAATTCACGTTAGAAAACTATTGGATCCGGTATCTGGTTGAAATG
AAACAGTGCCCATTATCTATCAGAGTTGAAAACAGAAGTTTCAGAAAGCTTGCTCATAGTGCAAAAAACAAATTTCTGGATGCATGCATTATCCTACAAACAGCCATAGT
GTTCATAAGCAAAGTGATTCGACTCATTTCCATTTTCTTTGTCAGGGGAATCTTCTCATTCTGTGACTGCTTCAAAAGTTTGAAAAAAAAGTTGTGTTTTAAAGATGCCA
TTTCTATCAACAATTCAGGGTCAGAGGTAGATCCTGATTCAAAGCTAGATCTCAGTCGTTTTGTTCTGTATCTTGAAGGTGAAGATGATCTAGTTCACCTCATGGTAGCC
AACAATTGTTATGCCACAAATCAATGGATTCAGAAAGGGCAGAAGAAAAAGCCTAAGAATCTCATTAACCTGTTAGAGGGAATGATCATGACACGAGGTTTCAAGGGAGT
GGCAGAGTTTGACAACCATCAAGTTCCATGTCTAGATTCCAAAGAGCCCCAAAATTGCTGGGCACTCTCAGTGGTGACATTAACAAGCATAGCAATTTCTCTTCCAAACA
TAAGCCGGTGTTTAGTCAAACAGTTGGTGAGCGCCGTTAACGAAGGCCTCCCATACATAAGACTTCTTGAAGATCACCTTGACACAAAAGGGGATTTCATAAACCTAAAG
AAAGCAGCTGAAATCGTTTGGCTAGGAATTGATCTCCACCACAAATGGCTGGACGTAGACCTCCAAAAAATTTCACATCATAAAGAAAGTCCAAAGGAAATTCTTGAACA
ACTTTCAGATTGTGCAAAGAAAATCTACTCGGTGGAGAAGACGACGAATCAGCGTCTATGCTTGAAGTTAAGTCCTTCAAATTGGCCAATCAGGATACTGGCTGCGAATT
GTATGTACAGAATAAGCGAATCAATGCTGCTAAAATATGAAAATAAATATGGGTACAGAAATGAAGAATTGTTCGCAGAAACAACAGCCATGATCTCCGCCATTATGGGA
GCATGCCTCACGAATTTGGAGGACATGATAGCCACAAAGTGTTCGAACAGTGCGATTGAAAAACGAGAGAGGAGTGTGAGAGAAGCAGCTTACATTCTTGGGAAAACAGG
GAACATATTGAAACTCATTGAGAAAGCGACACTTCCTACTCTTGATCTGCATCAAATGGCAAGCATTGATGAATGGCGATTGGCTTACAAGCTGGAGAGC
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTGTTTGGCTGCAGCAAAGATGGGTACCTGAATGATTCCAAGTTCAGCGAGCCTCTGCCATGGATTGGAATCTACATAGCTGCAGCCTCTTTGGTCTGCCTGTT
AGCGATGGCCGCCGATCTCGTCCACGGCGTCCGCCACAAAAAATTCTGGTTCCCTTGCAAGTTCTTCACTCTTAACTCCACTTCCCTGACCCTGATAGCTGTGGCAATCA
AACTTTCTGTGGATCTCAACACCTCCATGCCTGGCAGAGAGGATCAGCTTGCAAAGCTAAGTAGTGCTGTATTGATGTGCACCATCATGGGTAATTCTATGCCTTCTCTT
GGATCTATGGAGAATCAAGAAATATTTACGAACATTATGGCTCTTGGGATTCTTGTTATTACCCTCATAGTAAACGTCTGTATTGAGTTGGGTACTGGAGTGATCTATGT
ATTTATGAAGGAGCATATTTCCATTGTATTCTTGATGCTGGTTTTGCTTGCCATCCAGAGTTTTTCTGCTTTGACTGTTCCATCTACTAAAAGCTACCTGGAAATGAAGT
ACAGAGTAAGACACCAATTAGCTTTTAAAGAGTGTGCAGATGCAAAAACAGGCCCAGCAGTAGTTGAACGACTCAGGGAGGGTCTGATGAAATATTGGATGATGGCTCAT
ACCTGCAGCCCCCAGTTTGTGATTGGTCGATCCGCTACTTGTACTGCTTCTGGGGCTATCTGTCTTCTTACCGCTGCAATTTTAGCAGAGGCCATTGTTAGATCTTACTT
GATGCAGAGATCATTTAAGTTCTGCAATGGCCAGTCTGATTATAAATGGTCCATCACTTTGATTCTCATAGTACAATGTGCTGCAGTGGGAGTTGGGACTGTTGCCCCAG
CAATCAGATGGTTTGCGGCCATAAATTTCCGGTGTCCAAAGCTGGGAAAAGGGGGCTACAAGAAGGAATTCACGTTAGAAAACTATTGGATCCGGTATCTGGTTGAAATG
AAACAGTGCCCATTATCTATCAGAGTTGAAAACAGAAGTTTCAGAAAGCTTGCTCATAGTGCAAAAAACAAATTTCTGGATGCATGCATTATCCTACAAACAGCCATAGT
GTTCATAAGCAAAGTGATTCGACTCATTTCCATTTTCTTTGTCAGGGGAATCTTCTCATTCTGTGACTGCTTCAAAAGTTTGAAAAAAAAGTTGTGTTTTAAAGATGCCA
TTTCTATCAACAATTCAGGGTCAGAGGTAGATCCTGATTCAAAGCTAGATCTCAGTCGTTTTGTTCTGTATCTTGAAGGTGAAGATGATCTAGTTCACCTCATGGTAGCC
AACAATTGTTATGCCACAAATCAATGGATTCAGAAAGGGCAGAAGAAAAAGCCTAAGAATCTCATTAACCTGTTAGAGGGAATGATCATGACACGAGGTTTCAAGGGAGT
GGCAGAGTTTGACAACCATCAAGTTCCATGTCTAGATTCCAAAGAGCCCCAAAATTGCTGGGCACTCTCAGTGGTGACATTAACAAGCATAGCAATTTCTCTTCCAAACA
TAAGCCGGTGTTTAGTCAAACAGTTGGTGAGCGCCGTTAACGAAGGCCTCCCATACATAAGACTTCTTGAAGATCACCTTGACACAAAAGGGGATTTCATAAACCTAAAG
AAAGCAGCTGAAATCGTTTGGCTAGGAATTGATCTCCACCACAAATGGCTGGACGTAGACCTCCAAAAAATTTCACATCATAAAGAAAGTCCAAAGGAAATTCTTGAACA
ACTTTCAGATTGTGCAAAGAAAATCTACTCGGTGGAGAAGACGACGAATCAGCGTCTATGCTTGAAGTTAAGTCCTTCAAATTGGCCAATCAGGATACTGGCTGCGAATT
GTATGTACAGAATAAGCGAATCAATGCTGCTAAAATATGAAAATAAATATGGGTACAGAAATGAAGAATTGTTCGCAGAAACAACAGCCATGATCTCCGCCATTATGGGA
GCATGCCTCACGAATTTGGAGGACATGATAGCCACAAAGTGTTCGAACAGTGCGATTGAAAAACGAGAGAGGAGTGTGAGAGAAGCAGCTTACATTCTTGGGAAAACAGG
GAACATATTGAAACTCATTGAGAAAGCGACACTTCCTACTCTTGATCTGCATCAAATGGCAAGCATTGATGAATGGCGATTGGCTTACAAGCTGGAGAGC
Protein sequenceShow/hide protein sequence
MGVFGCSKDGYLNDSKFSEPLPWIGIYIAAASLVCLLAMAADLVHGVRHKKFWFPCKFFTLNSTSLTLIAVAIKLSVDLNTSMPGREDQLAKLSSAVLMCTIMGNSMPSL
GSMENQEIFTNIMALGILVITLIVNVCIELGTGVIYVFMKEHISIVFLMLVLLAIQSFSALTVPSTKSYLEMKYRVRHQLAFKECADAKTGPAVVERLREGLMKYWMMAH
TCSPQFVIGRSATCTASGAICLLTAAILAEAIVRSYLMQRSFKFCNGQSDYKWSITLILIVQCAAVGVGTVAPAIRWFAAINFRCPKLGKGGYKKEFTLENYWIRYLVEM
KQCPLSIRVENRSFRKLAHSAKNKFLDACIILQTAIVFISKVIRLISIFFVRGIFSFCDCFKSLKKKLCFKDAISINNSGSEVDPDSKLDLSRFVLYLEGEDDLVHLMVA
NNCYATNQWIQKGQKKKPKNLINLLEGMIMTRGFKGVAEFDNHQVPCLDSKEPQNCWALSVVTLTSIAISLPNISRCLVKQLVSAVNEGLPYIRLLEDHLDTKGDFINLK
KAAEIVWLGIDLHHKWLDVDLQKISHHKESPKEILEQLSDCAKKIYSVEKTTNQRLCLKLSPSNWPIRILAANCMYRISESMLLKYENKYGYRNEELFAETTAMISAIMG
ACLTNLEDMIATKCSNSAIEKRERSVREAAYILGKTGNILKLIEKATLPTLDLHQMASIDEWRLAYKLES