CuGenDBv2

PI0017292 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0017292
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Glycosyltransferase family protein 2
Genome location: chr12:20296430..20311073
RNA-Seq Expression: PI0017292
Synteny: PI0017292
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR029044 - Nucleotide-diphospho-sugar transferases


Homology
GenBank top hits    e value    % identity    Alignment
XP_004139558.2 uncharacterized protein LOC101202906 [Cucumis sativus]    0.0e+00    94.21
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MGM RN  MGNGDC+EGMI D VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW+QFVIP HVVGRYQEPN
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKN+VLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYR+AGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG+FCPCEDVADALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSGISNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTL+LLPRPSISKVLWMA+LRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNRASSLTRLLKSLKDAYYLGDEI ISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFP+QASFSTNHMEPGAHISAKDN+VKHKKEDFEVPLLKENF+NFLPN K+PAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKL QDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

XP_008462712.1 PREDICTED: uncharacterized protein LOC103501011 isoform X1 [Cucumis melo]    0.0e+00    94.31
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG  RNSAMGNGDCLEGMIND VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW QFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETL QLMAMKSKWDLKGP KPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG FCPCEDV DALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSG+SNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNR SSLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAK+NVVKH KEDFEVPLLKENF+N+LPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

XP_008462738.1 PREDICTED: uncharacterized protein LOC103501011 isoform X2 [Cucumis melo]    0.0e+00    94.31
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG  RNSAMGNGDCLEGMIND VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW QFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETL QLMAMKSKWDLKGP KPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG FCPCEDV DALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSG+SNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNR SSLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAK+NVVKH KEDFEVPLLKENF+N+LPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

XP_022970291.1 uncharacterized protein LOC111469301 [Cucurbita maxima]    0.0e+00    92.18
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG+ RN A  NGD LEGMIND VGGKGKLRPQR+SST++VAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIP HVVGR +EP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S +MMQAE RPITP+EACENEKIDFEQKKS DGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDL+GPNKPKVTVILNHFKRKTLCAQLNSLL 
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHH+WVLAFGSPNELSLKRIVDSYNNS+ISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNA SFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKA+STGYITQWAAM+PQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYI VSGG+FCPCED A ALKWPK VCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNG-TTLILLPRPSISKVLWMADLRSTALPNWNKMH
        RRFKIFDLAIGALSGISNSEVP+VQAVYASMKGLIKIHNPSV+ITVAD+DPNVKKALKMASEANLNG TT+ILLPRPSISKVLWMADLRSTALPNWNKM 
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNG-TTLILLPRPSISKVLWMADLRSTALPNWNKMH

Query:  ISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYY
        ISINIITQNRA SLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYY
Subjt:  ISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYY

Query:  YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSR
        YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMN+R+TENAKENPVQIPKSR
Subjt:  YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSR

Query:  TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKS
        TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDN+VKHKKEDFEVPLLKENF NFLPNGKLPAASRLPSLNLFNQPVSLKGLKS
Subjt:  TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKS

Query:  AGAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        AGAKLGQDVLKCE  EIVAVNH TGLPSHCAKF
Subjt:  AGAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

XP_038894981.1 uncharacterized protein LOC120083336 [Benincasa hispida]    0.0e+00    93.45
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MGM RNSA GNGD LEGMI+D VGGKGKLRPQR+SST+IVAGLTCLQFAFALYATFLLYYVSP+IDLRTKPDFSWATRIAQQWRQFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S +MMQAE RPITP+EACENEKIDFEQKKSNDGQMIKLKT+LYNEILDFQSKSFGTETL QLMAMKSKWDL+GPNKPKVTVILNHFKRKTLCAQLNSLL 
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHH+WVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKD+ETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGYITQWAAMHPQKIDALFYAHSVDE KALAPLLEKFRSTVGKKAYIVVSGG+FCPCED A ALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSGISNSEVP+VQAVYASMKGLIKIHNPSV+ITVAD+DPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNRASSLTRLLKSLKDAYYLGDEI ISFNMDSKVDEETIKLVSSFEWPHGPKS RRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFF RIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENF+NFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIVAVNH TGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

TrEMBL top hits    e value    % identity    Alignment
A0A0A0LV36 Uncharacterized protein    0.0e+00    94.21
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MGM RN  MGNGDC+EGMI D VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW+QFVIP HVVGRYQEPN
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKN+VLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYR+AGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG+FCPCEDVADALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSGISNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTL+LLPRPSISKVLWMA+LRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNRASSLTRLLKSLKDAYYLGDEI ISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFP+QASFSTNHMEPGAHISAKDN+VKHKKEDFEVPLLKENF+NFLPN K+PAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKL QDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

A0A1S3CHM5 uncharacterized protein LOC103501011 isoform X2    0.0e+00    94.31
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG  RNSAMGNGDCLEGMIND VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW QFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETL QLMAMKSKWDLKGP KPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG FCPCEDV DALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSG+SNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNR SSLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAK+NVVKH KEDFEVPLLKENF+N+LPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

A0A1S3CHM8 uncharacterized protein LOC103501011 isoform X1    0.0e+00    94.31
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG  RNSAMGNGDCLEGMIND VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW QFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETL QLMAMKSKWDLKGP KPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG FCPCEDV DALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSG+SNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNR SSLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAK+NVVKH KEDFEVPLLKENF+N+LPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

A0A5A7SVB5 Glycosyl transferase, family 2    0.0e+00    94.31
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG  RNSAMGNGDCLEGMIND VGGKGKLRPQRSSST+IVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQW QFVIP HVVGRYQEP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S MMMQAELRPITP+EACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETL QLMAMKSKWDLKGP KPKVTVILNHFKRKTLCAQLNSLLH
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKALSTGY+TQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGG FCPCEDV DALKWPKLVCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI
        RRFKIFDLAIGALSG+SNSEVP+VQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKM I
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILLPRPSISKVLWMADLRSTALPNWNKMHI

Query:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY
        SINIITQNR SSLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYYY
Subjt:  SINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYY

Query:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
        LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT
Subjt:  LWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRT

Query:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA
        NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAK+NVVKH KEDFEVPLLKENF+N+LPNGKLPAASRLPSLNLFNQPVSLKGLKSA
Subjt:  NGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSA

Query:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        GAKLGQDVLKCE  EIV VNHGTGLPSHCAKF
Subjt:  GAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

A0A6J1HYP9 uncharacterized protein LOC111469301    0.0e+00    92.18
Query:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN
        MG+ RN A  NGD LEGMIND VGGKGKLRPQR+SST++VAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIP HVVGR +EP 
Subjt:  MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPN

Query:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH
        S +MMQAE RPITP+EACENEKIDFEQKKS DGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDL+GPNKPKVTVILNHFKRKTLCAQLNSLL 
Subjt:  SMMMMQAELRPITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLH

Query:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
        QTLPFHH+WVLAFGSPNELSLKRIVDSYNNS+ISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR
Subjt:  QTLPFHHVWVLAFGSPNELSLKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFR

Query:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR
        QKDFTFPSYRKF                       +DFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNA SFVLPVDPKDKETWGDSEHR
Subjt:  QKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHR

Query:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE
        LAYVSETTVIFKDIVQVRD+QWWKA+STGYITQWAAM+PQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYI VSGG+FCPCED A ALKWPK VCKE
Subjt:  LAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKE

Query:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNG-TTLILLPRPSISKVLWMADLRSTALPNWNKMH
        RRFKIFDLAIGALSGISNSEVP+VQAVYASMKGLIKIHNPSV+ITVAD+DPNVKKALKMASEANLNG TT+ILLPRPSISKVLWMADLRSTALPNWNKM 
Subjt:  RRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNG-TTLILLPRPSISKVLWMADLRSTALPNWNKMH

Query:  ISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYY
        ISINIITQNRA SLTRLLKSLKDAYYLGDEI ISFNMDSKVDE+TIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYP+SDDDYGLLLEDDIEVSPYY
Subjt:  ISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYY

Query:  YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSR
        YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMN+R+TENAKENPVQIPKSR
Subjt:  YLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPVQIPKSR

Query:  TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKS
        TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDN+VKHKKEDFEVPLLKENF NFLPNGKLPAASRLPSLNLFNQPVSLKGLKS
Subjt:  TNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKS

Query:  AGAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF
        AGAKLGQDVLKCE  EIVAVNH TGLPSHCAKF
Subjt:  AGAKLGQDVLKCEAFEIVAVNHGTGLPSHCAKF

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT5G12260.1 BEST Arabidopsis thaliana protein match is: glycosyltransferase family protein 2 (TAIR:AT5G60700.1)    8.0e-28    31.2
Query:  INIITQNRASSLTRLLKSLKDAYY--LGDEISI-------SFNM---DSKVDE------ETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDD
        I ++T NR  SL+R L+SL  A Y   GD   I        FN+   D+ V++      E +  V  FEW  G K +  R    GL     E+W+P SD 
Subjt:  INIITQNRASSLTRLLKSLKDAYY--LGDEISI-------SFNM---DSKVDE------ETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDD

Query:  DYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTP-YLHQLPCSWGAVFFPKHWREFYVYMNS
        ++  ++EDD+EVSP YY  ++  +L Y+YD     P +   SL  PR V      P  +  +    + P T   L+QL  +WG + FPK W+EF ++ + 
Subjt:  DYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTP-YLHQLPCSWGAVFFPKHWREFYVYMNS

Query:  RFTENAKENPVQIPKSRTNGW-----QASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAH
          ++  K     +    +NGW     +  W  + I  ++ RGY ++Y +FPN+ + S +H + G +
Subjt:  RFTENAKENPVQIPKSRTNGW-----QASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAH

AT5G60700.1 glycosyltransferase family protein 2    0.0e+00    82.78
Query:  MIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFRQKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGE
        MIPG+KMLQ+LSHVAGT+KY+N+VLGSIGRILPFRQKDFTFPSYRKF                       +DFLSSSWFLSAELVK LFIE PFTF+TGE
Subjt:  MIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFRQKDFTFPSYRKF-----------------------LDFLSSSWFLSAELVKTLFIETPFTFATGE

Query:  DLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFR
        DLHLSYQLQKYRNAGSFVLPVDP DKETWGDSEHRLAYVSETTVIFK+IV+VRD QWWKALSTGY+TQWAAM+PQKIDALFYAHS+DE KAL PLLEKFR
Subjt:  DLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEKFR

Query:  STVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKERRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANL
         TVGKKAYI VSGG FCPCED A AL+WPK+VCKERRFKIFDL +GA+ G+SNSEVP+ QAVY+SMKGLIKIHNPSV+ITVAD DPNVKKALKMA+E N 
Subjt:  STVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKERRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANL

Query:  NGTTLILLPRPSISKVLWMADLRSTALPNWNKMHISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRII
        NGT L+LLPR SISKVLWMADLRSTALPNWNKM +S+NIITQNRA SL RLL+SL +AYYLGDEIS+SFNMDSKVDEETI +VS+F+WPHGPK+LRRRII
Subjt:  NGTTLILLPRPSISKVLWMADLRSTALPNWNKMHISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRII

Query:  QGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWG
        QGGLIRAVSESWYP+SDDD+GLLLEDDIEVSPYY+LWIKYALLAYHYDPQ+S PELSSISLYTP++VEVVKERPKWN T+FFK+IHP+TPYLHQLPCSWG
Subjt:  QGGLIRAVSESWYPSSDDDYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWG

Query:  AVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKE
        AVFFPK WREFYVYMN RFTENAK NPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQ+SFSTNHMEPGAHI+AKDNVVKH K DFEVPLL +
Subjt:  AVFFPKHWREFYVYMNSRFTENAKENPVQIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKE

Query:  NFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSAGAKLGQDVLKC-EAFEIVAVNHGTGLPSHCAKF
        +F NFLPN KLP  S+LPSLNLFN PVSLKGLK+AGAKLGQDVL+C    EIVAVNH TGLP+ C KF
Subjt:  NFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSAGAKLGQDVLKC-EAFEIVAVNHGTGLPSHCAKF


Sequences
CDS sequence
ATGGGTATGTTACGAAATTCCGCGATGGGAAACGGGGATTGTTTAGAAGGGATGATCAATGATTGTGTTGGAGGGAAGGGGAAGTTAAGACCCCAAAGAAGTTCTTCAAC
AAGGATTGTTGCTGGTCTTACATGTCTTCAGTTTGCCTTTGCATTATATGCAACATTTCTTTTGTATTATGTCAGTCCTGCAATAGACTTGAGAACCAAGCCAGATTTCT
CTTGGGCTACAAGAATTGCTCAACAATGGAGACAGTTCGTAATACCGACACATGTTGTGGGTCGATACCAAGAACCGAATTCTATGATGATGATGCAAGCGGAATTAAGA
CCGATCACTCCTAAGGAAGCTTGTGAGAATGAGAAGATTGATTTTGAACAAAAGAAGTCCAATGATGGGCAGATGATAAAGTTGAAAACAGAGCTTTATAATGAGATTCT
AGATTTTCAAAGCAAAAGCTTTGGAACTGAAACTCTTTCTCAGCTAATGGCAATGAAATCTAAGTGGGATTTGAAAGGGCCAAACAAGCCAAAAGTTACAGTGATCTTGA
ACCATTTCAAGAGAAAAACTTTGTGTGCACAACTTAATTCTTTGCTTCATCAAACCCTTCCTTTCCACCATGTTTGGGTGCTTGCATTTGGGAGTCCAAATGAGCTCTCT
TTGAAAAGAATTGTAGATAGCTATAACAACTCAAAAATTAGCTTCATTAGCTCAAGCTATGACTTCAAGTACTATGGAAGGTTCCAAATGGCCTTACAAACCGAAGCCGA
TTTAGTATATATTCTTGATGATGATATGATTCCTGGCCGTAAAATGCTACAGATTTTATCTCATGTAGCAGGGACTGACAAGTACAAGAATGCAGTTTTAGGTAGCATAG
GAAGAATTTTACCATTTCGACAGAAGGATTTCACATTCCCTAGTTATCGAAAGTTCCTCGATTTTCTTTCCAGCTCTTGGTTCTTATCTGCAGAGCTTGTGAAGACACTT
TTCATTGAAACCCCTTTCACCTTTGCAACTGGAGAAGATCTTCATCTAAGCTATCAGCTTCAAAAGTATAGAAATGCAGGCTCATTTGTTCTTCCTGTAGACCCAAAAGA
CAAAGAAACATGGGGAGACAGTGAACACAGGCTGGCTTACGTGTCCGAGACAACTGTGATATTCAAGGACATTGTTCAAGTTCGAGATGAACAATGGTGGAAAGCGTTGT
CTACGGGTTATATCACACAATGGGCTGCTATGCATCCTCAAAAAATAGATGCTCTATTTTATGCTCATTCCGTTGATGAAGCTAAAGCACTAGCACCACTTCTTGAAAAG
TTTAGGTCCACGGTTGGCAAGAAGGCTTATATTGTAGTGTCGGGCGGAAGTTTTTGCCCGTGTGAAGATGTTGCAGATGCTCTTAAATGGCCGAAATTGGTTTGTAAAGA
ACGGAGGTTCAAGATATTTGACTTAGCTATTGGGGCTCTCTCTGGAATATCAAATTCCGAGGTTCCTATGGTGCAAGCAGTGTATGCTAGTATGAAGGGATTGATCAAAA
TACACAATCCTAGCGTCATCATCACCGTGGCGGATATTGATCCTAATGTGAAGAAAGCTTTAAAAATGGCATCAGAAGCTAACTTGAATGGTACAACACTGATTCTTTTA
CCAAGGCCTTCCATTTCGAAAGTTCTTTGGATGGCTGATCTTCGATCAACAGCACTTCCAAATTGGAACAAGATGCATATTTCGATCAACATTATCACACAAAATCGTGC
CAGCTCGTTAACAAGGCTTCTCAAATCACTAAAAGATGCATATTACTTAGGGGATGAGATATCTATCAGCTTCAACATGGACAGTAAAGTAGACGAGGAAACTATAAAAT
TAGTAAGCTCCTTTGAGTGGCCCCATGGCCCGAAAAGCCTCAGAAGGAGAATCATCCAAGGAGGGCTAATCCGAGCAGTAAGCGAGAGTTGGTATCCGTCTTCAGACGAT
GATTATGGACTGCTACTCGAAGACGATATTGAAGTCTCTCCATACTATTACTTATGGATCAAATACGCCCTCCTAGCGTACCACTATGATCCACAAATATCTCTACCCGA
GCTATCGTCAATCTCCCTCTACACACCTCGACTAGTCGAAGTGGTAAAAGAAAGACCTAAATGGAATGCAACAGAGTTCTTCAAGCGGATTCATCCAAATACACCGTACC
TCCATCAGCTCCCCTGTAGTTGGGGAGCAGTTTTCTTTCCCAAACATTGGAGAGAGTTTTACGTTTACATGAACTCAAGGTTCACAGAAAATGCAAAAGAGAACCCAGTT
CAAATCCCTAAATCTAGAACAAATGGTTGGCAGGCATCATGGAAGAAGTTCCTAATAGACATGATGTATTTAAGAGGCTACGTAAGTCTCTACCCTAACTTCCCAAATCA
AGCCAGTTTTTCAACAAACCACATGGAACCAGGGGCTCACATAAGTGCTAAAGACAATGTCGTGAAGCACAAGAAAGAAGATTTTGAGGTTCCATTATTGAAAGAAAACT
TTATGAATTTCTTACCCAATGGGAAATTGCCGGCTGCTTCGAGACTTCCATCATTGAACCTGTTCAATCAACCGGTGTCACTGAAGGGCCTCAAGTCCGCTGGAGCCAAG
CTAGGTCAAGATGTGCTGAAATGCGAAGCTTTTGAGATTGTAGCAGTGAATCATGGGACTGGTCTGCCTTCGCACTGTGCAAAATTCTGA
mRNA sequence
AATTAAGTATATCTCAATATAAAGCATAGGTCCTCAACCAAGCAGTATAAAAAAACAAAAGATAGTATGAAAGTTGTTTGGCGGGTAAAGATATTTTATAAATTACTTTC
TAGAGAGAGAGAGAGAAAGGAGTAGAGAAGTGCCAAAACGGAGGGGAGATATAGCCATTTTCCAAGGCTTCAAATCCGGGGCTTTGTATTAATTTCTCAGCTGGAAGTTA
TAGATTTCGCCCCGTTTCGAAAAGTTTTAAGGTTGTGTCGAGTTGAAGATGGGTATGTTACGAAATTCCGCGATGGGAAACGGGGATTGTTTAGAAGGGATGATCAATGA
TTGTGTTGGAGGGAAGGGGAAGTTAAGACCCCAAAGAAGTTCTTCAACAAGGATTGTTGCTGGTCTTACATGTCTTCAGTTTGCCTTTGCATTATATGCAACATTTCTTT
TGTATTATGTCAGTCCTGCAATAGACTTGAGAACCAAGCCAGATTTCTCTTGGGCTACAAGAATTGCTCAACAATGGAGACAGTTCGTAATACCGACACATGTTGTGGGT
CGATACCAAGAACCGAATTCTATGATGATGATGCAAGCGGAATTAAGACCGATCACTCCTAAGGAAGCTTGTGAGAATGAGAAGATTGATTTTGAACAAAAGAAGTCCAA
TGATGGGCAGATGATAAAGTTGAAAACAGAGCTTTATAATGAGATTCTAGATTTTCAAAGCAAAAGCTTTGGAACTGAAACTCTTTCTCAGCTAATGGCAATGAAATCTA
AGTGGGATTTGAAAGGGCCAAACAAGCCAAAAGTTACAGTGATCTTGAACCATTTCAAGAGAAAAACTTTGTGTGCACAACTTAATTCTTTGCTTCATCAAACCCTTCCT
TTCCACCATGTTTGGGTGCTTGCATTTGGGAGTCCAAATGAGCTCTCTTTGAAAAGAATTGTAGATAGCTATAACAACTCAAAAATTAGCTTCATTAGCTCAAGCTATGA
CTTCAAGTACTATGGAAGGTTCCAAATGGCCTTACAAACCGAAGCCGATTTAGTATATATTCTTGATGATGATATGATTCCTGGCCGTAAAATGCTACAGATTTTATCTC
ATGTAGCAGGGACTGACAAGTACAAGAATGCAGTTTTAGGTAGCATAGGAAGAATTTTACCATTTCGACAGAAGGATTTCACATTCCCTAGTTATCGAAAGTTCCTCGAT
TTTCTTTCCAGCTCTTGGTTCTTATCTGCAGAGCTTGTGAAGACACTTTTCATTGAAACCCCTTTCACCTTTGCAACTGGAGAAGATCTTCATCTAAGCTATCAGCTTCA
AAAGTATAGAAATGCAGGCTCATTTGTTCTTCCTGTAGACCCAAAAGACAAAGAAACATGGGGAGACAGTGAACACAGGCTGGCTTACGTGTCCGAGACAACTGTGATAT
TCAAGGACATTGTTCAAGTTCGAGATGAACAATGGTGGAAAGCGTTGTCTACGGGTTATATCACACAATGGGCTGCTATGCATCCTCAAAAAATAGATGCTCTATTTTAT
GCTCATTCCGTTGATGAAGCTAAAGCACTAGCACCACTTCTTGAAAAGTTTAGGTCCACGGTTGGCAAGAAGGCTTATATTGTAGTGTCGGGCGGAAGTTTTTGCCCGTG
TGAAGATGTTGCAGATGCTCTTAAATGGCCGAAATTGGTTTGTAAAGAACGGAGGTTCAAGATATTTGACTTAGCTATTGGGGCTCTCTCTGGAATATCAAATTCCGAGG
TTCCTATGGTGCAAGCAGTGTATGCTAGTATGAAGGGATTGATCAAAATACACAATCCTAGCGTCATCATCACCGTGGCGGATATTGATCCTAATGTGAAGAAAGCTTTA
AAAATGGCATCAGAAGCTAACTTGAATGGTACAACACTGATTCTTTTACCAAGGCCTTCCATTTCGAAAGTTCTTTGGATGGCTGATCTTCGATCAACAGCACTTCCAAA
TTGGAACAAGATGCATATTTCGATCAACATTATCACACAAAATCGTGCCAGCTCGTTAACAAGGCTTCTCAAATCACTAAAAGATGCATATTACTTAGGGGATGAGATAT
CTATCAGCTTCAACATGGACAGTAAAGTAGACGAGGAAACTATAAAATTAGTAAGCTCCTTTGAGTGGCCCCATGGCCCGAAAAGCCTCAGAAGGAGAATCATCCAAGGA
GGGCTAATCCGAGCAGTAAGCGAGAGTTGGTATCCGTCTTCAGACGATGATTATGGACTGCTACTCGAAGACGATATTGAAGTCTCTCCATACTATTACTTATGGATCAA
ATACGCCCTCCTAGCGTACCACTATGATCCACAAATATCTCTACCCGAGCTATCGTCAATCTCCCTCTACACACCTCGACTAGTCGAAGTGGTAAAAGAAAGACCTAAAT
GGAATGCAACAGAGTTCTTCAAGCGGATTCATCCAAATACACCGTACCTCCATCAGCTCCCCTGTAGTTGGGGAGCAGTTTTCTTTCCCAAACATTGGAGAGAGTTTTAC
GTTTACATGAACTCAAGGTTCACAGAAAATGCAAAAGAGAACCCAGTTCAAATCCCTAAATCTAGAACAAATGGTTGGCAGGCATCATGGAAGAAGTTCCTAATAGACAT
GATGTATTTAAGAGGCTACGTAAGTCTCTACCCTAACTTCCCAAATCAAGCCAGTTTTTCAACAAACCACATGGAACCAGGGGCTCACATAAGTGCTAAAGACAATGTCG
TGAAGCACAAGAAAGAAGATTTTGAGGTTCCATTATTGAAAGAAAACTTTATGAATTTCTTACCCAATGGGAAATTGCCGGCTGCTTCGAGACTTCCATCATTGAACCTG
TTCAATCAACCGGTGTCACTGAAGGGCCTCAAGTCCGCTGGAGCCAAGCTAGGTCAAGATGTGCTGAAATGCGAAGCTTTTGAGATTGTAGCAGTGAATCATGGGACTGG
TCTGCCTTCGCACTGTGCAAAATTCTGA
Protein sequence
MGMLRNSAMGNGDCLEGMINDCVGGKGKLRPQRSSSTRIVAGLTCLQFAFALYATFLLYYVSPAIDLRTKPDFSWATRIAQQWRQFVIPTHVVGRYQEPNSMMMMQAELR
PITPKEACENEKIDFEQKKSNDGQMIKLKTELYNEILDFQSKSFGTETLSQLMAMKSKWDLKGPNKPKVTVILNHFKRKTLCAQLNSLLHQTLPFHHVWVLAFGSPNELS
LKRIVDSYNNSKISFISSSYDFKYYGRFQMALQTEADLVYILDDDMIPGRKMLQILSHVAGTDKYKNAVLGSIGRILPFRQKDFTFPSYRKFLDFLSSSWFLSAELVKTL
FIETPFTFATGEDLHLSYQLQKYRNAGSFVLPVDPKDKETWGDSEHRLAYVSETTVIFKDIVQVRDEQWWKALSTGYITQWAAMHPQKIDALFYAHSVDEAKALAPLLEK
FRSTVGKKAYIVVSGGSFCPCEDVADALKWPKLVCKERRFKIFDLAIGALSGISNSEVPMVQAVYASMKGLIKIHNPSVIITVADIDPNVKKALKMASEANLNGTTLILL
PRPSISKVLWMADLRSTALPNWNKMHISINIITQNRASSLTRLLKSLKDAYYLGDEISISFNMDSKVDEETIKLVSSFEWPHGPKSLRRRIIQGGLIRAVSESWYPSSDD
DYGLLLEDDIEVSPYYYLWIKYALLAYHYDPQISLPELSSISLYTPRLVEVVKERPKWNATEFFKRIHPNTPYLHQLPCSWGAVFFPKHWREFYVYMNSRFTENAKENPV
QIPKSRTNGWQASWKKFLIDMMYLRGYVSLYPNFPNQASFSTNHMEPGAHISAKDNVVKHKKEDFEVPLLKENFMNFLPNGKLPAASRLPSLNLFNQPVSLKGLKSAGAK
LGQDVLKCEAFEIVAVNHGTGLPSHCAKF