; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04390 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04390
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUPF0481 protein At3g47200-like
Genome locationchr8:3201646..3202878
RNA-Seq ExpressionMoc08g04390
SyntenyMoc08g04390
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064930.1 UPF0481 protein [Cucumis melo var. makuwa]1.5e-9248.17Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        VP  L    P+AY PQ I IG       H+    ND   K  KG +V  F +VAK+  NE+I + + WE+ AR+ Y  +   K++  +F++ L+ D CFV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH
        VMY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC +  Q  L + SFI+L   +F + REG+ Y+ + + ++ D  EV HL+ 
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH

Query:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL
        FL  ++    + PH  D S         +S WP TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF 
Subjt:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL

Query:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA
        MFM FL+N++ DV+LL+ +GII NHL S +E+  LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ 
Subjt:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA

Query:  VVAVLSMPK
        VVA +++ K
Subjt:  VVAVLSMPK

XP_004138858.1 UPF0481 protein At3g47200 [Cucumis sativus]4.5e-8947.77Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND--KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFVV
        VP  L +  P+AY PQ I IG       H+    ND  K  KG +V  F +VAK++ NE+I + + WE+ AR+ Y+     K D  +F++ L+ D CFVV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND--KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFVV

Query:  MYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIHF
        MY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC S  Q  L + SFI+L   +F   REG+ Y+ K+   + D   V HL+ F
Subjt:  MYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIHF

Query:  LSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRF
        L  ++  P        L     + S WP TATEL++ GISF  +K       F ER G+L++P III+++FE   RN+IAYEY   KS   SNF MFM F
Subjt:  LSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRF

Query:  LLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVL
        L+N++ DV+LL+ +GII N L S KE+  LF DLCKN++ ERN Y+  C +M++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ VVA +
Subjt:  LLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVL

Query:  SMPK
        ++ K
Subjt:  SMPK

XP_008445209.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]2.5e-9248.17Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        VP  L    P+AY PQ I IG       H+    ND   K  KG +V  F +VAK+  NE+I + + WE+ AR+ Y  +   K++  +F++ L+ D CFV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH
        VMY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC +  Q  L + SFI+L   +F + REG+ Y+ + + ++ D  EV HL+ 
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH

Query:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL
        FL  ++    + PH  D S         +S WP TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF 
Subjt:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL

Query:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA
        MFM FL+N++ DV+LL+ +GII NHL S +E+  LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ 
Subjt:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA

Query:  VVAVLSMPK
        VVA +++ K
Subjt:  VVAVLSMPK

XP_022131636.1 UPF0481 protein At3g47200-like [Momordica charantia]5.8e-238100Show/hide
Query:  MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
        MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
Subjt:  MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM

Query:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK
        DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK
Subjt:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK

Query:  HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM
        HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM
Subjt:  HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM

Query:  FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV
        FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV
Subjt:  FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV

Query:  VAVLSMPKPK
        VAVLSMPKPK
Subjt:  VAVLSMPKPK

XP_022132033.1 UPF0481 protein At3g47200-like [Momordica charantia]1.7e-13666.43Show/hide
Query:  DKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKI---MKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
        D I  VP AL+   P A+IPQFI IG   P      S   D I   MK  F   F+S  +VELN+I+  VIG E+ A + Y    R  +   +F++ L++
Subjt:  DKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKI---MKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM

Query:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNN-QTILGNTSFIQLTHQFFIDREGIGYLGKDFRV-EEDKLE
        D CFVVMY+L SV P FQD + SSF WRFNDA+FRD++LF+NQLPFFLL+SLY+LC  NN ++ILG  SFIQLTH+FF+ REGIGYLGKDFRV  EDKLE
Subjt:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNN-QTILGNTSFIQLTHQFFIDREGIGYLGKDFRV-EEDKLE

Query:  VKHLIHFLSCYMNFPHQDDSSKSLDGTPLMV----SVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPG
        V H +HFLS YMN         SLD T   V    ++WPPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   
Subjt:  VKHLIHFLSCYMNFPHQDDSSKSLDGTPLMV----SVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPG

Query:  VSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLL
        VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VTKLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLL
Subjt:  VSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLL

Query:  TLMRAVVAVLSMPK
        TLM+ VVAVLSMPK
Subjt:  TLMRAVVAVLSMPK

TrEMBL top hitse value%identityAlignment
A0A0A0LPK8 Uncharacterized protein2.2e-8947.77Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND--KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFVV
        VP  L +  P+AY PQ I IG       H+    ND  K  KG +V  F +VAK++ NE+I + + WE+ AR+ Y+     K D  +F++ L+ D CFVV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND--KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFVV

Query:  MYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIHF
        MY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC S  Q  L + SFI+L   +F   REG+ Y+ K+   + D   V HL+ F
Subjt:  MYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIHF

Query:  LSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRF
        L  ++  P        L     + S WP TATEL++ GISF  +K       F ER G+L++P III+++FE   RN+IAYEY   KS   SNF MFM F
Subjt:  LSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFE-KKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRF

Query:  LLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVL
        L+N++ DV+LL+ +GII N L S KE+  LF DLCKN++ ERN Y+  C +M++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ VVA +
Subjt:  LLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVL

Query:  SMPK
        ++ K
Subjt:  SMPK

A0A1S3BD00 UPF0481 protein At3g47200-like1.2e-9248.17Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        VP  L    P+AY PQ I IG       H+    ND   K  KG +V  F +VAK+  NE+I + + WE+ AR+ Y  +   K++  +F++ L+ D CFV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH
        VMY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC +  Q  L + SFI+L   +F + REG+ Y+ + + ++ D  EV HL+ 
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH

Query:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL
        FL  ++    + PH  D S         +S WP TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF 
Subjt:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL

Query:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA
        MFM FL+N++ DV+LL+ +GII NHL S +E+  LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ 
Subjt:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA

Query:  VVAVLSMPK
        VVA +++ K
Subjt:  VVAVLSMPK

A0A5A7VCL1 UPF0481 protein7.2e-9348.17Show/hide
Query:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        VP  L    P+AY PQ I IG       H+    ND   K  KG +V  F +VAK+  NE+I + + WE+ AR+ Y  +   K++  +F++ L+ D CFV
Subjt:  VPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNND---KIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH
        VMY++ S+  EF+D+DT SF WRF++ +F+DLLL +NQLPFFLL  LY+LC +  Q  L + SFI+L   +F + REG+ Y+ + + ++ D  EV HL+ 
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REGIGYLGKDFRVEEDKLEVKHLIH

Query:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL
        FL  ++    + PH  D S         +S WP TATEL+D GISF  +      + F ER G+L++P III+++FE   RN+IAYEY   KS  VSNF 
Subjt:  FLSCYM----NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKM-FDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFL

Query:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA
        MFM FL+N++ DV+LL+ +GII NHL S +E+  LF DLCKN++ ERNLY+ EC+KM++YCKHRRHRWM SLK DYF TPWA ISF+AAV+LLLLTL++ 
Subjt:  MFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRA

Query:  VVAVLSMPK
        VVA +++ K
Subjt:  VVAVLSMPK

A0A6J1BQ21 UPF0481 protein At3g47200-like2.8e-238100Show/hide
Query:  MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
        MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
Subjt:  MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM

Query:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK
        DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK
Subjt:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVK

Query:  HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM
        HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM
Subjt:  HLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLM

Query:  FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV
        FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV
Subjt:  FMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAV

Query:  VAVLSMPKPK
        VAVLSMPKPK
Subjt:  VAVLSMPKPK

A0A6J1BR42 UPF0481 protein At3g47200-like8.1e-13766.43Show/hide
Query:  DKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKI---MKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM
        D I  VP AL+   P A+IPQFI IG   P      S   D I   MK  F   F+S  +VELN+I+  VIG E+ A + Y    R  +   +F++ L++
Subjt:  DKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKI---MKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVM

Query:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNN-QTILGNTSFIQLTHQFFIDREGIGYLGKDFRV-EEDKLE
        D CFVVMY+L SV P FQD + SSF WRFNDA+FRD++LF+NQLPFFLL+SLY+LC  NN ++ILG  SFIQLTH+FF+ REGIGYLGKDFRV  EDKLE
Subjt:  DGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNN-QTILGNTSFIQLTHQFFIDREGIGYLGKDFRV-EEDKLE

Query:  VKHLIHFLSCYMNFPHQDDSSKSLDGTPLMV----SVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPG
        V H +HFLS YMN         SLD T   V    ++WPPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   
Subjt:  VKHLIHFLSCYMNFPHQDDSSKSLDGTPLMV----SVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPG

Query:  VSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLL
        VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VTKLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLL
Subjt:  VSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLL

Query:  TLMRAVVAVLSMPK
        TLM+ VVAVLSMPK
Subjt:  TLMRAVVAVLSMPK

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026455.6e-1025Show/hide
Query:  LDGTPLMVSVWPPTATELYDYGISFEKKSH--YSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGV-SNFLMFMRFLLNSDNDVNLLIKE
        ++  PL+  +  P+ ++L+  G+ F+  +H   S   FD  +G   +P I ++   E+ +RN++AYE T    P V + +   +  +++S+ DV LL ++
Subjt:  LDGTPLMVSVWPPTATELYDYGISFEKKSH--YSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGV-SNFLMFMRFLLNSDNDVNLLIKE

Query:  GIIHNHLESAKEVTKLFQDLCKNV-VAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVLS
        G++ + L+S +E  +++  + K+V + +    +   + + +Y   R    +  L   Y    W +++F+AAV+LL+L  ++    V S
Subjt:  GIIHNHLESAKEVTKLFQDLCKNV-VAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVLS

Q9SD53 UPF0481 protein At3g472001.4e-2125.64Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIG--HPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKF-LVMDG
        I  VP +   + P+AY P+ + IG  H        +  +  ++++  F+ + +    VE N ++  V+  E   R  Y      +L T   + F +V+DG
Subjt:  IGNVPAALFEMKPEAYIPQFIFIG--HPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKF-LVMDG

Query:  CFVVM-YMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFF---IDREGIGYLGKDFRVEEDKLE
        CF++M ++++S   E  +    S  W  + ++  DLLL +NQ+PFF+L++LY         I  ++   ++   FF   ID+EG       +  +    +
Subjt:  CFVVM-YMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFF---IDREGIGYLGKDFRVEEDKLE

Query:  VKHLIHFL----------SCYMNFPH--------QDDSSKSLD--GTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDER--TGILRVPHIIINETFE
         KHL+  +          S   + PH        +  +  S+D    PL++S     A  L   GI F  +      + + R     L++P +  +    
Subjt:  VKHLIHFL----------SCYMNFPH--------QDDSSKSLD--GTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDER--TGILRVPHIIINETFE

Query:  STMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEG-IIHNHLESAKEVTKLFQDLCKNVVAE--RNLYNYECQKMRKYCKHRRHRWMASLK
        S   N +A+E +    S  ++ +++FM  LLN++ DV  L  +  II NH  S  EV++ F+ + K+VV E   +  N   + + +Y K   +   A  +
Subjt:  STMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEG-IIHNHLESAKEVTKLFQDLCKNVVAE--RNLYNYECQKMRKYCKHRRHRWMASLK

Query:  HDYFNTPWALISFIAAVVLLLLTLMRAVVAVLS
        H +F +PW  +S  A + ++LLT++++ VA+LS
Subjt:  HDYFNTPWALISFIAAVVLLLLTLMRAVVAVLS

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)1.6e-2828.15Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        I  VP  L E   ++Y PQ + +G      +   S +  K      V +        +   I  +   E+ AR CY   G   L + +F++ LV+DGCF 
Subjt:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCV-SNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEE
        V+ +       F ++      +  ND VF          RD+++ +NQLP F+L  L +L + + NQT L     I+         E +   G+  ++E 
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCV-SNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEE

Query:  DKLEVKHLIHF-----LSCYMNFPHQDDSSKSLDGTPLMVSVWPPT--------------ATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETF
             K    F     L C   F      S       L    W                  TEL + GI F ++          + G L +P ++I++  
Subjt:  DKLEVKHLIHF-----LSCYMNFPHQDDSSKSLDGTPLMVSVWPPT--------------ATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETF

Query:  ESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVV--AERNLYNYECQKMRKYCKHRRHRWMASLK
        +S   N+IA+E   I  S  ++++++FM  L++S  DV+ L   GII + L S  EV  LF  LC+ VV   E +  +    ++ +Y  H+ + W A+LK
Subjt:  ESTMRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVV--AERNLYNYECQKMRKYCKHRRHRWMASLK

Query:  HDYFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP
        H YFN PWA++SF AAV+LL+LT  ++  AV +  KP
Subjt:  HDYFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP

AT3G50150.1 Plant protein of unknown function (DUF247)4.7e-2828.97Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIG-------HPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFL
        I  VP  L E   ++Y+PQ + IG       H  P   H          K   V    +  K  +   I  +   E+ AR CY      K ++ +F + L
Subjt:  IGNVPAALFEMKPEAYIPQFIFIG-------HPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFL

Query:  VMDGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCVSN-NQTILGNTSFIQLTHQFFIDREGI--GY
        V+DGCF V+ +       FQ I      +  ND VF          RD+++ +NQLP F+L  L  L     NQT +     ++         E +    
Subjt:  VMDGCFVVMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCVSN-NQTILGNTSFIQLTHQFFIDREGI--GY

Query:  LGKDFRVEEDKLEVKHLIHFLSCYMNFPHQDDSSKSLDGTPL----MVSVWPP---TATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFEST
           D + + D+L     +H L  +     Q  S  +  GTP     MV          TEL   G++F +K        + + G L++P ++I++  +S 
Subjt:  LGKDFRVEEDKLEVKHLIHFLSCYMNFPHQDDSSKSLDGTPL----MVSVWPP---TATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFEST

Query:  MRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRW---MASLKHD
          N+IA+E    + S  ++++++FM  L+NS  DV+ L  +GII + L S  EV  LF  LCK V+ +     Y  Q  R+  ++   +W    A+L+  
Subjt:  MRNIIAYEYT-IRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRW---MASLKHD

Query:  YFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP
        YFN PWA  SF AAV+LL LT  ++  AV +  KP
Subjt:  YFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP

AT3G50160.1 Plant protein of unknown function (DUF247)4.2e-2928.74Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        I  VP  L E   ++Y+PQ + IG      +H +     K      V    + AK ++   I  +   E+ AR CY   G   ++  +F++ LV+DG F+
Subjt:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEED
        +  +       FQ+I  +      ND VF          RD+++ +NQLP+ +L+ L  L       +L   + +QL   FF          +      +
Subjt:  VMYMLVSVFPEFQDIDTSSFFWRFNDAVF----------RDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEED

Query:  KLEVKHLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPP----TATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IR
         L  +  +H L        Q  SS + D    MV+  P       TEL + G+ F +K        + + G L++P ++I++  +S   N+IA+E   I+
Subjt:  KLEVKHLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPP----TATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYT-IR

Query:  KSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERN--LYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAA
         S  ++++++FM  L+NS  DV+ L   GII N L S  EV+ LF  L K V+ + N    +    ++  Y + + +   A+L+H YFN PWA  SFIAA
Subjt:  KSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERN--LYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAA

Query:  VVLLLLTLMRAVVAVLSMPKP
        V LL+ T  ++  AV +  KP
Subjt:  VVLLLLTLMRAVVAVLSMPKP

AT4G31980.1 unknown protein8.3e-4130.98Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
        I  VP  L  + P+AY P+ +  G      E   +  + K     ++  F       L +++     WEQ AR CY      KL + +FV+ LV+DG F+
Subjt:  IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV

Query:  VMYMLVSVFPEFQDIDTSSF--FWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKL--EVKH
        V  +L S +P  +  +   F       D V RD++L +NQLPFF+++ ++ L ++  Q   G  S IQL  + F       Y     R++++K   E +H
Subjt:  VMYMLVSVFPEFQDIDTSSF--FWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKL--EVKH

Query:  LIHFL-SCYM-NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFD--ERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSN
         +  L SCY+  FP +      L+ T + V    P ATEL+  G+ F K +  S  + D     G+L++P I++++  ES  +NII +E     +    +
Subjt:  LIHFL-SCYM-NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFD--ERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSN

Query:  FLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLY-NYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTL
        ++M +   + S  D +LLI  GII N+L ++ +V+ LF  + K V+ +R  Y +   + ++ YC    +RW A L+ DYF+ PWA+ S  AA++LLLLT 
Subjt:  FLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLY-NYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTL

Query:  MRAVVAVLSM
        +++V ++L++
Subjt:  MRAVVAVLSM

AT5G22550.2 Plant protein of unknown function (DUF247)8.9e-2725.16Show/hide
Query:  IGNVPAALFEMKPEAYIPQFIFIG--HPDPTVEHF-VSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCY---LFIGRRKLDTVQFVKFLV
        I  +P  L ++  +AY P+ + IG  H     +H  +   + K     FV K +    V L  ++  V G EQ  RD Y   L   ++KL     +K ++
Subjt:  IGNVPAALFEMKPEAYIPQFIFIG--HPDPTVEHF-VSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCY---LFIGRRKLDTVQFVKFLV

Query:  MDGCFVVM-YMLVSVFPEFQDIDTSSFFWRFNDAVFR-DLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKL
        +DGCF++M +++VS   E+ ++    F  R+     R DLLL +NQ+P FLL+ L +        +  +TS   L  +FF   +      + F  + + L
Subjt:  MDGCFVVM-YMLVSVFPEFQDIDTSSFFWRFNDAVFR-DLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKL

Query:  EVKHLIHFL---------------SCYMN-------FPHQDDS----------SKSLDGTPLMVSVWPP---------TATELYDYGISFEKKSHYSQKM
          KHL+  +                C +N       +   + S          SK + G        PP         +A +L   GI F +K +    +
Subjt:  EVKHLIHFL---------------SCYMN-------FPHQDDS----------SKSLDGTPLMVSVWPP---------TATELYDYGISFEKKSHYSQKM

Query:  -FDERTGILRVPHIIINETFESTMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNV--VAERNLYNYE
            ++G++ +P ++ ++   + + N +A+E + +  S  +++F++FM  L+N+++D   LI++GI+ N+  + +EV+  F+++ K++     ++  +  
Subjt:  -FDERTGILRVPHIIINETFESTMRNIIAYE-YTIRKSPGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNV--VAERNLYNYE

Query:  CQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP
         + + +Y     H   A  K+ +FNTPW  +S  AA+VLLLLT+ +A  A  +  +P
Subjt:  CQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTGGACAAAATTGGCAACGTTCCGGCAGCTCTGTTCGAAATGAAACCAGAAGCTTATATTCCCCAATTCATCTTCATCGGCCATCCTGATCCTACTGTC
GAGCATTTTGTAAGTGGAAATAATGATAAGATCATGAAAGGCTGCTTTGTCGGAAAGTTTTCTTCCGTTGCGAAAGTGGAGTTGAATGAGATTATAGGACGAGTC
ATAGGTTGGGAGCAAACAGCTCGTGATTGTTATTTGTTCATAGGTCGGAGAAAATTGGACACAGTCCAGTTTGTGAAGTTTCTAGTCATGGATGGTTGTTTCGTG
GTCATGTATATGCTGGTTTCTGTGTTCCCGGAGTTTCAGGACATCGACACGTCATCGTTCTTTTGGAGATTCAACGATGCAGTATTCAGAGATCTGTTACTGTTT
CAAAACCAACTTCCTTTCTTTCTTCTCCGGTCTCTATACGACCTGTGCGTCTCCAATAATCAAACTATACTTGGAAACACCTCTTTCATTCAACTTACTCATCAA
TTTTTTATTGACCGTGAAGGGATCGGTTATCTTGGAAAGGATTTTAGGGTAGAGGAAGACAAATTAGAAGTGAAGCATCTTATTCATTTTCTAAGTTGTTATATG
AACTTTCCTCATCAAGACGATTCTTCAAAATCATTGGACGGCACCCCACTCATGGTTTCTGTTTGGCCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCT
TTCGAGAAGAAATCACATTATTCTCAAAAGATGTTTGATGAACGTACCGGCATTCTCAGAGTGCCTCACATCATAATAAATGAGACTTTCGAAAGCACGATGAGA
AACATCATAGCTTACGAGTATACAATTCGCAAGAGTCCAGGCGTAAGCAACTTCTTGATGTTCATGCGTTTCTTGTTGAACTCCGACAACGATGTAAATTTGCTC
ATAAAGGAGGGGATTATCCACAACCATTTGGAAAGCGCAAAGGAAGTTACTAAGTTGTTCCAGGACCTTTGTAAGAACGTTGTGGCCGAAAGAAATTTGTACAAC
TATGAATGTCAGAAAATGAGAAAATACTGCAAGCACCGCCGCCATCGATGGATGGCTTCGTTGAAACACGACTATTTTAACACGCCGTGGGCTTTGATCTCCTTC
ATTGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCGAGCGGTGGTAGCTGTACTCTCCATGCCTAAGCCTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTGGACAAAATTGGCAACGTTCCGGCAGCTCTGTTCGAAATGAAACCAGAAGCTTATATTCCCCAATTCATCTTCATCGGCCATCCTGATCCTACTGTC
GAGCATTTTGTAAGTGGAAATAATGATAAGATCATGAAAGGCTGCTTTGTCGGAAAGTTTTCTTCCGTTGCGAAAGTGGAGTTGAATGAGATTATAGGACGAGTC
ATAGGTTGGGAGCAAACAGCTCGTGATTGTTATTTGTTCATAGGTCGGAGAAAATTGGACACAGTCCAGTTTGTGAAGTTTCTAGTCATGGATGGTTGTTTCGTG
GTCATGTATATGCTGGTTTCTGTGTTCCCGGAGTTTCAGGACATCGACACGTCATCGTTCTTTTGGAGATTCAACGATGCAGTATTCAGAGATCTGTTACTGTTT
CAAAACCAACTTCCTTTCTTTCTTCTCCGGTCTCTATACGACCTGTGCGTCTCCAATAATCAAACTATACTTGGAAACACCTCTTTCATTCAACTTACTCATCAA
TTTTTTATTGACCGTGAAGGGATCGGTTATCTTGGAAAGGATTTTAGGGTAGAGGAAGACAAATTAGAAGTGAAGCATCTTATTCATTTTCTAAGTTGTTATATG
AACTTTCCTCATCAAGACGATTCTTCAAAATCATTGGACGGCACCCCACTCATGGTTTCTGTTTGGCCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCT
TTCGAGAAGAAATCACATTATTCTCAAAAGATGTTTGATGAACGTACCGGCATTCTCAGAGTGCCTCACATCATAATAAATGAGACTTTCGAAAGCACGATGAGA
AACATCATAGCTTACGAGTATACAATTCGCAAGAGTCCAGGCGTAAGCAACTTCTTGATGTTCATGCGTTTCTTGTTGAACTCCGACAACGATGTAAATTTGCTC
ATAAAGGAGGGGATTATCCACAACCATTTGGAAAGCGCAAAGGAAGTTACTAAGTTGTTCCAGGACCTTTGTAAGAACGTTGTGGCCGAAAGAAATTTGTACAAC
TATGAATGTCAGAAAATGAGAAAATACTGCAAGCACCGCCGCCATCGATGGATGGCTTCGTTGAAACACGACTATTTTAACACGCCGTGGGCTTTGATCTCCTTC
ATTGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCGAGCGGTGGTAGCTGTACTCTCCATGCCTAAGCCTAAGTGA
Protein sequenceShow/hide protein sequence
MELDKIGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIMKGCFVGKFSSVAKVELNEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFV
VMYMLVSVFPEFQDIDTSSFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFIDREGIGYLGKDFRVEEDKLEVKHLIHFLSCYM
NFPHQDDSSKSLDGTPLMVSVWPPTATELYDYGISFEKKSHYSQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKSPGVSNFLMFMRFLLNSDNDVNLL
IKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYECQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAVVLLLLTLMRAVVAVLSMPKPK