; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g04370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g04370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUPF0481 protein At3g47200-like
Genome locationchr8:3181886..3187555
RNA-Seq ExpressionMoc08g04370
SyntenyMoc08g04370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064930.1 UPF0481 protein [Cucumis melo var. makuwa]1.1e-9445.45Show/hide
Query:  NNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFAS-
        NN      +    D EIK I + M E  ++ +S + + S++      I+ VP  L   NP A+ PQ I IGP+H +      D  I++ K  +  NF + 
Subjt:  NNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFAS-

Query:  TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNN
         ++  N+++   +  E+RA  +Y   I+M R EF++ LI D CFVVMYI+ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCAF  
Subjt:  TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNN

Query:  YRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM-F
         +  L + SFI+L   +F   REG+ Y+ + +  L  D  EVNH V FL  ++    H      + V   L+ WP TATEL+D GISF  +      + F
Subjt:  YRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM-F

Query:  DERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMR
         ER G+L++P III+++FE + RN+IA+E+CH K + VSNF +FM FL+N++ DV LL+ +GII NHL ST+ +  LF+DLC+N++VE NLY+ EC++M+
Subjt:  DERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMR

Query:  EYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  EYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

XP_004138858.1 UPF0481 protein At3g47200 [Cucumis sativus]2.0e-9144.71Show/hide
Query:  EGNNVTLGTVSEDEEDVEIKVIRNMM--KERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF
        E NN T  T S DE   EIKVI + M      S       ++S+      I+ VP  L + NP A+ PQ I IGP+H +      + +I++ K  +  NF
Subjt:  EGNNVTLGTVSEDEEDVEIKVIRNMM--KERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF

Query:  AS-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCA
         +  +++ N++++  +  E+RA  +Y   I M R EF++ LI D CFVVMY++ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCA
Subjt:  AS-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCA

Query:  FNNYRSILGEFSFIQLTHKFF-MAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPS----LAIWPPTATELYDYGISFE-KKSHYSQ
          + +  L + SFI+L   +F   REG+ Y+ + +     D   VNH V FL  ++    H     G S    L+ WP TATEL++ GISF  +K     
Subjt:  FNNYRSILGEFSFIQLTHKFF-MAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPS----LAIWPPTATELYDYGISFE-KKSHYSQ

Query:  KMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECK
          F ER G+L++P III+++FE + RN+IA+E+CH K +  SNF +FM FL+N++ DV LL+ +GII N L STK +  LF DLC+N+++E N Y+  C 
Subjt:  KMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECK

Query:  RMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        RM+EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  RMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

XP_008445209.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]1.9e-9446.14Show/hide
Query:  EGNNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFA
        E NN    T S DE   EIK I + M E  ++ +S + + S++      I+ VP  L   NP A+ PQ I IGP+H +      D  I++ K  +  NF 
Subjt:  EGNNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFA

Query:  S-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAF
        +  ++  N+++   +  E+RA  +Y   I+M R EF++ LI D CFVVMYI+ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCAF
Subjt:  S-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAF

Query:  NNYRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM
           +  L + SFI+L   +F   REG+ Y+ + +  L  D  EVNH V FL  ++    H      + V   L+ WP TATEL+D GISF  +      +
Subjt:  NNYRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM

Query:  -FDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKR
         F ER G+L++P III+++FE + RN+IA+E+CH K + VSNF +FM FL+N++ DV LL+ +GII NHL ST+ +  LF+DLC+N++VE NLY+ EC++
Subjt:  -FDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKR

Query:  MREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        M+EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  MREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

XP_022131636.1 UPF0481 protein At3g47200-like [Momordica charantia]9.9e-13666.43Show/hide
Query:  DGIFRVPLALWRANPNAFIPQFIPIG---PVHGFHNSYGYDPIIRKMKFYFFRNFAS-TEVELNKIVRLVIGCEKRATEFYKIDIR--MGRAEFLESLIL
        D I  VP AL+   P A+IPQFI IG   P      S   D I   MK  F   F+S  +VELN+I+  VIG E+ A + Y    R  +   +F++ L++
Subjt:  DGIFRVPLALWRANPNAFIPQFIPIG---PVHGFHNSYGYDPIIRKMKFYFFRNFAS-TEVELNKIVRLVIGCEKRATEFYKIDIR--MGRAEFLESLIL

Query:  DCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLE
        D CFVVMY+L SV P FQD + SSF WRFNDA+FRD++LF+NQLPFFLL+SLY+LC  NN ++ILG  SFIQLTH+FF+ REGIGYLGKDFRV  EDKLE
Subjt:  DCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLE

Query:  VNHFVHFLSYYMN---------SLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQV
        V H +HFLS YMN         SLD T   V    ++WPPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   
Subjt:  VNHFVHFLSYYMN---------SLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQV

Query:  VSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLL
        VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VTKLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLL
Subjt:  VSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLL

Query:  TLMQTVVAVLSMPK
        TLM+ VVAVLSMPK
Subjt:  TLMQTVVAVLSMPK

XP_022132033.1 UPF0481 protein At3g47200-like [Momordica charantia]1.1e-243100Show/hide
Query:  MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK
        MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK
Subjt:  MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK

Query:  IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI
        IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI
Subjt:  IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI

Query:  GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH
        GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH
Subjt:  GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH

Query:  CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA
        CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA
Subjt:  CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA

Query:  AVVLLLLTLMQTVVAVLSMPK
        AVVLLLLTLMQTVVAVLSMPK
Subjt:  AVVLLLLTLMQTVVAVLSMPK

TrEMBL top hitse value%identityAlignment
A0A0A0LPK8 Uncharacterized protein9.5e-9244.71Show/hide
Query:  EGNNVTLGTVSEDEEDVEIKVIRNMM--KERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF
        E NN T  T S DE   EIKVI + M      S       ++S+      I+ VP  L + NP A+ PQ I IGP+H +      + +I++ K  +  NF
Subjt:  EGNNVTLGTVSEDEEDVEIKVIRNMM--KERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF

Query:  AS-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCA
         +  +++ N++++  +  E+RA  +Y   I M R EF++ LI D CFVVMY++ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCA
Subjt:  AS-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCA

Query:  FNNYRSILGEFSFIQLTHKFF-MAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPS----LAIWPPTATELYDYGISFE-KKSHYSQ
          + +  L + SFI+L   +F   REG+ Y+ + +     D   VNH V FL  ++    H     G S    L+ WP TATEL++ GISF  +K     
Subjt:  FNNYRSILGEFSFIQLTHKFF-MAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPS----LAIWPPTATELYDYGISFE-KKSHYSQ

Query:  KMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECK
          F ER G+L++P III+++FE + RN+IA+E+CH K +  SNF +FM FL+N++ DV LL+ +GII N L STK +  LF DLC+N+++E N Y+  C 
Subjt:  KMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECK

Query:  RMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        RM+EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  RMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

A0A1S3BD00 UPF0481 protein At3g47200-like9.1e-9546.14Show/hide
Query:  EGNNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFA
        E NN    T S DE   EIK I + M E  ++ +S + + S++      I+ VP  L   NP A+ PQ I IGP+H +      D  I++ K  +  NF 
Subjt:  EGNNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFA

Query:  S-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAF
        +  ++  N+++   +  E+RA  +Y   I+M R EF++ LI D CFVVMYI+ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCAF
Subjt:  S-TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAF

Query:  NNYRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM
           +  L + SFI+L   +F   REG+ Y+ + +  L  D  EVNH V FL  ++    H      + V   L+ WP TATEL+D GISF  +      +
Subjt:  NNYRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM

Query:  -FDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKR
         F ER G+L++P III+++FE + RN+IA+E+CH K + VSNF +FM FL+N++ DV LL+ +GII NHL ST+ +  LF+DLC+N++VE NLY+ EC++
Subjt:  -FDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKR

Query:  MREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        M+EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  MREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

A0A5A7VCL1 UPF0481 protein5.4e-9545.45Show/hide
Query:  NNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFAS-
        NN      +    D EIK I + M E  ++ +S + + S++      I+ VP  L   NP A+ PQ I IGP+H +      D  I++ K  +  NF + 
Subjt:  NNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQEN-SSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFAS-

Query:  TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNN
         ++  N+++   +  E+RA  +Y   I+M R EF++ LI D CFVVMYI+ S+V  F+D + +SF WRF++ IF+D++L ENQLPFFLL  LY LCAF  
Subjt:  TEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNN

Query:  YRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM-F
         +  L + SFI+L   +F   REG+ Y+ + +  L  D  EVNH V FL  ++    H      + V   L+ WP TATEL+D GISF  +      + F
Subjt:  YRSILGEFSFIQLTHKFFM-AREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDH----TRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKM-F

Query:  DERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMR
         ER G+L++P III+++FE + RN+IA+E+CH K + VSNF +FM FL+N++ DV LL+ +GII NHL ST+ +  LF+DLC+N++VE NLY+ EC++M+
Subjt:  DERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMR

Query:  EYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        EY KHRRHRWM SLKRDYF TPWA ISF+AAV+LLLLTL+QTVVA +++ K
Subjt:  EYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

A0A6J1BQ21 UPF0481 protein At3g47200-like4.8e-13666.43Show/hide
Query:  DGIFRVPLALWRANPNAFIPQFIPIG---PVHGFHNSYGYDPIIRKMKFYFFRNFAS-TEVELNKIVRLVIGCEKRATEFYKIDIR--MGRAEFLESLIL
        D I  VP AL+   P A+IPQFI IG   P      S   D I   MK  F   F+S  +VELN+I+  VIG E+ A + Y    R  +   +F++ L++
Subjt:  DGIFRVPLALWRANPNAFIPQFIPIG---PVHGFHNSYGYDPIIRKMKFYFFRNFAS-TEVELNKIVRLVIGCEKRATEFYKIDIR--MGRAEFLESLIL

Query:  DCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLE
        D CFVVMY+L SV P FQD + SSF WRFNDA+FRD++LF+NQLPFFLL+SLY+LC  NN ++ILG  SFIQLTH+FF+ REGIGYLGKDFRV  EDKLE
Subjt:  DCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLE

Query:  VNHFVHFLSYYMN---------SLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQV
        V H +HFLS YMN         SLD T   V    ++WPPTATELYDYGISFEKKSHYSQKMFDER GILR+PHIIINETFES +RNIIA+E+  RK   
Subjt:  VNHFVHFLSYYMN---------SLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQV

Query:  VSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLL
        VSNF +FMRFLLNSDNDV LLIKEGIIHNHLES K VTKLF DLC+NVV E NLYNYEC++MR+Y KHRRHRWMASLK DYFNTPWALISFIAAVVLLLL
Subjt:  VSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLL

Query:  TLMQTVVAVLSMPK
        TLM+ VVAVLSMPK
Subjt:  TLMQTVVAVLSMPK

A0A6J1BR42 UPF0481 protein At3g47200-like5.3e-244100Show/hide
Query:  MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK
        MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK
Subjt:  MMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYK

Query:  IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI
        IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI
Subjt:  IDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGI

Query:  GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH
        GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH
Subjt:  GYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEH

Query:  CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA
        CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA
Subjt:  CHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIA

Query:  AVVLLLLTLMQTVVAVLSMPK
        AVVLLLLTLMQTVVAVLSMPK
Subjt:  AVVLLLLTLMQTVVAVLSMPK

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026454.8e-0824.16Show/hide
Query:  PTATELYDYGISFEKKSH--YSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVV-SNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKG
        P+ ++L+  G+ F+  +H   S   FD  +G   LP I ++   E+++RN++A+E  +    +V + +   +  +++S+ DV LL ++G++ + L+S + 
Subjt:  PTATELYDYGISFEKKSH--YSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVV-SNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKG

Query:  VTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKR---DYFNTPWALISFIAAVVLLLLTLMQTVVAVLS
          ++++ + ++  V      +  K + +  ++   RW   + R    Y    W +++F+AAV+LL+L  +Q    V S
Subjt:  VTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRWMASLKR---DYFNTPWALISFIAAVVLLLLTLMQTVVAVLS

Q9SD53 UPF0481 protein At3g472003.3e-2525.9Show/hide
Query:  IFRVPLALWRANPNAFIPQFIPIGPVH-GFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVM
        IFRVP +    NP A+ P+ + IGP H G  +         ++   F       +VE N +V+ V+  E +  + Y  +++ G  + +  ++LD CF++M
Subjt:  IFRVPLALWRANPNAFIPQFIPIGPVH-GFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVM

Query:  --YILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLY-----------ELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDF--R
           I+S  + L +D  + S  W  + +I  D++L ENQ+PFF+LQ+LY              AF+ +++ + +       H+ + A+  +  + + F   
Subjt:  --YILSSVVPLFQDFEMSSFSWRFNDAIFRDMVLFENQLPFFLLQSLY-----------ELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDF--R

Query:  VLHEDKLEVNHF-VHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDER--AGILRLPHIIINETFESMIRNIIAFEHCHR-K
            DK    H  V        ++     +  P +     +A  L   GI F  +      + + R     L++P +  +    S   N +AFE  +   
Subjt:  VLHEDKLEVNHF-VHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFDER--AGILRLPHIIINETFESMIRNIIAFEHCHR-K

Query:  RQVVSNFAIFMRFLLNSDNDVILLIKEG-IIHNHLESTKGVTKLFHDLCENVV--VETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAA
           ++ + +FM  LLN++ DV  L  +  II NH  S   V++ F  + ++VV  V+T+  N   K + EY K   +   A  +  +F +PW  +S  A 
Subjt:  RQVVSNFAIFMRFLLNSDNDVILLIKEG-IIHNHLESTKGVTKLFHDLCENVV--VETNLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAA

Query:  VVLLLLTLMQTVVAVLS
        + ++LLT++Q+ VA+LS
Subjt:  VVLLLLTLMQTVVAVLS

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)2.9e-3227.78Show/hide
Query:  EDEEDVEIKVIRNMMKE-RDSYLLS--------EQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTE
        + + D  I VI+   K+ RD +++S         +++ + + G   I+RVP  L   +  ++ PQ + +GP H  H        +R M  + +R      
Subjt:  EDEEDVEIKVIRNMMKE-RDSYLLS--------EQENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTE

Query:  VELNKIVRLVIGC----EKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFL
           N+ +++ I      E++A   Y+  + +   EF+E L+LD CFV+     +V         +   +  ND +F          RDMV+ ENQLP F+
Subjt:  VELNKIVRLVIGC----EKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFL

Query:  LQSLYELCAFNNYRSILGEFSFIQLTHKFF---------MAREGIGYLGKDF-RVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLA--IWPPT----
        L  L EL      ++ L      QL  +FF         + + G   L     R    D       +H L  +  SL  +  +  P L    W       
Subjt:  LQSLYELCAFNNYRSILGEFSFIQLTHKFF---------MAREGIGYLGKDF-RVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLA--IWPPT----

Query:  ----------ATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNH
                   TEL + GI F ++          + G L +P ++I++  +S+  N+IAFE CH      ++++ IFM  L++S  DV  L   GII + 
Subjt:  ----------ATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNH

Query:  LESTKGVTKLFHDLCENVVVET--NLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK
        L S   V  LF+ LC+ VV +T  +  +     +  Y  H+ + W A+LK  YFN PWA++SF AAV+LL+LT  Q+  AV +  K
Subjt:  LESTKGVTKLFHDLCENVVVET--NLYNYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK

AT3G50140.1 Plant protein of unknown function (DUF247)1.0e-2928.8Show/hide
Query:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRN----FASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCF
        I+RVPL+L +++ N++ PQ + +GP H     +G D  +R M ++ +R        T+  +   +  +   E+RA   Y+  I +   +F + L+LD CF
Subjt:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRN----FASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCF

Query:  VVMYILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFF-------MAREGIGYL
        V+     +    ++ F  S   +  ND +F          RDM++ ENQLP F+L  L EL     Y++ L      QL  +FF       M+   I   
Subjt:  VVMYILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFF-------MAREGIGYL

Query:  ----GKDFRVLHEDKLEVNHFVHFLSYYMNS-----------LDHTRWRVGPSLAIWPP-----TATELYDYGISFEKKSHYSQKMFD--ERAGILRLPH
             K F  + + + E    +H L  +  S           L  +RW   P +A           TEL + GI F+++   S + +D   + G L +P 
Subjt:  ----GKDFRVLHEDKLEVNHFVHFLSYYMNS-----------LDHTRWRVGPSLAIWPP-----TATELYDYGISFEKKSHYSQKMFD--ERAGILRLPH

Query:  IIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVE-TNLYNYE-CKRMREYGKHRRH
        ++I++  +S+  N+IA+E CH      ++++ IFM  L++S  D+  L    II + L +   V  +F+ LC+ V  +  N Y  E   ++  Y   + +
Subjt:  IIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVE-TNLYNYE-CKRMREYGKHRRH

Query:  RWMASLKRDYFNTPWALISFIAAVVLLLLTLMQT
           A+LK  YF+ PWA  SF AAV+LLLLTL Q+
Subjt:  RWMASLKRDYFNTPWALISFIAAVVLLLLTLMQT

AT3G50150.1 Plant protein of unknown function (DUF247)1.4e-3129.37Show/hide
Query:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRA-EFLESLILDCCFVVM
        I+RVP  L   +  +++PQ + IGP H  H      P+ R          A T+  +   +  +   E+ A   Y+  I M  + EF E L+LD CFV+ 
Subjt:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRA-EFLESLILDCCFVVM

Query:  YILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFLLQSLYEL-CAFNNYRSILGEFSFIQLTHKFFMAREGIG-YLGKDFRVL-
            ++    Q F+     +  ND +F          RDM++ ENQLP F+L  L  L     N   I+ E +      +FF         L K  R L 
Subjt:  YILSSVVPLFQDFEMSSFSWRFNDAIF----------RDMVLFENQLPFFLLQSLYEL-CAFNNYRSILGEFSFIQLTHKFFMAREGIG-YLGKDFRVL-

Query:  ---HEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPT-----------ATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNII
             D+L  N  +H L  +  SL  +             +            TEL   G++F +K        + + G L++P ++I++  +S+  N+I
Subjt:  ---HEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPT-----------ATELYDYGISFEKKSHYSQKMFDERAGILRLPHIIINETFESMIRNII

Query:  AFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRW---MASLKRDYFNTP
        AFE CH +    ++++ IFM  L+NS  DV  L  +GII + L S   V  LF+ LC+ V+ +     Y  +  RE  ++   +W    A+L++ YFN P
Subjt:  AFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGKHRRHRW---MASLKRDYFNTP

Query:  WALISFIAAVVLLLLTLMQTVVAVLSMPK
        WA  SF AAV+LL LT  Q+  AV +  K
Subjt:  WALISFIAAVVLLLLTLMQTVVAVLSMPK

AT3G50160.1 Plant protein of unknown function (DUF247)1.0e-2929Show/hide
Query:  GEGNNVTLGTVSEDEEDVEIKVI---RNMMKERDSYLLSEQ-------ENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRK
        GE  N+ +      E  VE  V    +N  K R+ +++S         +N++       I+RVP  L   +  +++PQ + IGP H  H      P+ R 
Subjt:  GEGNNVTLGTVSEDEEDVEIKVI---RNMMKERDSYLLSEQ-------ENSSNIRGTDGIFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRK

Query:  MKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSS----FSWR-FNDAIFRDMVLFENQL
                 A  + ++   +  +   E++A   Y+  I M R EF+E L+LD  F++  I       FQ+   +     F  R    +I RDMV+ ENQL
Subjt:  MKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSS----FSWR-FNDAIFRDMVLFENQL

Query:  PFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFE
        P+ +L+ L +L        +L + + +QL   FF        +  +   LH   L+V       S   +  D +     P   I     TEL + G+ F 
Subjt:  PFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFE

Query:  KKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVET
        +K        + + G L++P ++I++  +S+  N+IAFE CH +  + ++++ IFM  L+NS  DV  L   GII N L S   V+ LF+ L + V+ + 
Subjt:  KKSHYSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCH-RKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVET

Query:  NLYNYECKRMREYGKHRRHRW---MASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLS
        N   Y      E   + R +W    A+L+  YFN PWA  SFIAAV LL+ T  Q+  AV +
Subjt:  NLYNYECKRMREYGKHRRHRW---MASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLS

AT4G31980.1 unknown protein3.7e-4030.88Show/hide
Query:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF-ASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVM
        I++VP  L R NP+A+ P+ +  GP+H           +   K+ +  +F   T   L  +VRL    E+ A   Y  D+++   EF+E L++D  F+V 
Subjt:  IFRVPLALWRANPNAFIPQFIPIGPVHGFHNSYGYDPIIRKMKFYFFRNF-ASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVM

Query:  YILSSVVPLFQDFEMSSF--SWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKL--EVNH
         +L S  P  +      F  S    D + RDM+L ENQLPFF+++ ++ L   N Y+   G  S IQL  + F       +L +    + ++K   E  H
Subjt:  YILSSVVPLFQDFEMSSF--SWRFNDAIFRDMVLFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKL--EVNH

Query:  FVH-----FLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFD--ERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFA
        FV      +L  +   L++T  +V  +     P ATEL+  G+ F K +  S  + D     G+L++P I++++  ES+ +NII FE C    +   ++ 
Subjt:  FVH-----FLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSHYSQKMFD--ERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFA

Query:  IFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLY-NYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQ
        + +   + S  D  LLI  GII N+L ++  V+ LF+ + + V+ +   Y +   + ++ Y     +RW A L+RDYF+ PWA+ S  AA++LLLLT +Q
Subjt:  IFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLY-NYECKRMREYGKHRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQ

Query:  TVVAVLSM
        +V ++L++
Subjt:  TVVAVLSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCAGAGTGGTCTCTGCCTGATTAGAATTGGTCTTTTTAGACCAACAGTCTGCTTGAAAATGTCCTTAGCGTCCACAATTGAAGCATTGTATTGCGCTCGAAGCCT
TGAAGTGGAGCCCGGGGAAGGCAACAACGTGACTCTGGGCACTGTATCAGAAGATGAAGAAGATGTGGAAATCAAAGTTATTCGTAATATGATGAAAGAACGCGATAGTT
ATCTGTTGTCTGAGCAAGAAAATTCTTCAAACATTAGGGGCACCGACGGCATTTTTAGAGTTCCGCTGGCTCTGTGGAGAGCGAACCCAAATGCTTTTATTCCCCAATTC
ATCCCCATCGGCCCTGTTCATGGTTTTCATAATAGTTATGGTTATGATCCGATAATTAGGAAAATGAAATTCTACTTTTTTCGAAATTTTGCTTCTACAGAAGTCGAGTT
GAATAAGATTGTACGACTAGTCATAGGTTGTGAAAAAAGAGCTACCGAGTTTTACAAAATAGACATCAGAATGGGCAGAGCCGAGTTTTTGGAGTCCCTAATCTTGGATT
GTTGTTTCGTGGTCATGTATATCCTCTCTTCTGTGGTCCCGTTGTTTCAGGACTTCGAAATGTCGTCGTTTTCTTGGAGATTCAACGATGCAATATTCAGAGATATGGTA
CTCTTTGAAAACCAACTCCCTTTCTTCCTTCTCCAGTCTCTATATGAACTCTGCGCCTTCAATAATTATCGATCTATACTTGGGGAGTTCTCTTTCATTCAACTTACTCA
CAAATTTTTTATGGCTCGTGAAGGGATTGGTTATCTTGGAAAAGATTTTAGGGTACTGCACGAGGATAAATTAGAAGTGAATCATTTTGTTCATTTTCTAAGTTATTATA
TGAACTCGTTGGACCACACAAGATGGCGCGTCGGGCCTAGCCTCGCAATTTGGCCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCTTTCGAGAAGAAATCACAT
TATTCTCAAAAGATGTTTGATGAACGTGCCGGCATTCTCAGGCTGCCACACATCATAATAAACGAGACTTTCGAAAGCATGATCAGAAACATCATAGCTTTCGAGCATTG
TCACCGCAAGAGACAGGTTGTAAGCAACTTTGCGATATTCATGCGTTTCTTGTTAAACTCCGATAACGATGTGATTTTGTTGATTAAGGAGGGGATTATACACAACCATT
TGGAAAGCACAAAGGGAGTTACTAAATTGTTCCACGACCTTTGTGAGAACGTTGTGGTTGAAACAAATTTATACAACTATGAATGTAAGAGAATGAGAGAATACGGCAAG
CACCGCCGCCATCGGTGGATGGCTTCGTTGAAACGCGACTATTTTAACACGCCATGGGCTTTGATCTCCTTCATCGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCA
AACGGTGGTAGCTGTCCTCTCCATGCCTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCAGAGTGGTCTCTGCCTGATTAGAATTGGTCTTTTTAGACCAACAGTCTGCTTGAAAATGTCCTTAGCGTCCACAATTGAAGCATTGTATTGCGCTCGAAGCCT
TGAAGTGGAGCCCGGGGAAGGCAACAACGTGACTCTGGGCACTGTATCAGAAGATGAAGAAGATGTGGAAATCAAAGTTATTCGTAATATGATGAAAGAACGCGATAGTT
ATCTGTTGTCTGAGCAAGAAAATTCTTCAAACATTAGGGGCACCGACGGCATTTTTAGAGTTCCGCTGGCTCTGTGGAGAGCGAACCCAAATGCTTTTATTCCCCAATTC
ATCCCCATCGGCCCTGTTCATGGTTTTCATAATAGTTATGGTTATGATCCGATAATTAGGAAAATGAAATTCTACTTTTTTCGAAATTTTGCTTCTACAGAAGTCGAGTT
GAATAAGATTGTACGACTAGTCATAGGTTGTGAAAAAAGAGCTACCGAGTTTTACAAAATAGACATCAGAATGGGCAGAGCCGAGTTTTTGGAGTCCCTAATCTTGGATT
GTTGTTTCGTGGTCATGTATATCCTCTCTTCTGTGGTCCCGTTGTTTCAGGACTTCGAAATGTCGTCGTTTTCTTGGAGATTCAACGATGCAATATTCAGAGATATGGTA
CTCTTTGAAAACCAACTCCCTTTCTTCCTTCTCCAGTCTCTATATGAACTCTGCGCCTTCAATAATTATCGATCTATACTTGGGGAGTTCTCTTTCATTCAACTTACTCA
CAAATTTTTTATGGCTCGTGAAGGGATTGGTTATCTTGGAAAAGATTTTAGGGTACTGCACGAGGATAAATTAGAAGTGAATCATTTTGTTCATTTTCTAAGTTATTATA
TGAACTCGTTGGACCACACAAGATGGCGCGTCGGGCCTAGCCTCGCAATTTGGCCACCCACTGCCACTGAGCTTTACGATTACGGCATTTCTTTCGAGAAGAAATCACAT
TATTCTCAAAAGATGTTTGATGAACGTGCCGGCATTCTCAGGCTGCCACACATCATAATAAACGAGACTTTCGAAAGCATGATCAGAAACATCATAGCTTTCGAGCATTG
TCACCGCAAGAGACAGGTTGTAAGCAACTTTGCGATATTCATGCGTTTCTTGTTAAACTCCGATAACGATGTGATTTTGTTGATTAAGGAGGGGATTATACACAACCATT
TGGAAAGCACAAAGGGAGTTACTAAATTGTTCCACGACCTTTGTGAGAACGTTGTGGTTGAAACAAATTTATACAACTATGAATGTAAGAGAATGAGAGAATACGGCAAG
CACCGCCGCCATCGGTGGATGGCTTCGTTGAAACGCGACTATTTTAACACGCCATGGGCTTTGATCTCCTTCATCGCTGCCGTCGTCCTGCTTTTACTCACTCTCATGCA
AACGGTGGTAGCTGTCCTCTCCATGCCTAAGTGA
Protein sequenceShow/hide protein sequence
MHQSGLCLIRIGLFRPTVCLKMSLASTIEALYCARSLEVEPGEGNNVTLGTVSEDEEDVEIKVIRNMMKERDSYLLSEQENSSNIRGTDGIFRVPLALWRANPNAFIPQF
IPIGPVHGFHNSYGYDPIIRKMKFYFFRNFASTEVELNKIVRLVIGCEKRATEFYKIDIRMGRAEFLESLILDCCFVVMYILSSVVPLFQDFEMSSFSWRFNDAIFRDMV
LFENQLPFFLLQSLYELCAFNNYRSILGEFSFIQLTHKFFMAREGIGYLGKDFRVLHEDKLEVNHFVHFLSYYMNSLDHTRWRVGPSLAIWPPTATELYDYGISFEKKSH
YSQKMFDERAGILRLPHIIINETFESMIRNIIAFEHCHRKRQVVSNFAIFMRFLLNSDNDVILLIKEGIIHNHLESTKGVTKLFHDLCENVVVETNLYNYECKRMREYGK
HRRHRWMASLKRDYFNTPWALISFIAAVVLLLLTLMQTVVAVLSMPK