; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G011810 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G011810
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr08:20420990..20422426
RNA-Seq ExpressionLsi08G011810
SyntenyLsi08G011810
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064945.1 UPF0481 protein [Cucumis melo var. makuwa]1.2e-11051.4Show/hide
Query:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD
        MN KAY PQ+ISIGPFHH +C    +F  TEQYKLQ L+NFLR I+N+       ++VVK   SLEDLLKTG+LK LV+K HC M EA+NCY+EPI  MD
Subjt:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD

Query:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------
         H F+ +ML+DACFI+E FI +YD +      F  I+D+VD  LLY E                              R  P+   HL            
Subjt:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------

Query:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA
                  +  K++  FLSF  +  P++      +N      + EN   NN  FLSFF  L CC LWQ P +  N++EL  PPS TELCE+GV IKKA
Subjt:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ +YL +I+FKNGVL+IP ++IYD+FE M RN++AFEQF+   +N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF A +FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

KAA0064946.1 UPF0481 protein [Cucumis melo var. makuwa]3.0e-11452.27Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH
        MN KAY PQ+ISIGPFHH      +F ATEQYKLQ L+NFLR I+N+       ++V K+RSL+DLLK G+LK LV+K HCWM E RNCY+EPI  MDDH
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH

Query:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------
         F+ +ML+DACFIVE FI +YD ++     F +I+D+VD  LLY E  + +  D   L+++      Q  +H                            
Subjt:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------

Query:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKP-GEIINEEELF-PPSTTELCEAGVIIKKAKD
                   K++  FLSF  +  P++      Q + E  + EN   NN  FLSFF  L CC LWQ P  +  N++EL  PPS TELCE+GV I+KAK+
Subjt:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKP-GEIINEEELF-PPSTTELCEAGVIIKKAKD

Query:  VKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTY
         KYL +I FKNGVL+IP ++IYD+FE M RN++AFEQF     N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV     
Subjt:  VKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTY

Query:  SNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
              ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF AA+FL+LLTLLQTIFS ISAFP
Subjt:  SNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

XP_008445583.2 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo]2.8e-11251.4Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH
        MN KAY PQ+ISIGPFHH      +F ATEQYKLQ L+NFLR I+N+       ++V K+RSL+DLLK G+LK LV+K HCWM E RNCY+EPI  MDDH
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH

Query:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------
         F+ +ML+DACFIVE FI +YD ++     F +I+D+VD  LLY E  + +  D   L+++      Q  +H                            
Subjt:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------

Query:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELFP----PSTTELCEAGVIIKKA
                   K++  FLSF  +  P++      Q + E  + EN   NN  FLSFF  L CC LWQ P    N++++      PS TELCE+GV I+KA
Subjt:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELFP----PSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ KYL +I FKNGVL+IP ++IYD+FE M RN++AFEQF     N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF AA+FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

XP_008445584.2 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]1.2e-11051.4Show/hide
Query:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD
        MN KAY PQ+ISIGPFHH +C    +F  TEQYKLQ L+NFLR I+N+       ++VVK   SLEDLLKTG+LK LV+K HC M EA+NCY+EPI  MD
Subjt:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD

Query:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------
         H F+ +ML+DACFI+E FI +YD +      F  I+D+VD  LLY E                              R  P+   HL            
Subjt:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------

Query:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA
                  +  K++  FLSF  +  P++      +N      + EN   NN  FLSFF  L CC LWQ P +  N++EL  PPS TELCE+GV IKKA
Subjt:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ +YL +I+FKNGVL+IP ++IYD+FE M RN++AFEQF+   +N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF A +FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

XP_038886293.1 UPF0481 protein At3g47200-like [Benincasa hispida]3.7e-11254.32Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK
        MN KAY PQVISIGPFHHQC    +F   EQYKLQGLINFL  I + ++   L ++  K  SLEDLL+TGSLK LV KAH W+KEARNCY EPI MDD K
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK

Query:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDVDLSLLYQEIR------------------PLPLDDKHLDDKQK------------------KW---HEK
        FL +MLVDACF+VE FI+EY+  +F  +   + + + Y  ++                   +P   + + D  K                  +W     K
Subjt:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDVDLSLLYQEIR------------------PLPLDDKHLDDKQK------------------KW---HEK

Query:  NMNKFLSFLCVLFPAYRQKQHEENSFENNNNNNNTFLSFFRVLLCCLWQKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIY
        ++  FLSF  +  P  R +Q+ E   + N      FL+ FRV LCCL +K     +EE L PPS T+LCEAGV IKKAKD KYLMDI+FKNGVL+IPP++
Subjt:  NMNKFLSFLCVLFPAYRQKQHEENSFENNNNNNNTFLSFFRVLLCCLWQKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIY

Query:  IYDDFEFMLRNMLAFEQF------QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNG
        IYDDF+ +LRN+LAFEQ        RNKY   YV FLD LIST+KDVHLLV+AGVI+N IGGSDKE+SD+FNN+ KFVT+   S   + ISK LR +C+G
Subjt:  IYDDFEFMLRNMLAFEQF------QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNG

Query:  RWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFS
         WNKAKASLKHNYFNTPWA ISF AATFLILLTLLQTIFS
Subjt:  RWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFS

TrEMBL top hitse value%identityAlignment
A0A1S3BD29 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like1.4e-11251.4Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH
        MN KAY PQ+ISIGPFHH      +F ATEQYKLQ L+NFLR I+N+       ++V K+RSL+DLLK G+LK LV+K HCWM E RNCY+EPI  MDDH
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH

Query:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------
         F+ +ML+DACFIVE FI +YD ++     F +I+D+VD  LLY E  + +  D   L+++      Q  +H                            
Subjt:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------

Query:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELFP----PSTTELCEAGVIIKKA
                   K++  FLSF  +  P++      Q + E  + EN   NN  FLSFF  L CC LWQ P    N++++      PS TELCE+GV I+KA
Subjt:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELFP----PSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ KYL +I FKNGVL+IP ++IYD+FE M RN++AFEQF     N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF AA+FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

A0A1S3BDS2 UPF0481 protein At3g47200-like5.7e-11151.4Show/hide
Query:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD
        MN KAY PQ+ISIGPFHH +C    +F  TEQYKLQ L+NFLR I+N+       ++VVK   SLEDLLKTG+LK LV+K HC M EA+NCY+EPI  MD
Subjt:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD

Query:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------
         H F+ +ML+DACFI+E FI +YD +      F  I+D+VD  LLY E                              R  P+   HL            
Subjt:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------

Query:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA
                  +  K++  FLSF  +  P++      +N      + EN   NN  FLSFF  L CC LWQ P +  N++EL  PPS TELCE+GV IKKA
Subjt:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ +YL +I+FKNGVL+IP ++IYD+FE M RN++AFEQF+   +N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF A +FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

A0A5A7VBG0 UPF0481 protein9.5e-10651.44Show/hide
Query:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDD
        MN + Y PQ+ISIGPFHH +C    +F ATEQYKLQ L+NFLR I+ND         V   RSLEDLLKT +LK LVKK  CWM EARN Y+EPI  MDD
Subjt:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDD

Query:  HKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEIRPLPLDDK-HLDDKQKKWHEKNM-------NKFLSFLCVLFPAY--------
        H F+ +ML+DACFIVE FI +YD +      F QI+D+++L LLY EI     +D   L+++   +  +NM       +  +SF+ + + A         
Subjt:  HKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEIRPLPLDDK-HLDDKQKKWHEKNM-------NKFLSFLCVLFPAY--------

Query:  ---------RQKQH----------EENSFENNNNNNNTFLSFFRVLLCC-LWQ-KPGEIINEEELF--PPSTTELCEAGVIIKKAKDVKYLMDINFKNGV
                  + +H             + ++   NN  FLSFF  L CC LWQ +P +  ++ EL   PPS TEL E+GV IKKAK+ KYL +I FKNGV
Subjt:  ---------RQKQH----------EENSFENNNNNNNTFLSFFRVLLCC-LWQ-KPGEIINEEELF--PPSTTELCEAGVIIKKAKDVKYLMDINFKNGV

Query:  LEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFV-TLSTYSNRLEDISKALR
        LEIP ++IYD+FE ++RN++AFEQ     RN YA  Y+ F+DD+ISTEKDV +LV + VIVN IGGSDKE++++FNN+ KF+ + +  S++   ISKAL 
Subjt:  LEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFV-TLSTYSNRLEDISKALR

Query:  EHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
        +HCNGRWNKAKASLKHNYFNTPWA ISF AA+FL+LLTLLQTIFS ISAFP
Subjt:  EHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

A0A5A7VF39 UPF0481 protein1.5e-11452.27Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH
        MN KAY PQ+ISIGPFHH      +F ATEQYKLQ L+NFLR I+N+       ++V K+RSL+DLLK G+LK LV+K HCWM E RNCY+EPI  MDDH
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MDDH

Query:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------
         F+ +ML+DACFIVE FI +YD ++     F +I+D+VD  LLY E  + +  D   L+++      Q  +H                            
Subjt:  KFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-RPLPLDDKHLDDK------QKKWH----------------------------

Query:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKP-GEIINEEELF-PPSTTELCEAGVIIKKAKD
                   K++  FLSF  +  P++      Q + E  + EN   NN  FLSFF  L CC LWQ P  +  N++EL  PPS TELCE+GV I+KAK+
Subjt:  ----------EKNMNKFLSFLCVLFPAYR-----QKQHEENSFENNNNNNNTFLSFFRVLLCC-LWQKP-GEIINEEELF-PPSTTELCEAGVIIKKAKD

Query:  VKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTY
         KYL +I FKNGVL+IP ++IYD+FE M RN++AFEQF     N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV     
Subjt:  VKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQF---QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTY

Query:  SNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
              ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF AA+FL+LLTLLQTIFS ISAFP
Subjt:  SNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

A0A5A7VGD0 UPF0481 protein5.7e-11151.4Show/hide
Query:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD
        MN KAY PQ+ISIGPFHH +C    +F  TEQYKLQ L+NFLR I+N+       ++VVK   SLEDLLKTG+LK LV+K HC M EA+NCY+EPI  MD
Subjt:  MNLKAYAPQVISIGPFHH-QCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKN-RSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIG-MD

Query:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------
         H F+ +ML+DACFI+E FI +YD +      F  I+D+VD  LLY E                              R  P+   HL            
Subjt:  DHKFLTIMLVDACFIVEFFIIEYDRNF-----FDQIEDDVDLSLLYQEI-----------------------------RPLPLDDKHLDDKQ--------

Query:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA
                  +  K++  FLSF  +  P++      +N      + EN   NN  FLSFF  L CC LWQ P +  N++EL  PPS TELCE+GV IKKA
Subjt:  --------KKWHEKNMNKFLSFLCVLFPAYRQKQHEEN------SFENNNNNNNTFLSFFRVLLCC-LWQKPGEIINEEELF-PPSTTELCEAGVIIKKA

Query:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS
        K+ +YL +I+FKNGVL+IP ++IYD+FE M RN++AFEQF+   +N YA  Y+ F+DDLISTEKDV LLV +GVI+N IGGSDKE+S++FNN+ KFV   
Subjt:  KDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ---RNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLS

Query:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP
                ISKALR+HCNGRWNKAKASLKHNYFNTPWA ISF A +FL+LLTLLQTIFS ISAFP
Subjt:  TYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026455.0e-1129.73Show/hide
Query:  EELFPPSTTELCEAGVIIK-KAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQRNKYAL--HYVSFLDDLISTEKDVHLLVKAGVIVNKI
        EEL  PS ++L +AGV  K  A      +  +  +G   +P I +  + E +LRN++A+E    +   +   Y   ++ +I +E+DV LL + GV+V+++
Subjt:  EELFPPSTTELCEAGVIIK-KAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQRNKYAL--HYVSFLDDLISTEKDVHLLVKAGVIVNKI

Query:  GGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF
          SD+E ++M+N + K V L T    L+   + +  +  GRW      L   Y    W +++F+AA  L++L  LQ      S+F
Subjt:  GGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF

Q9SD53 UPF0481 protein At3g472002.1e-1724.31Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK
        +N KAY P+V+SIGP+H+      +    +Q+K + L  FL     DE K    EE V  +++ D      L+D ++K+          Y+E +    H 
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK

Query:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDV------------DLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLSFLCVLFPAYRQKQHEENSFE
         + +M++D CFI+  F+I       +  ED +            DL LL  ++    L   ++  K     + N         + F  ++    +E S+ 
Subjt:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDV------------DLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLSFLCVLFPAYRQKQHEENSFE

Query:  NNNNN----------NNTFL---------SFFRVLLCCLWQKPGEIINEEELFPP---STTELCEAGV--IIKKAKDVKYLMDINFKNGVLEIPPIYIYD
          + N            TFL         S   V +     K G + + +    P   S   L   G+   ++++K+   ++++  K   L+IP +    
Subjt:  NNNNN----------NNTFL---------SFFRVLLCCLWQKPGEIINEEELFPP---STTELCEAGV--IIKKAKDVKYLMDINFKNGVLEIPPIYIYD

Query:  DFEFMLRNMLAFEQF--QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKA
               N +AFEQF    +     Y+ F+  L++ E+DV  L    +I+    GS+ E+S+ F  I K V     ++ L ++ K + E+    +N   A
Subjt:  DFEFMLRNMLAFEQF--QRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKA

Query:  SLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAIS
          +H +F +PW  +S  A  F+ILLT+LQ+  + +S
Subjt:  SLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAIS

Arabidopsis top hitse value%identityAlignment
AT3G50130.1 Plant protein of unknown function (DUF247)1.0e-3528.34Show/hide
Query:  NLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKF
        N K+Y PQ +S+GPFHH     GN         + L+   RH      K+     V+     +  +   ++K+L  +       AR CY  PI +  +KF
Subjt:  NLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKF

Query:  LTIMLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLS--FLCVLFPAYRQKQHE
          ++++D CF++E F         + YDRN            I+ D+   ++ +   PL + ++ L+ +  K H+  +   L+  F   L P        
Subjt:  LTIMLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLS--FLCVLFPAYRQKQHE

Query:  ENSFENNNNNNNT---------FLSFFRVLLCCLWQKPGEIIN-------------EEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIY
        ++S E +   N            L  FR  L      P   ++              ++      TEL EAG+  +  K  ++  DI FKNG LEIP + 
Subjt:  ENSFENNNNNNNT---------FLSFFRVLLCCLWQKPGEIIN-------------EEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIY

Query:  IYDDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNK
        I+D  + +  N++AFEQ     +     Y+ F+D+LI + +DV  L   G+I + + G+D E++D+FN +C+ V     ++ L  +S  +  + + +WN 
Subjt:  IYDDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNK

Query:  AKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF
         KA LKH YFN PWA  SF AA  L++LTL Q+ F+A   F
Subjt:  AKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF

AT3G50140.1 Plant protein of unknown function (DUF247)3.7e-3828.7Show/hide
Query:  AYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFLTI
        +Y PQ +S+GP+HH     G+               LR +D    K+     V+K       +   ++K+L ++       AR CY  PIG+  +KF  +
Subjt:  AYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFLTI

Query:  MLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLS--FLCVLFPAYRQKQHEENS
        +++D CF+++ F         + YDRN            I  D+   L+ +   PL + ++ L+ +    ++  +   L+  F   L P Y      ENS
Subjt:  MLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLS--FLCVLFPAYRQKQHEENS

Query:  FENNNNNNNTFLSFFRVLLCCL----------------------W-QKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIY
         ENNN   N      +  L CL                      W +KP      ++      TEL EAG+  K+ K  ++  DI FKNG LEIP + I+
Subjt:  FENNNNNNNTFLSFFRVLLCCL----------------------W-QKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIY

Query:  DDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAK
        D  + +  N++A+EQ           Y+ F+D+LI + +D+  L    +I + + G+D E++D+FN +C+ V     +  L ++S  +  + N +WN  K
Subjt:  DDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAK

Query:  ASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF
        A+LKH YF+ PWA  SF AA  L+LLTL Q+ F++   F
Subjt:  ASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF

AT3G50160.1 Plant protein of unknown function (DUF247)7.3e-3429.27Show/hide
Query:  KAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDL-LKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFL
        K+Y PQ++SIGP+HH      +    E++K + +                   +V  R+  D+ +   ++K+L +K       AR CY  PI M+ ++F+
Subjt:  KAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDL-LKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFL

Query:  TIMLVDACFIVEFFIIEYDRNFFDQI----EDDV-DLSLLYQEIRPLPLDDKHLDDKQKKW---------------HEKNMNKFLSFLCVLFPAYRQKQH
         ++++D  FI+E F  +     F +I     D V  +  L Q IR     D  + + Q  W                + N+  F  F   L P  R+   
Subjt:  TIMLVDACFIVEFFIIEYDRNFFDQI----EDDV-DLSLLYQEIRPLPLDDKHLDDKQKKW---------------HEKNMNKFLSFLCVLFPAYRQKQH

Query:  EENSFENNNNNNNTFLSFFRVLLCCLWQKPG------EIINEE-ELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLA
        EE             L    VL   L Q  G       ++N++ +      TEL  AGV   + K+  +  DI FKNG L+IP + I+D  + +  N++A
Subjt:  EENSFENNNNNNNTFLSFFRVLLCCLWQKPG------EIINEE-ELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLA

Query:  FEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPW
        FEQ   + +K    Y+ F+D+LI++ +DV  L   G+I N + GSD E+SD+FN + K V        L  ++  +  +   +WN  KA+L+H YFN PW
Subjt:  FEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPW

Query:  AVISFIAATFLILLTLLQTIFSAISAF
        A  SFIAA  L++ T  Q+ F+  + F
Subjt:  AVISFIAATFLILLTLLQTIFSAISAF

AT3G50170.1 Plant protein of unknown function (DUF247)3.0e-3527.5Show/hide
Query:  KAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFLT
        K+Y PQ +S+GP+HH           E++K + L   L+          LK+ +         + T ++++L +K       AR CY  PI +  ++F  
Subjt:  KAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFLT

Query:  IMLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNM--NKFLSFLCVLFPAYRQKQHEEN
        ++++D CF++E F         I Y RN            I+ D+   ++ +   PL + D+ L+ +    ++  +  +  + F   L P        + 
Subjt:  IMLVDACFIVEFF--------IIEYDRN--------FFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNM--NKFLSFLCVLFPAYRQKQHEEN

Query:  SFENN----------NNNNNTFLSFF-------------RVLLCCLWQKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYI
        S   N          +      L  F             R LL  L +    +   ++      TEL EAGV  +K K  ++  DI FKNG LEIP + I
Subjt:  SFENN----------NNNNNTFLSFF-------------RVLLCCLWQKPGEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYI

Query:  YDDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKA
        +D  + +  N++AFEQ   + + +   Y+ F+D+LI++ +DV  L   G+I + + GSD E++D+FN +C+ V      + L  +S  +  + N +WN  
Subjt:  YDDFEFMLRNMLAFEQ--FQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNRLEDISKALREHCNGRWNKA

Query:  KASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF
        KA+L H YFN PWA  SF AA  L+LLTL Q+ ++  + +
Subjt:  KASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAF

AT4G31980.1 unknown protein2.3e-3529.78Show/hide
Query:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK
        +N  AY P+++S GP H          A E  K + L++F+   ++                        SL+DLV+ A  W + AR+CY E + +   +
Subjt:  MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHK

Query:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDVDL----SLLYQEI-RPLPLDDKHLDDKQKKWHEKNMNKFLSFLCVLFPAYRQKQHEENSFENNNNNNN
        F+ +++VD  F+VE  +    R+ + ++  + D     S++  ++ R + L +  L     K   +     L++     P+  Q      S+  +  ++ 
Subjt:  FLTIMLVDACFIVEFFIIEYDRNFFDQIEDDVDL----SLLYQEI-RPLPLDDKHLDDKQKKWHEKNMNKFLSFLCVLFPAYRQKQHEENSFENNNNNNN

Query:  TFLS---FFRVLL--CCLWQKP--GEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ-RNKYALH
         F++    F  LL  C L Q P   E    +    P  TEL  AGV  K A+    L+DI+F +GVL+IP I + D  E + +N++ FEQ +  NK  L 
Subjt:  TFLS---FFRVLL--CCLWQKP--GEIINEEELFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQ-RNKYALH

Query:  YVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNR--LEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLI
        Y+  L   I +  D  LL+ +G+IVN +G S  ++S++FN+I K V    Y  R     +S+ L+ +CN  WN+ KA L+ +YF+ PWAV S  AA  L+
Subjt:  YVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNICKFVTLSTYSNR--LEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLI

Query:  LLTLLQTIFSAIS
        LLT +Q++ S ++
Subjt:  LLTLLQTIFSAIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTTAAAGCCTACGCCCCTCAAGTTATTTCCATTGGCCCTTTTCACCATCAATGTCCAATTGGTGGTAATTTTACAGCAACAGAACAGTATAAGCTTCAAGGTCT
TATTAACTTTCTACGTCATATCGATAATGATGAGATGAAATATTCATTGAAGGAGGAGGTTGTGAAAAATAGATCATTGGAGGACCTTCTGAAAACTGGATCATTGAAGG
ACCTCGTGAAAAAAGCTCATTGTTGGATGAAAGAAGCCCGTAATTGCTATACAGAACCCATAGGCATGGACGACCATAAGTTTCTTACAATAATGCTTGTGGATGCTTGT
TTCATAGTGGAATTTTTTATAATAGAATATGATCGTAACTTCTTCGATCAAATTGAAGACGATGTAGATCTTTCGTTACTCTACCAAGAAATAAGACCCTTGCCCCTTGA
TGATAAGCACCTTGATGATAAGCAGAAAAAATGGCATGAGAAAAATATGAACAAATTCTTGAGCTTCCTATGCGTCCTTTTTCCAGCATATCGACAGAAGCAACATGAGG
AAAACAGCTTTGAGAATAATAATAATAATAATAATACTTTTTTAAGCTTCTTTCGTGTACTCTTGTGTTGTCTTTGGCAGAAGCCCGGGGAGATTATCAATGAGGAAGAG
TTGTTTCCTCCATCCACAACTGAGCTCTGCGAGGCTGGTGTCATCATCAAGAAAGCAAAAGATGTCAAATATTTGATGGACATAAACTTCAAAAATGGGGTTTTGGAAAT
TCCACCAATATATATTTATGACGACTTTGAATTTATGTTGCGAAACATGCTAGCATTTGAGCAATTCCAGCGCAACAAGTATGCGTTACATTATGTCTCATTTCTGGATG
ATTTGATCAGTACAGAGAAAGATGTGCATTTACTTGTGAAGGCCGGAGTCATAGTCAATAAAATTGGCGGCAGTGATAAAGAAATTTCAGATATGTTTAACAACATCTGT
AAATTTGTCACATTATCTACTTATTCTAACCGCTTAGAAGATATTAGCAAAGCTTTGCGTGAGCACTGCAATGGAAGGTGGAACAAGGCAAAAGCTTCACTCAAACATAA
CTATTTCAATACCCCATGGGCTGTTATCTCCTTCATTGCTGCAACTTTCCTCATTCTTCTCACCCTCCTTCAAACCATTTTCTCTGCTATCTCTGCGTTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCTTAAAGCCTACGCCCCTCAAGTTATTTCCATTGGCCCTTTTCACCATCAATGTCCAATTGGTGGTAATTTTACAGCAACAGAACAGTATAAGCTTCAAGGTCT
TATTAACTTTCTACGTCATATCGATAATGATGAGATGAAATATTCATTGAAGGAGGAGGTTGTGAAAAATAGATCATTGGAGGACCTTCTGAAAACTGGATCATTGAAGG
ACCTCGTGAAAAAAGCTCATTGTTGGATGAAAGAAGCCCGTAATTGCTATACAGAACCCATAGGCATGGACGACCATAAGTTTCTTACAATAATGCTTGTGGATGCTTGT
TTCATAGTGGAATTTTTTATAATAGAATATGATCGTAACTTCTTCGATCAAATTGAAGACGATGTAGATCTTTCGTTACTCTACCAAGAAATAAGACCCTTGCCCCTTGA
TGATAAGCACCTTGATGATAAGCAGAAAAAATGGCATGAGAAAAATATGAACAAATTCTTGAGCTTCCTATGCGTCCTTTTTCCAGCATATCGACAGAAGCAACATGAGG
AAAACAGCTTTGAGAATAATAATAATAATAATAATACTTTTTTAAGCTTCTTTCGTGTACTCTTGTGTTGTCTTTGGCAGAAGCCCGGGGAGATTATCAATGAGGAAGAG
TTGTTTCCTCCATCCACAACTGAGCTCTGCGAGGCTGGTGTCATCATCAAGAAAGCAAAAGATGTCAAATATTTGATGGACATAAACTTCAAAAATGGGGTTTTGGAAAT
TCCACCAATATATATTTATGACGACTTTGAATTTATGTTGCGAAACATGCTAGCATTTGAGCAATTCCAGCGCAACAAGTATGCGTTACATTATGTCTCATTTCTGGATG
ATTTGATCAGTACAGAGAAAGATGTGCATTTACTTGTGAAGGCCGGAGTCATAGTCAATAAAATTGGCGGCAGTGATAAAGAAATTTCAGATATGTTTAACAACATCTGT
AAATTTGTCACATTATCTACTTATTCTAACCGCTTAGAAGATATTAGCAAAGCTTTGCGTGAGCACTGCAATGGAAGGTGGAACAAGGCAAAAGCTTCACTCAAACATAA
CTATTTCAATACCCCATGGGCTGTTATCTCCTTCATTGCTGCAACTTTCCTCATTCTTCTCACCCTCCTTCAAACCATTTTCTCTGCTATCTCTGCGTTTCCTTAG
Protein sequenceShow/hide protein sequence
MNLKAYAPQVISIGPFHHQCPIGGNFTATEQYKLQGLINFLRHIDNDEMKYSLKEEVVKNRSLEDLLKTGSLKDLVKKAHCWMKEARNCYTEPIGMDDHKFLTIMLVDAC
FIVEFFIIEYDRNFFDQIEDDVDLSLLYQEIRPLPLDDKHLDDKQKKWHEKNMNKFLSFLCVLFPAYRQKQHEENSFENNNNNNNTFLSFFRVLLCCLWQKPGEIINEEE
LFPPSTTELCEAGVIIKKAKDVKYLMDINFKNGVLEIPPIYIYDDFEFMLRNMLAFEQFQRNKYALHYVSFLDDLISTEKDVHLLVKAGVIVNKIGGSDKEISDMFNNIC
KFVTLSTYSNRLEDISKALREHCNGRWNKAKASLKHNYFNTPWAVISFIAATFLILLTLLQTIFSAISAFP