CuGenDBv2

Sgr017203 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017203
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Conserved peptide upstream open reading frame 46
Genome location: tig00153033:501862..502707
RNA-Seq Expression: Sgr017203
Synteny: Sgr017203
Gene Ontology terms:
GO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA
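The genome location field follows a `seqid:start..end` pattern. The sketch below shows one way to parse it and compute the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (assumed 1-based, inclusive coordinates)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "tig00153033:501862..502707".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("tig00153033:501862..502707")
print(loc.seqid, loc.length)  # tig00153033 846
```

Under the inclusive-coordinate assumption the span is 846 bp.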


Homology
GenBank top hits (e value, % identity, alignment)
KAE8647389.1 hypothetical protein Csa_003425 [Cucumis sativus] (e value: 4.1e-86; % identity: 63.67)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K  +S I H+   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSFD++ C   VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG
        +++IK LV  KHLFNHSA+ L+VGGSSSSA SVL DLGFS AVGVDKGRF+SLKR + GY+LDY N+SFDFVLF+G  K+SVPDLVV E+ER+L+ GGIG
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG

Query:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        AVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK     + D+   +P+NCSS
Subjt:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

XP_008449921.1 PREDICTED: uncharacterized protein LOC103491650 [Cucumis melo] (e value: 1.2e-85; % identity: 64.55)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K LKS I  +   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSF+++ C E VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI
        +++IK LVD KHLFNHSA+ L+VGGSSSSA SVL DLGFS A+GVDKGRF+SLKR + GY+LDYAN SFDFVLF G   K+SVPDLVV EIER+L+ GGI
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI

Query:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        GAVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK     + D+   +P+NCSS
Subjt:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

XP_011657647.1 uncharacterized protein LOC105435880 [Cucumis sativus] (e value: 4.1e-86; % identity: 63.67)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K  +S I H+   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSFD++ C   VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG
        +++IK LV  KHLFNHSA+ L+VGGSSSSA SVL DLGFS AVGVDKGRF+SLKR + GY+LDY N+SFDFVLF+G  K+SVPDLVV E+ER+L+ GGIG
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG

Query:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        AVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK     + D+   +P+NCSS
Subjt:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

XP_022143860.1 uncharacterized protein LOC111013673 [Momordica charantia] (e value: 6.6e-100; % identity: 72.2)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPA-GPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNL
        MD KL KSQI ++  VARRVFFR+FLFASAVSIIPIVHILTTYDF +FHLP+S+GCY A G +  NSDQ PRGSYLFQGHFLNPVWDSFD+V C ENVNL
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPA-GPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNL

Query:  TVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGI
        T++ IK LVD KHLFNHSAK L+VGGSSSSAVS + DLGFS AVGVDKGR LSLKRK+FGYRLDYAN SFDFV+FRG FK+SVPDLVV EIERVLN+GGI
Subjt:  TVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGI

Query:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQ
        GAVV   S P         AARV SLLK+SCVVHSG VN FYMTVFKK+ + GG           CS   GNRTLL+
Subjt:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQ

XP_038883488.1 uncharacterized protein LOC120074441 [Benincasa hispida] (e value: 2.8e-90; % identity: 65.54)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K +KS I H+   ARR+FFRIFLF S +S+IPI+HILT+YDF SFHLPKS  C+    +    DQLPRGSYLFQGHFLNPVWDSFD+V C E VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG
        V++IK LV+ KHLFNHSA+ L+VGGSSSSA S+L DLGFS AVGVDKGRF+SL+++  GY+LDY+N SFDFVLF+G  K+SVPDLVV EIER+L  GGIG
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG

Query:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        AVV   S+P    ISI  A RV  LLK+SCVV+SG VNK Y++VFKKK        +FH LP+NCSS
Subjt:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KEM7 Uncharacterized protein (e value: 2.0e-86; % identity: 63.67)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K  +S I H+   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSFD++ C   VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG
        +++IK LV  KHLFNHSA+ L+VGGSSSSA SVL DLGFS AVGVDKGRF+SLKR + GY+LDY N+SFDFVLF+G  K+SVPDLVV E+ER+L+ GGIG
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG

Query:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        AVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK     + D+   +P+NCSS
Subjt:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

A0A1S3BMI8 uncharacterized protein LOC103491650 (e value: 5.8e-86; % identity: 64.55)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K LKS I  +   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSF+++ C E VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI
        +++IK LVD KHLFNHSA+ L+VGGSSSSA SVL DLGFS A+GVDKGRF+SLKR + GY+LDYAN SFDFVLF G   K+SVPDLVV EIER+L+ GGI
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI

Query:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        GAVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK     + D+   +P+NCSS
Subjt:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

A0A5A7U1B0 Methyltransferase type 11 (e value: 5.5e-84; % identity: 67.07)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD K LKS I  +   A+RVFFR+FLFASA+S+IPI+HILT+YDF SFHLPKS  C+ +  S    D LPRGSYLFQGHFLNPVWDSF+++ C E VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI
        +++IK LVD KHLFNHSA+ L+VGGSSSSA SVL DLGFS A+GVDKGRF+SLKR + GY+LDYAN SFDFVLF G   K+SVPDLVV EIER+L+ GGI
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGN-FKLSVPDLVVAEIERVLNAGGI

Query:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKK
        GAVV   S+P    ISI    RV  LLK+SCVV+SG V K Y++VFKKK
Subjt:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKK

A0A6J1CPZ9 uncharacterized protein LOC111013673 (e value: 3.2e-100; % identity: 72.2)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPA-GPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNL
        MD KL KSQI ++  VARRVFFR+FLFASAVSIIPIVHILTTYDF +FHLP+S+GCY A G +  NSDQ PRGSYLFQGHFLNPVWDSFD+V C ENVNL
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPA-GPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNL

Query:  TVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGI
        T++ IK LVD KHLFNHSAK L+VGGSSSSAVS + DLGFS AVGVDKGR LSLKRK+FGYRLDYAN SFDFV+FRG FK+SVPDLVV EIERVLN+GGI
Subjt:  TVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGI

Query:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQ
        GAVV   S P         AARV SLLK+SCVVHSG VN FYMTVFKK+ + GG           CS   GNRTLL+
Subjt:  GAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQ

A0A6J1EDK4 uncharacterized protein LOC111433223 (e value: 1.9e-84; % identity: 62.92)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        MD KL KS I H++  ARR+ FR+FLFA AVSIIP VHI T+YDF SFHLPKS  C+ AG   G +DQLPRGSYLFQGHFLNP+WDS ++  C E VNLT
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG
        ++VI+ELVD KHLFNHSA+ L+VG SSS+A SVL DLGF  AVG+DKGRF+S+K+++ GY+LDY N SFDFVLFRG FK+SVPDLVV EIERVL  GG G
Subjt:  VAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIG

Query:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS
        AVV   ++P T    I  A R+ SLLK+SCVV S  VN   +TVFKKK          H  P+NC S
Subjt:  AVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1) (e value: 1.1e-25; % identity: 30.69)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILT-TYD---FNSFHLPKSRGCYP----AGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVR
        M F++LK ++   S   RRV  R  +   A S++ ++  L   Y+    N+    K   C       GP   + + L     LF   FL PVW+  ++ +
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILT-TYD---FNSFHLPKSRGCYP----AGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVR

Query:  CHENVNLTVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIER
        C +N+ LT  V++EL    +L ++ +K L +G  S SAV  +   G S           + K ++F   L Y +ASF FV       ++VP  +V EIER
Subjt:  CHENVNLTVAVIKELVDSKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIER

Query:  VLNAGGIGAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHR-LPVNCSSFAGNRTLLQKMKHL
        +L  GG GA++   ++ + ++  +RS + V+SLLK S VVH   + K  + VFK+  E     D+ H   P +CSS   NR  +  ++ L
Subjt:  VLNAGGIGAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHR-LPVNCSSFAGNRTLLQKMKHL

AT5G03190.1 conserved peptide upstream open reading frame 47 (e value: 2.1e-19; % identity: 31.93)
Query:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT
        M  K+LK  IF  S   R   FR  + ASA+S++P++ +         H+    G       G     +P G  LF    + P W   +T +  +     
Subjt:  MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLT

Query:  VAVIKELVD---SKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYA-NASFDFVLFRGNFKLSVPDLVVAEIERVLNA
          VI +LVD      L ++ AK+L +G  S SAVS   ++GFS   GV K    S   ++    L+ + + SFDFVL      ++ P L+V E+ERVL  
Subjt:  VAVIKELVD---SKHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYA-NASFDFVLFRGNFKLSVPDLVVAEIERVLNA

Query:  GGIGAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQKMKHL
        GG GAV+   ST A      R    V S LK S +V    ++KF + VFK+      Y     +LP +C S   NR   + M+ L
Subjt:  GGIGAVVFRFSTPATADISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQKMKHL

AT5G03190.2 conserved peptide upstream open reading frame 47 (e value: 1.1e-17; % identity: 31.23)
Query:  ARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLTVAVIKELVD---SKHL
        +R   FR  + ASA+S++P++ +         H+    G       G     +P G  LF    + P W   +T +  +       VI +LVD      L
Subjt:  ARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLTVAVIKELVD---SKHL

Query:  FNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYA-NASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIGAVVFRFSTPATA
         ++ AK+L +G  S SAVS   ++GFS   GV K    S   ++    L+ + + SFDFVL      ++ P L+V E+ERVL  GG GAV+   ST A  
Subjt:  FNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYA-NASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIGAVVFRFSTPATA

Query:  DISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQKMKHL
            R    V S LK S +V    ++KF + VFK+      Y     +LP +C S   NR   + M+ L
Subjt:  DISIRSAARVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQKMKHL
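In the alignments above, the unlabeled line between each Query and Subjt line is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below estimates percent identity from those match lines, assuming identity is defined as identical columns divided by total alignment columns. `percent_identity` is a hypothetical helper, not something the database provides, and its result is not guaranteed to reproduce the figures reported on this page.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Estimate percent identity from (query_segment, match_line) pairs.

    Both strings must have their 'Query:  ' prefix or matching indentation
    removed first. The query segment sets the column count, because trailing
    spaces on the match line are often lost when text is copied.
    """
    identical = 0
    columns = 0
    for query, midline in blocks:
        columns += len(query)
        # Letters on the match line are identities; '+' and ' ' are not.
        identical += sum(ch.isalpha() for ch in midline[: len(query)])
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# First ten columns of the first GenBank alignment on this page.
print(percent_identity([("MDFKLLKSQI", "MD K  +S I")]))  # 50.0
```

For a full hit, pass one pair per alignment block so that the identities and columns are summed across all blocks before dividing.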


Sequences
CDS sequence
ATGGATTTTAAGCTCCTCAAATCGCAGATCTTTCACGAGTCTCCAGTGGCGAGGCGCGTGTTCTTCCGCATCTTCTTGTTCGCCTCCGCCGTCTCCATCATTCCGATCGT
CCACATCCTCACCACTTACGATTTCAATAGCTTCCATTTGCCCAAATCCCGAGGCTGCTACCCCGCCGGGCCCAGCGGCGGAAACTCCGATCAACTCCCGCGGGGTTCCT
ATTTGTTTCAGGGTCACTTTCTGAACCCCGTCTGGGATTCGTTCGACACAGTGCGTTGCCACGAAAATGTGAATCTGACGGTCGCTGTGATCAAGGAGCTGGTGGACAGC
AAGCATTTGTTCAACCATAGCGCTAAGGTGCTTTATGTCGGCGGGAGTTCGTCCTCCGCCGTGTCGGTGCTGGGAGATTTGGGCTTTTCCAGTGCCGTCGGAGTCGACAA
GGGTCGCTTTCTCTCGCTGAAACGGAAACAATTTGGGTACAGACTCGATTACGCGAATGCTTCCTTCGATTTCGTTTTATTCAGAGGCAACTTTAAGCTCTCTGTTCCTG
ATCTGGTGGTGGCTGAGATAGAGCGTGTTCTCAATGCCGGCGGAATTGGTGCGGTTGTTTTCCGTTTCAGCACCCCGGCGACGGCAGACATTTCGATCAGATCCGCCGCT
CGAGTGGCGAGCTTGCTAAAAACTTCCTGCGTCGTGCATTCGGGCTATGTAAATAAGTTCTATATGACTGTATTCAAGAAGAAACCTGAAACCGGTGGCTACTCCGATGA
GTTTCATCGCCTTCCTGTCAACTGCTCGAGTTTCGCCGGAAACAGAACTCTCCTGCAGAAAATGAAGCATCTTTGA
mRNA sequence
ATGGATTTTAAGCTCCTCAAATCGCAGATCTTTCACGAGTCTCCAGTGGCGAGGCGCGTGTTCTTCCGCATCTTCTTGTTCGCCTCCGCCGTCTCCATCATTCCGATCGT
CCACATCCTCACCACTTACGATTTCAATAGCTTCCATTTGCCCAAATCCCGAGGCTGCTACCCCGCCGGGCCCAGCGGCGGAAACTCCGATCAACTCCCGCGGGGTTCCT
ATTTGTTTCAGGGTCACTTTCTGAACCCCGTCTGGGATTCGTTCGACACAGTGCGTTGCCACGAAAATGTGAATCTGACGGTCGCTGTGATCAAGGAGCTGGTGGACAGC
AAGCATTTGTTCAACCATAGCGCTAAGGTGCTTTATGTCGGCGGGAGTTCGTCCTCCGCCGTGTCGGTGCTGGGAGATTTGGGCTTTTCCAGTGCCGTCGGAGTCGACAA
GGGTCGCTTTCTCTCGCTGAAACGGAAACAATTTGGGTACAGACTCGATTACGCGAATGCTTCCTTCGATTTCGTTTTATTCAGAGGCAACTTTAAGCTCTCTGTTCCTG
ATCTGGTGGTGGCTGAGATAGAGCGTGTTCTCAATGCCGGCGGAATTGGTGCGGTTGTTTTCCGTTTCAGCACCCCGGCGACGGCAGACATTTCGATCAGATCCGCCGCT
CGAGTGGCGAGCTTGCTAAAAACTTCCTGCGTCGTGCATTCGGGCTATGTAAATAAGTTCTATATGACTGTATTCAAGAAGAAACCTGAAACCGGTGGCTACTCCGATGA
GTTTCATCGCCTTCCTGTCAACTGCTCGAGTTTCGCCGGAAACAGAACTCTCCTGCAGAAAATGAAGCATCTTTGA
Protein sequence
MDFKLLKSQIFHESPVARRVFFRIFLFASAVSIIPIVHILTTYDFNSFHLPKSRGCYPAGPSGGNSDQLPRGSYLFQGHFLNPVWDSFDTVRCHENVNLTVAVIKELVDS
KHLFNHSAKVLYVGGSSSSAVSVLGDLGFSSAVGVDKGRFLSLKRKQFGYRLDYANASFDFVLFRGNFKLSVPDLVVAEIERVLNAGGIGAVVFRFSTPATADISIRSAA
RVASLLKTSCVVHSGYVNKFYMTVFKKKPETGGYSDEFHRLPVNCSSFAGNRTLLQKMKHL
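
The CDS and protein sequences can be cross-checked by translating the CDS. The sketch below is a minimal translator that assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used. `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence into a protein string.

    Whitespace and line breaks are ignored, so a sequence copied across
    several lines can be passed as-is. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGATTTTAAGCTCCTCAAATCGCAG"))  # MDFKLLKSQ
```

Passing the full CDS and comparing the result with the protein sequence (with line breaks removed) is a quick consistency check on the record.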