; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016633 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016633
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionsurfeit locus protein 2-like
Genome locationtig00152977:1019022..1025736
RNA-Seq ExpressionSgr016633
SyntenySgr016633
Gene Ontology termsNA
InterPro domainsIPR008833 - Surfeit locus protein 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057230.1 surfeit locus protein 2 [Cucumis melo var. makuwa]5.0e-10681.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        M+TSTAEE   VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGL+DFALS RKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKE EKD +AKSGEQQ KKKA K  KPSTENS   KKKKKK  E TISEA+E  GESDPE AFW+PPVGQRWD DNGGDRW SGSD 
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHESDK+IAM+             DE+KH + ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

XP_008464937.1 PREDICTED: surfeit locus protein 2 [Cucumis melo]5.0e-10681.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        M+TSTAEE   VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGL+DFALS RKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKE EKD +AKSGEQQ KKKA K  KPSTENS   KKKKKK  E TISEA+E  GESDPE AFW+PPVGQRWD DNGGDRW SGSD 
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHESDK+IAM+             DE+KH + ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

XP_022152129.1 surfeit locus protein 2-like [Momordica charantia]1.7e-10684.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        MATS  EERA+VEG +LLGQPTFTEL+NGRFRCVETGHE+L KD D YSRTKRCR+GLIDFALSHRKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKELEK+  AKSGEQQ KKKA KASK STENS+   KKKKK LE T SEARE  G +D E AFW+PPVGQRWDCDNGGDRW SGSDS
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHES+KVIAM+DE+K G+TESDELSK+TKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

XP_038903228.1 surfeit locus protein 2-like isoform X1 [Benincasa hispida]5.3e-10882.26Show/hide
Query:  MATSTAEER--ADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
        MATSTAEER  A VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSTAEER--ADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS
        IWKHINGKRFLNKLEQKELEK  +AKSGEQQGKKKA KA KPS ENS    KKKKK  E TISEA++  GESDPE AFW+PP+GQRWD D+GGDRWGSGS
Subjt:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS

Query:  DSEHESDKVIAME------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        DSEHE DK+IAM+            DE+KHG+ ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  DSEHESDKVIAME------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

XP_038903229.1 surfeit locus protein 2-like isoform X2 [Benincasa hispida]5.0e-10683.33Show/hide
Query:  MATSTAEER--ADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
        MATSTAEER  A VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSTAEER--ADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS
        IWKHINGKRFLNKLEQKELEK  +AKSGEQQGKKKA KA KPS ENS    KKKKK  E TISEA++  GESDPE AFW+PP+GQRWD D+GGDRWGSGS
Subjt:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS

Query:  DSEHESDKVI-----AMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        DSEHE D+ I        DE+KHG+ ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  DSEHESDKVI-----AMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

TrEMBL top hitse value%identityAlignment
A0A0A0LD47 Uncharacterized protein7.3e-10378.95Show/hide
Query:  MATSTAEE--RADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
        M+TSTAEE   A VEG +LLGQPTFTELDNGRFRCVETGHE++ KD DSYSRTKRCRLGLID ALS RKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSTAEE--RADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS
        IWKHINGKRFLNKLEQKE EK+L+AKSGEQQ KKKA KA KPS+ENS   KKKKKK  E TISEA+E  GES+ E AFW+PPVGQRWD DNGGDRW SGS
Subjt:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS

Query:  DSEHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        DSEHESDK+IAM+             DE+KH + ESDELSK+TKRMSIEI PSSFASRKKKSKKSS
Subjt:  DSEHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

A0A1S3CMR8 surfeit locus protein 22.4e-10681.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        M+TSTAEE   VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGL+DFALS RKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKE EKD +AKSGEQQ KKKA K  KPSTENS   KKKKKK  E TISEA+E  GESDPE AFW+PPVGQRWD DNGGDRW SGSD 
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHESDK+IAM+             DE+KH + ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

A0A5D3D5E1 Surfeit locus protein 22.4e-10681.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        M+TSTAEE   VEG +LLGQPTFTELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGL+DFALS RKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKE EKD +AKSGEQQ KKKA K  KPSTENS   KKKKKK  E TISEA+E  GESDPE AFW+PPVGQRWD DNGGDRW SGSD 
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHESDK+IAM+             DE+KH + ESDELSKRTKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAME-------------DENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

A0A6J1DF42 surfeit locus protein 2-like8.3e-10784.06Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        MATS  EERA+VEG +LLGQPTFTEL+NGRFRCVETGHE+L KD D YSRTKRCR+GLIDFALSHRKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS
        KHINGKRFLNKLEQKELEK+  AKSGEQQ KKKA KASK STENS+   KKKKK LE T SEARE  G +D E AFW+PPVGQRWDCDNGGDRW SGSDS
Subjt:  KHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDS

Query:  EHESDKVIAMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        EHES+KVIAM+DE+K G+TESDELSK+TKRMSIEI PSSFASRKKKSKKSS
Subjt:  EHESDKVIAMEDENKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

A0A6J1F4F2 uncharacterized protein LOC1114420702.1e-10281.96Show/hide
Query:  MATSTAEERA--DVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH
        MAT T EERA   VEG +LLGQPTF ELDNGRFRCVETGHEVLAKD DSYSRTKRCRLGLIDFALSHRKAPLNMFE DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSTAEERA--DVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS
        IWKHINGKRFLNKLEQKELEKD +AKSGEQ+GKKKA KA K STENS    KK KK LE  ISEARES G SD E  FW+PP GQRWD DNGGDRWGS S
Subjt:  IWKHINGKRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGS

Query:  DSEHESDKVIAMEDE--NKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS
        DSEHESDK+ A++DE  +KHG+ E+ ELS RTKRMSIEI PSSFASRKKKSKKSS
Subjt:  DSEHESDKVIAMEDE--NKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKKSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G14440.1 Surfeit locus protein 2 (SURF2)3.0e-5647.67Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EG +LLG+P + +L+NGRF+CV+TGHE+L KD   YS++KRCRLGLID+ALSH K PLN+FEQDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKD---LLAKSGEQQGKKKATK-ASKPSTENSHNIKKKKKKGLE---NTISEARESPGESD----PEHAFWVPPVGQRWDCDN
        KHI G+RFLN+LE+KE EK+   + A+ GE   K+   +   K   +  +N KKK KK +E   N    A E   E+D     E  FW+PP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKD---LLAKSGEQQGKKKATK-ASKPSTENSHNIKKKKKKGLE---NTISEARESPGESD----PEHAFWVPPVGQRWDCDN

Query:  GGDRWGSGSDS------------EHESDKVIAMEDENKHGKTE-------SDELSKRTKRMSIEIEPSSFASRKKKSKK
        G DRWGS SDS            E + D  I+ME+    G+ +        D  +   KR   E+  S   S+KK  KK
Subjt:  GGDRWGSGSDS------------EHESDKVIAMEDENKHGKTE-------SDELSKRTKRMSIEIEPSSFASRKKKSKK

AT5G14440.2 Surfeit locus protein 2 (SURF2)2.7e-5748.2Show/hide
Query:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EG +LLG+P + +L+NGRF+CV+TGHE+L KD   YS++KRCRLGLID+ALSH K PLN+FEQDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKD---LLAKSGEQQGKKKATK-ASKPSTENSHNIKKKKKKGLE---NTISEARESPGESD----PEHAFWVPPVGQRWDCDN
        KHI G+RFLN+LE+KE EK+   + A+ GE   K+   +   K   +  +N KKK KK +E   N    A E   E+D     E  FW+PP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKD---LLAKSGEQQGKKKATK-ASKPSTENSHNIKKKKKKGLE---NTISEARESPGESD----PEHAFWVPPVGQRWDCDN

Query:  GGDRWGSGSDS------------EHESDKVIAMEDE------NKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKK
        G DRWGS SDS            E + D  I+ME+       ++ GK   D+     KR   E+  S   S+KK  KK
Subjt:  GGDRWGSGSDS------------EHESDKVIAMEDE------NKHGKTESDELSKRTKRMSIEIEPSSFASRKKKSKK

AT5G40570.1 Surfeit locus protein 2 (SURF2)2.3e-4851.02Show/hide
Query:  EGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL
        EG++L G PTF +L NGR RCVETGHEVLA D++SY+R KRCRLGLI+ ALS  K PLNMF Q PLSRSKL CKLTGDT+NK EEHIWKH+NGKRFL+KL
Subjt:  EGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL

Query:  EQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDSEHESDKVIAMED
        EQ E       +     G+ + T+ + P     HN   K+  G E           ESD    FW+P                S SDSE + +     ++
Subjt:  EQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDSEHESDKVIAMED

Query:  ENKHGK----TESDELSKRTKRMSIEIEPSSFASRKKKSKKSSIN
        EN  G      ES+ELS+RTKRMS+EI PSSFASRKKKSK   ++
Subjt:  ENKHGK----TESDELSKRTKRMSIEIEPSSFASRKKKSKKSSIN

AT5G40570.2 Surfeit locus protein 2 (SURF2)2.8e-4649.6Show/hide
Query:  EGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING
        EG++L G PTF +L NGR RCVETGHEVLA D++SY+R KRCRLGLI+ ALS  K PLNMF Q PLSR       SKL CKLTGDT+NK EEHIWKH+NG
Subjt:  EGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING

Query:  KRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDSEHESD
        KRFL+KLEQ E       +     G+ + T+ + P     HN   K+  G E           ESD    FW+P                S SDSE + +
Subjt:  KRFLNKLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDSEHESD

Query:  KVIAMEDENKHGK----TESDELSKRTKRMSIEIEPSSFASRKKKSKKSSIN
             ++EN  G      ES+ELS+RTKRMS+EI PSSFASRKKKSK   ++
Subjt:  KVIAMEDENKHGK----TESDELSKRTKRMSIEIEPSSFASRKKKSKKSSIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGAGCACCGCCGAGGAGAGGGCGGATGTGGAAGGGGAGAACCTGCTGGGCCAACCGACCTTCACAGAGCTCGACAATGGCCGCTTCCGGTGCGTGGAGACCGG
CCACGAAGTCCTCGCCAAGGATATGGACTCCTACTCACGGACCAAGCGCTGCCGTCTCGGCCTCATCGACTTCGCTTTGTCCCATCGAAAAGCTCCCCTCAATATGTTCG
AGCAGGACCCTCTCTCCCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACGGAGGAACATATATGGAAGCACATTAATGGAAAACGTTTCCTCAAC
AAATTAGAGCAAAAGGAGTTAGAGAAAGACTTATTGGCTAAATCAGGAGAGCAGCAAGGCAAGAAGAAGGCCACTAAGGCTTCAAAGCCAAGTACAGAAAATTCACATAA
TATTAAAAAGAAGAAGAAGAAGGGACTGGAGAATACTATTTCTGAAGCAAGGGAATCTCCTGGGGAGAGTGACCCAGAGCATGCCTTTTGGGTGCCTCCAGTAGGGCAGC
GTTGGGATTGTGACAACGGAGGAGACCGATGGGGTTCAGGATCAGATTCAGAGCATGAAAGTGATAAGGTCATTGCAATGGAAGACGAGAATAAACATGGTAAAACCGAG
TCAGATGAATTGTCTAAGCGGACCAAAAGAATGTCCATAGAGATTGAACCCAGCAGCTTTGCTTCAAGAAAGAAGAAGAGTAAGAAGAGCTCCATTAACTCACATGTTCT
CGCTGGCTGGCGGTTACGCTCGAAGACCCGGCCCCATTGCTTTGTTTTTTCTTTTTCTTTTTCTTTTTTTAAATTCCAACTGCATCCAAACCATAGCGCCTCTGTCTTCG
CCAGCCGCCGTTTCCGCCGTCATTCGACGCCAATTTGGTGCCCAAAGAAGTCGGCGACTCTACCTCCTCTTATGCATTGGAGTGTGATCTTCAATAGTCTAGCTTCAGGA
GCTATTGGCGTGCCCCGGTTTTCTTTAAGACTATTTTTCGGATGGCACAATCTCTCTGTTGCTTGTGAAAAATTGTGTCGTTCCCCGTTTGTTAGGTTGGGGTTGGGCAT
CCTCTTCCCACCCAAGAGAGGGGATACTAATATGAAGGAGATGATTTTGTCTGTTGTTGCCCATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGAGCACCGCCGAGGAGAGGGCGGATGTGGAAGGGGAGAACCTGCTGGGCCAACCGACCTTCACAGAGCTCGACAATGGCCGCTTCCGGTGCGTGGAGACCGG
CCACGAAGTCCTCGCCAAGGATATGGACTCCTACTCACGGACCAAGCGCTGCCGTCTCGGCCTCATCGACTTCGCTTTGTCCCATCGAAAAGCTCCCCTCAATATGTTCG
AGCAGGACCCTCTCTCCCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACGGAGGAACATATATGGAAGCACATTAATGGAAAACGTTTCCTCAAC
AAATTAGAGCAAAAGGAGTTAGAGAAAGACTTATTGGCTAAATCAGGAGAGCAGCAAGGCAAGAAGAAGGCCACTAAGGCTTCAAAGCCAAGTACAGAAAATTCACATAA
TATTAAAAAGAAGAAGAAGAAGGGACTGGAGAATACTATTTCTGAAGCAAGGGAATCTCCTGGGGAGAGTGACCCAGAGCATGCCTTTTGGGTGCCTCCAGTAGGGCAGC
GTTGGGATTGTGACAACGGAGGAGACCGATGGGGTTCAGGATCAGATTCAGAGCATGAAAGTGATAAGGTCATTGCAATGGAAGACGAGAATAAACATGGTAAAACCGAG
TCAGATGAATTGTCTAAGCGGACCAAAAGAATGTCCATAGAGATTGAACCCAGCAGCTTTGCTTCAAGAAAGAAGAAGAGTAAGAAGAGCTCCATTAACTCACATGTTCT
CGCTGGCTGGCGGTTACGCTCGAAGACCCGGCCCCATTGCTTTGTTTTTTCTTTTTCTTTTTCTTTTTTTAAATTCCAACTGCATCCAAACCATAGCGCCTCTGTCTTCG
CCAGCCGCCGTTTCCGCCGTCATTCGACGCCAATTTGGTGCCCAAAGAAGTCGGCGACTCTACCTCCTCTTATGCATTGGAGTGTGATCTTCAATAGTCTAGCTTCAGGA
GCTATTGGCGTGCCCCGGTTTTCTTTAAGACTATTTTTCGGATGGCACAATCTCTCTGTTGCTTGTGAAAAATTGTGTCGTTCCCCGTTTGTTAGGTTGGGGTTGGGCAT
CCTCTTCCCACCCAAGAGAGGGGATACTAATATGAAGGAGATGATTTTGTCTGTTGTTGCCCATGAGTAG
Protein sequenceShow/hide protein sequence
MATSTAEERADVEGENLLGQPTFTELDNGRFRCVETGHEVLAKDMDSYSRTKRCRLGLIDFALSHRKAPLNMFEQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLN
KLEQKELEKDLLAKSGEQQGKKKATKASKPSTENSHNIKKKKKKGLENTISEARESPGESDPEHAFWVPPVGQRWDCDNGGDRWGSGSDSEHESDKVIAMEDENKHGKTE
SDELSKRTKRMSIEIEPSSFASRKKKSKKSSINSHVLAGWRLRSKTRPHCFVFSFSFSFFKFQLHPNHSASVFASRRFRRHSTPIWCPKKSATLPPLMHWSVIFNSLASG
AIGVPRFSLRLFFGWHNLSVACEKLCRSPFVRLGLGILFPPKRGDTNMKEMILSVVAHE