CuGenDBv2

Moc08g14380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g14380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: surfeit locus protein 2-like
Genome location: chr8:10888817..10892064
RNA-Seq Expression: Moc08g14380
Synteny: Moc08g14380
Gene Ontology terms: NA
InterPro domains: IPR008833 - Surfeit locus protein 2
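
As a small illustration, the genome location string can be split into chromosome, start, and end. This is a minimal sketch rather than CuGenDB tooling: the function names `parse_location` and `span_length` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("chr8:10888817..10892064")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3248 bp on chr8, considerably longer than the CDS listed under Sequences.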


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017791.1 hypothetical protein SDJN02_19657, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.2e-100 | % identity: 82.61
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLGQPTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMDDED--KPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        HES+K+ A+DDED  K GE E+ ELS +TKRMSIEIGPSSFASRKKKSKKSSK
Subjt:  HESEKVIAMDDED--KPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

XP_004141117.1 surfeit locus protein 2 [Cucumis sativus] | e value: 1.1e-101 | % identity: 80.23
Query:  MATSRAEE--RANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        M+TS AEE   A VEG DLLGQPTFTEL+NGRFRCVETGHEL+ KDKD+YSRTKRCR+GLID ALS RKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEE--RANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EKE  AKSGEQQ+KKKAAKA K S+ENS KKKKKE E+T SEA+E NG +++EDAFWMPPVGQRWD DNGGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
        HES+K+IAMD             DEDK  E ESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
Subjt:  HESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS

XP_022152129.1 surfeit locus protein 2-like [Momordica charantia] | e value: 7.8e-129 | % identity: 99.19
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MATSR EERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKD YSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
        KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
Subjt:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES

Query:  EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
Subjt:  EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
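
The 99.19% identity reported for XP_022152129.1 can be reproduced from the match line (the middle line of each alignment block), where a letter marks an identical residue and a space or `+` marks a difference. The sketch below is only an illustration: `percent_identity` is a hypothetical helper, and it assumes the match-line segments are copied at full width, since a segment ending in a mismatch would lose its trailing space if stripped.

```python
# Match-line segments copied from the XP_022152129.1 alignment above.
MIDLINE_SEGMENTS = [
    "MATSR EERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKD YSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW",
    "KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES",
    "EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK",
]


def percent_identity(midline_segments):
    """Identical columns (letters) as a percentage of all alignment columns."""
    midline = "".join(midline_segments)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


print(round(percent_identity(MIDLINE_SEGMENTS), 2))  # 99.19
```

Here 246 of the 248 aligned columns are identical, which rounds to the listed 99.19.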

XP_022935099.1 uncharacterized protein LOC111442070 [Cucurbita moschata] | e value: 9.0e-101 | % identity: 82.61
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLGQPTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        HES+K+ A+DDEDK   GE E+ ELS +TKRMSIEIGPSSFASRKKKSKKSSK
Subjt:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

XP_038903228.1 surfeit locus protein 2-like isoform X1 [Benincasa hispida] | e value: 9.6e-103 | % identity: 81.68
Query:  MATSRAEER--ANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MATS AEER  A VEGADLLGQPTFTEL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEER--ANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK S AKSGEQQ KKKAAKA K S ENS KKKKKE E+T SEA++ NG +D EDAFWMPP+GQRWD D+GGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMD------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
        HE +K+IAMD            DEDK GE ESDELSK+TKRMSIEIGPSSFASRKKKSKKSS
Subjt:  HESEKVIAMD------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD47 Uncharacterized protein | e value: 5.1e-102 | % identity: 80.23
Query:  MATSRAEE--RANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        M+TS AEE   A VEG DLLGQPTFTEL+NGRFRCVETGHEL+ KDKD+YSRTKRCR+GLID ALS RKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEE--RANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EKE  AKSGEQQ+KKKAAKA K S+ENS KKKKKE E+T SEA+E NG +++EDAFWMPPVGQRWD DNGGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
        HES+K+IAMD             DEDK  E ESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
Subjt:  HESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS

A0A5D3D5E1 Surfeit locus protein 2 | e value: 3.1e-99 | % identity: 78.79
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        M+TS AEE   VEG  LLGQPTFTEL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GL+DFALS RKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENS---NKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDS
        KHINGKRFLNKLEQKE EK+S AKSGEQQ+KKKAAK  K STENS    KKKKKE E+T SEA+E NG +D EDAFWMPPVGQRWD DNGGDRW SGSD 
Subjt:  KHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENS---NKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDS

Query:  EHESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS
        EHES+K+IAMD             DEDK  E ESDELSK+TKRMSIEIGPSSFASRKKKSKKSS
Subjt:  EHESEKVIAMD-------------DEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSS

A0A6J1DF42 surfeit locus protein 2-like | e value: 3.8e-129 | % identity: 99.19
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MATSR EERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKD YSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
        KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
Subjt:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES

Query:  EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
Subjt:  EKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

A0A6J1F4F2 uncharacterized protein LOC111442070 | e value: 4.3e-101 | % identity: 82.61
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLGQPTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        HES+K+ A+DDEDK   GE E+ ELS +TKRMSIEIGPSSFASRKKKSKKSSK
Subjt:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

A0A6J1J5G4 uncharacterized protein LOC111481480 isoform X3 | e value: 1.1e-99 | % identity: 81.42
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLG+PTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        HES+K+ A DDEDK   GE E+ ELS +TKRMSIEIGPSSFASRKKKSKK+SK
Subjt:  HESEKVIAMDDEDKP--GETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G14440.1 Surfeit locus protein 2 (SURF2) | e value: 3.4e-58 | % identity: 49.65
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EGADLLG+P + +LENGRF+CV+TGHELL KDK  YS++KRCR+GLID+ALSH K PLN+F+QDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN
        KHI G+RFLN+LE+KE EKES    A+ GE  AK+       K  K    N  KK KK +EK  +    A E    ND     E  FWMPP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN

Query:  GGDRWVSGSDSEHE-SEKVIAMDDEDKPGETESDEL------------------SKQTKRMSIEIGPSSFASRKKKSKKSSK
        G DRW S SDS+ E +E+   + + D+ GE   +E                   +   KR   E+G S   S+KK  KK  K
Subjt:  GGDRWVSGSDSEHE-SEKVIAMDDEDKPGETESDEL------------------SKQTKRMSIEIGPSSFASRKKKSKKSSK

AT5G14440.2 Surfeit locus protein 2 (SURF2) | e value: 4.1e-59 | % identity: 49.82
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EGADLLG+P + +LENGRF+CV+TGHELL KDK  YS++KRCR+GLID+ALSH K PLN+F+QDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN
        KHI G+RFLN+LE+KE EKES    A+ GE  AK+       K  K    N  KK KK +EK  +    A E    ND     E  FWMPP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN

Query:  GGDRWVSGSDSEHE------------------SEKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK
        G DRW S SDS+ E                   E+ I + + D+ G+   D+     KR   E+G S   S+KK  KK  K
Subjt:  GGDRWVSGSDSEHE------------------SEKVIAMDDEDKPGETESDELSKQTKRMSIEIGPSSFASRKKKSKKSSK

AT5G40570.1 Surfeit locus protein 2 (SURF2) | e value: 2.7e-47 | % identity: 50.85
Query:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL
        EG  L G PTF +L NGR RCVETGHE+L  D ++Y+R KRCR+GLI+ ALS  K PLNMF Q PLSRSKL CKLTGDT+NK EEHIWKH+NGKRFL+KL
Subjt:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL

Query:  EQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIAMDDEDKP
        EQ  +E+ +  SG  +            T+ +N +  KE           + G++  D FWMP                S SDSE + E     D+E+  
Subjt:  EQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIAMDDEDKP

Query:  G----ETESDELSKQTKRMSIEIGPSSFASRKKKSK
        G    + ES+ELS++TKRMS+EIGPSSFASRKKKSK
Subjt:  G----ETESDELSKQTKRMSIEIGPSSFASRKKKSK

AT5G40570.2 Surfeit locus protein 2 (SURF2) | e value: 3.3e-45 | % identity: 49.38
Query:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING
        EG  L G PTF +L NGR RCVETGHE+L  D ++Y+R KRCR+GLI+ ALS  K PLNMF Q PLSR       SKL CKLTGDT+NK EEHIWKH+NG
Subjt:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING

Query:  KRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIA
        KRFL+KLEQ  +E+ +  SG  +            T+ +N +  KE           + G++  D FWMP                S SDSE + E    
Subjt:  KRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIA

Query:  MDDEDKPG----ETESDELSKQTKRMSIEIGPSSFASRKKKSK
         D+E+  G    + ES+ELS++TKRMS+EIGPSSFASRKKKSK
Subjt:  MDDEDKPG----ETESDELSKQTKRMSIEIGPSSFASRKKKSK


Sequences
CDS sequence
ATGGCGACGAGTAGAGCCGAGGAGAGGGCGAATGTGGAAGGGGCCGATCTTCTGGGGCAACCGACCTTCACAGAGCTCGAAAATGGCCGATTCCGCTGCGTGGAGACCGG
CCACGAACTCCTCCCCAAGGATAAGGACACCTACTCTCGGACCAAGCGCTGCCGCATCGGCCTCATCGACTTCGCTTTGTCCCATCGAAAAGCTCCTCTCAATATGTTCG
ACCAGGATCCTCTCTCTCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACAGAGGAACATATATGGAAGCACATTAATGGCAAGCGTTTCCTCAAC
AAATTAGAGCAAAAGGAGTTAGAGAAAGAGTCGGCTAAATCAGGAGAGCAGCAAGCCAAGAAGAAGGCTGCTAAGGCTTCAAAGGCAAGTACAGAAAATTCAAATAAGAA
GAAGAAGAAGGAACTGGAGAAGACTACTTCTGAAGCAAGGGAACCCAATGGGGGGAATGACTCAGAGGACGCCTTCTGGATGCCTCCGGTAGGGCAGCGTTGGGATTGTG
ACAATGGAGGAGACCGATGGGTTTCAGGATCAGATTCGGAGCATGAAAGTGAGAAGGTCATTGCAATGGATGACGAGGATAAACCTGGTGAAACCGAGTCAGATGAATTA
TCTAAGCAGACCAAAAGAATGTCTATAGAGATTGGACCCAGTAGCTTTGCTTCAAGAAAGAAGAAAAGCAAGAAAAGCTCTAAGTGA
mRNA sequence
ATGGCGACGAGTAGAGCCGAGGAGAGGGCGAATGTGGAAGGGGCCGATCTTCTGGGGCAACCGACCTTCACAGAGCTCGAAAATGGCCGATTCCGCTGCGTGGAGACCGG
CCACGAACTCCTCCCCAAGGATAAGGACACCTACTCTCGGACCAAGCGCTGCCGCATCGGCCTCATCGACTTCGCTTTGTCCCATCGAAAAGCTCCTCTCAATATGTTCG
ACCAGGATCCTCTCTCTCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACAGAGGAACATATATGGAAGCACATTAATGGCAAGCGTTTCCTCAAC
AAATTAGAGCAAAAGGAGTTAGAGAAAGAGTCGGCTAAATCAGGAGAGCAGCAAGCCAAGAAGAAGGCTGCTAAGGCTTCAAAGGCAAGTACAGAAAATTCAAATAAGAA
GAAGAAGAAGGAACTGGAGAAGACTACTTCTGAAGCAAGGGAACCCAATGGGGGGAATGACTCAGAGGACGCCTTCTGGATGCCTCCGGTAGGGCAGCGTTGGGATTGTG
ACAATGGAGGAGACCGATGGGTTTCAGGATCAGATTCGGAGCATGAAAGTGAGAAGGTCATTGCAATGGATGACGAGGATAAACCTGGTGAAACCGAGTCAGATGAATTA
TCTAAGCAGACCAAAAGAATGTCTATAGAGATTGGACCCAGTAGCTTTGCTTCAAGAAAGAAGAAAAGCAAGAAAAGCTCTAAGTGA
Protein sequence
MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLN
KLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIAMDDEDKPGETESDEL
SKQTKRMSIEIGPSSFASRKKKSKKSSK
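
The relationship between the CDS and protein listings can be checked with a short translation sketch. This is an illustration only, assuming the standard genetic code; `translate` and `CDS_TAIL` are hypothetical names. To keep the example short it translates just the final CDS line, which in this listing begins on a codon boundary and ends with the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS sequence listed above (87 nt, ends in TGA).
CDS_TAIL = (
    "TCTAAGCAGACCAAAAGAATGTCTATAGAGATTGGACCCAGTAGCTTTGCT"
    "TCAAGAAAGAAGAAAAGCAAGAAAAGCTCTAAGTGA"
)

print(translate(CDS_TAIL))  # SKQTKRMSIEIGPSSFASRKKKSKKSSK
```

The result matches the last line of the protein sequence, as expected if the listed protein is the direct translation of the listed CDS.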