; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023243 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023243
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHolliday junction resolvase MOC1, chloroplastic
Genome locationtig00000892:1389017..1396061
RNA-Seq ExpressionSgr023243
SyntenySgr023243
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570430.1 Holliday junction resolvase MOC1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]9.6e-9475.4Show/hide
Query:  VELMESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCT-SSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTV
        VELMESLP NAHPLQAQS+CFM SLSSKLK ELH FR LCT SSSS V +  +PT   SSVRKES+ G KLKIA+TQLKDNWLASLSCPFPL HD+ S  
Subjt:  VELMESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCT-SSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTV

Query:  GSRGDLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQ
        G  GD NA+SDCVIGVDPD         V   + LL      D  I +S QV+DSPHLQVL+GG+RRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQ
Subjt:  GSRGDLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQ

Query:  DGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        DGKQGWWSGGFGYGLWIG+LVGLG+SVVPVPSLAWKNKF+LSGKDTSK
Subjt:  DGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

KAG7010304.1 hypothetical protein SDJN02_27097 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-9375.92Show/hide
Query:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCT-SSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSR
        MESLP NAHPLQAQS+CFM SLSSKLK ELH FR LCT SSSS V +  +PT   SSVRKES+ G KLKIA+TQLKDNWLASLSCPFPLGHD+ ST G  
Subjt:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCT-SSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSR

Query:  GDLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGK
        GD NA+SDCVIGVDPD         V   + LL      D  I +S QV+DSPHLQVL+GG+RRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGK
Subjt:  GDLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGK

Query:  QGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        QGWWSGGFGYGLWIG+LVGLG+SVVPVPSLAWKNKF+LSGKDTSK
Subjt:  QGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

XP_022153090.1 uncharacterized protein LOC111020674 isoform X1 [Momordica charantia]3.9e-9576.19Show/hide
Query:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV
        MESLPF  NAHP  LQ QS+CFMNSLSSKLKPE +HFRLLCTSSS  VH P++P    SS RKES G AKLKIA++QLKDNWLASLS PFPLGHDH    
Subjt:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV

Query:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
        GSRGDLNASSDCVIGVDPD    + LL   YSV                   S QV+DSPHLQVL+GGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
Subjt:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST

Query:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        PYPQDGKQGWWSGGFGYGLWIGILVGLG+SV+PVPSLAWKNKFELSGKDTSK
Subjt:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

XP_022153091.1 uncharacterized protein LOC111020674 isoform X2 [Momordica charantia]3.9e-9576.19Show/hide
Query:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV
        MESLPF  NAHP  LQ QS+CFMNSLSSKLKPE +HFRLLCTSSS  VH P++P    SS RKES G AKLKIA++QLKDNWLASLS PFPLGHDH    
Subjt:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV

Query:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
        GSRGDLNASSDCVIGVDPD    + LL   YSV                   S QV+DSPHLQVL+GGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
Subjt:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST

Query:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        PYPQDGKQGWWSGGFGYGLWIGILVGLG+SV+PVPSLAWKNKFELSGKDTSK
Subjt:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

XP_022943519.1 Holliday junction resolvase MOC1, chloroplastic [Cucurbita moschata]3.6e-9375.82Show/hide
Query:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG
        MESLP NAHPLQAQS+CFM SLSSKLK ELH FR LCTSSSS V +  +PT   SSVRKES+ G KLKIA+TQLKDNWLASLSCPFPLGHD+ S  G  G
Subjt:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG

Query:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ
        D NA+SDCVIGVDPD         V   + LL      D  I +S QV+DSPHLQVL+GG+RRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGKQ
Subjt:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ

Query:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        GWWSGGFGYGLWIG+LVGLG+SVVPVPSLAWKNKF+LSGKDTSK
Subjt:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

TrEMBL top hitse value%identityAlignment
A0A5D3B8B8 Uncharacterized protein2.2e-8371.19Show/hide
Query:  PLQAQSYCFMNSL-SSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDC
        PLQA S  FM SL SSKLKP+LHHFR LC+SS S + +PE+ TT  SS+RK+S+G AKL IA+ QLKDNWLASLSCPFPLGHD  S   SR D NA+S+C
Subjt:  PLQAQSYCFMNSL-SSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDC

Query:  VIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFG
        VIGVDPD         V   + LL      D  I +S QV+DSPH+QVL+GGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQS P+P+DGKQGWW GGFG
Subjt:  VIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFG

Query:  YGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        YGLWIG+LVGLG+SVVPVP LAWKNKFELSGKDTSK
Subjt:  YGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

A0A6J1DFT4 uncharacterized protein LOC111020674 isoform X11.9e-9576.19Show/hide
Query:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV
        MESLPF  NAHP  LQ QS+CFMNSLSSKLKPE +HFRLLCTSSS  VH P++P    SS RKES G AKLKIA++QLKDNWLASLS PFPLGHDH    
Subjt:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV

Query:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
        GSRGDLNASSDCVIGVDPD    + LL   YSV                   S QV+DSPHLQVL+GGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
Subjt:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST

Query:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        PYPQDGKQGWWSGGFGYGLWIGILVGLG+SV+PVPSLAWKNKFELSGKDTSK
Subjt:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

A0A6J1DJM7 uncharacterized protein LOC111020674 isoform X21.9e-9576.19Show/hide
Query:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV
        MESLPF  NAHP  LQ QS+CFMNSLSSKLKPE +HFRLLCTSSS  VH P++P    SS RKES G AKLKIA++QLKDNWLASLS PFPLGHDH    
Subjt:  MESLPF--NAHP--LQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAG-AKLKIANTQLKDNWLASLSCPFPLGHDHLSTV

Query:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
        GSRGDLNASSDCVIGVDPD    + LL   YSV                   S QV+DSPHLQVL+GGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST
Subjt:  GSRGDLNASSDCVIGVDPD----LPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQST

Query:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        PYPQDGKQGWWSGGFGYGLWIGILVGLG+SV+PVPSLAWKNKFELSGKDTSK
Subjt:  PYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

A0A6J1FUK2 Holliday junction resolvase MOC1, chloroplastic1.8e-9375.82Show/hide
Query:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG
        MESLP NAHPLQAQS+CFM SLSSKLK ELH FR LCTSSSS V +  +PT   SSVRKES+ G KLKIA+TQLKDNWLASLSCPFPLGHD+ S  G  G
Subjt:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG

Query:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ
        D NA+SDCVIGVDPD         V   + LL      D  I +S QV+DSPHLQVL+GG+RRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGKQ
Subjt:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ

Query:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        GWWSGGFGYGLWIG+LVGLG+SVVPVPSLAWKNKF+LSGKDTSK
Subjt:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

A0A6J1JDX6 Holliday junction resolvase MOC1, chloroplastic1.8e-9073.77Show/hide
Query:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG
        MESLPFNAHPLQAQS+CFM SL      ELH FR LCTSSSS   +  +PT   SSVRKES+ G KLKIA+TQLKDNWLASLSCPFPLGHD+ S  G  G
Subjt:  MESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESA-GAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRG

Query:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ
        D NA+SDCVIGVDPD         V   + LL      D  I +S QV+DSPHLQVL+GG+RRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGKQ
Subjt:  DLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQ

Query:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        GWWSGGFGYGLWIG+LVGLG+SVVPVPSLAWKNKF+LSGKDTSK
Subjt:  GWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

SwissProt top hitse value%identityAlignment
Q8GWA2 Holliday junction resolvase MOC1, chloroplastic1.5e-3641.33Show/hide
Query:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLPLL
        M+SL SK++P L H       S S       P TR  S    +      I    +K+ WL SLS       D  +T       NA S C+IG+DPDL   
Subjt:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLPLL

Query:  LRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGL
                    L     +  G  +  QV+D+PH+ VL+G R RKRLDAKSIVQL+ S + P G+  Y+EQS P+P+DGKQGW+SGGFGYGLWIG LV  
Subjt:  LRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGL

Query:  GYSVVPVPSLAWKNKFELSGKDTSK
        G+ V+PV +  WK  F+L+    +K
Subjt:  GYSVVPVPSLAWKNKFELSGKDTSK

Arabidopsis top hitse value%identityAlignment
AT2G26840.1 unknown protein1.0e-3741.33Show/hide
Query:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLPLL
        M+SL SK++P L H       S S       P TR  S    +      I    +K+ WL SLS       D  +T       NA S C+IG+DPDL   
Subjt:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLPLL

Query:  LRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGL
                    L     +  G  +  QV+D+PH+ VL+G R RKRLDAKSIVQL+ S + P G+  Y+EQS P+P+DGKQGW+SGGFGYGLWIG LV  
Subjt:  LRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGL

Query:  GYSVVPVPSLAWKNKFELSGKDTSK
        G+ V+PV +  WK  F+L+    +K
Subjt:  GYSVVPVPSLAWKNKFELSGKDTSK

AT2G26840.2 unknown protein4.8e-3536.47Show/hide
Query:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLP--
        M+SL SK++P L H       S S       P TR  S    +      I    +K+ WL SLS       D  +T       NA S C+IG+DPDL   
Subjt:  MNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPTTRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLP--

Query:  -------------------------------LLLRGYSVIDCIYLLGCFSHNDRGIKNS--------RQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSF
                                       L+LR Y  +   Y++   +    G   S         +V+D+PH+ VL+G R RKRLDAKSIVQL+ S 
Subjt:  -------------------------------LLLRGYSVIDCIYLLGCFSHNDRGIKNS--------RQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSF

Query:  NAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK
        + P G+  Y+EQS P+P+DGKQGW+SGGFGYGLWIG LV  G+ V+PV +  WK  F+L+    +K
Subjt:  NAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSK

AT3G43910.1 unknown protein1.9e-0728.46Show/hide
Query:  VIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFG
        +IG+DP+L               L     +D+G     QV+D+P L+V++   R +  + KS+++L+ S + P GT A++ +   +P++     ++ G G
Subjt:  VIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRRRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFG

Query:  YGLWIGILVGLGYSVVPVPSLAWKNKFELS
         GLW   L+    SV+ V    W   F LS
Subjt:  YGLWIGILVGLGYSVVPVPSLAWKNKFELS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCCCGACACTGTGAACCAGTTGAATCTGCGTCTTGTTACATGGTCCATTTTCGCAATTGCTCTTGCAAATGCCAATTCTTTGGGCTCGGTAATTGGGGGCCCATC
GTTTGATCTATTGAGTTGTTGGGCCTTTGAGGAAATGGTGAATGGGCCTGAGCCGGCCTGTGCCGAGACCGGCGATCGAGGAGAGGAGGCGGTGGAGCTTTTGCTACTGC
GGTTGTCGTTGCGGCCTTCAAGACGCGCCGGAAGTTTGAGCCGAGGAGGAAGAGGAAGGACCTTTCCATCTGAAAACAGTTCATCGGCAAACGCCATCGCAGATCGAATC
GATGTCGTTGAGATTGAAACGACAGCTAGTTTCGAATTCAAACTCATCAATGTGCTTCGGACTGGATGGCTCACTGTAAAACTCCATGTTCTTCAGATTCATTTTCCGGA
ACTTCCGATAGGAAAATCGCGGAGTTATCTATGTAATATCGGTCCCTTCTTCGAAGTGGTTGAATTGATGGAATCCCTTCCATTCAACGCACATCCATTGCAAGCGCAGT
CCTACTGTTTCATGAACTCCCTCTCTTCCAAGCTCAAACCCGAACTTCACCACTTCAGACTCCTCTGCACTTCTTCTTCATCAGTTGTTCACACTCCAGAAGTTCCGACT
ACAAGACCTTCTTCTGTTCGGAAAGAGAGCGCTGGAGCTAAATTGAAGATCGCCAATACTCAACTTAAGGACAACTGGTTGGCTTCTCTATCGTGTCCTTTTCCTCTAGG
TCACGATCATCTCTCCACCGTTGGTAGCCGAGGAGACCTAAATGCGAGTTCGGACTGTGTTATCGGGGTCGATCCCGATCTACCGTTGTTGCTTCGTGGATATTCGGTAA
TAGATTGTATATACTTACTGGGCTGCTTTAGCCATAATGATCGAGGGATTAAAAATAGTAGACAGGTATTTGATTCTCCACACCTACAAGTACTGATTGGTGGAAGGAGA
CGGAAACGTTTAGATGCGAAGTCAATTGTCCAGCTTCTTCATAGCTTCAATGCTCCCATTGGAACTACTGCATATCTGGAGCAGTCGACCCCATATCCACAGGATGGAAA
ACAGGGGTGGTGGAGTGGAGGATTTGGTTATGGATTGTGGATTGGCATATTAGTCGGGCTGGGATATTCTGTTGTTCCTGTGCCATCTCTTGCGTGGAAAAACAAATTTG
AGCTCTCGGGAAAGGATACTTCTAAGCTGAGGTGTCGTGAAGCTATTATGCCTAACGAAACTAACGAATTTCTTTTCCGTCATAGATTTGGTCAAGATCTAGAAGTAATG
ACTAAATCGGTGGCTCCTAGCATTAGTGAGGCTGGACATGAATTTGGAAACACCATCTGTCTTAGAGGAGTCGGTAGCGTCTTAGAAATGGTAAAGGAGTCCCAGCATTC
TTCTTCCTCCCCTGAGTTCAGGTTCAAGTTTGGTATGAATAGAATGAATGATGAGAAATGGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCCCGACACTGTGAACCAGTTGAATCTGCGTCTTGTTACATGGTCCATTTTCGCAATTGCTCTTGCAAATGCCAATTCTTTGGGCTCGGTAATTGGGGGCCCATC
GTTTGATCTATTGAGTTGTTGGGCCTTTGAGGAAATGGTGAATGGGCCTGAGCCGGCCTGTGCCGAGACCGGCGATCGAGGAGAGGAGGCGGTGGAGCTTTTGCTACTGC
GGTTGTCGTTGCGGCCTTCAAGACGCGCCGGAAGTTTGAGCCGAGGAGGAAGAGGAAGGACCTTTCCATCTGAAAACAGTTCATCGGCAAACGCCATCGCAGATCGAATC
GATGTCGTTGAGATTGAAACGACAGCTAGTTTCGAATTCAAACTCATCAATGTGCTTCGGACTGGATGGCTCACTGTAAAACTCCATGTTCTTCAGATTCATTTTCCGGA
ACTTCCGATAGGAAAATCGCGGAGTTATCTATGTAATATCGGTCCCTTCTTCGAAGTGGTTGAATTGATGGAATCCCTTCCATTCAACGCACATCCATTGCAAGCGCAGT
CCTACTGTTTCATGAACTCCCTCTCTTCCAAGCTCAAACCCGAACTTCACCACTTCAGACTCCTCTGCACTTCTTCTTCATCAGTTGTTCACACTCCAGAAGTTCCGACT
ACAAGACCTTCTTCTGTTCGGAAAGAGAGCGCTGGAGCTAAATTGAAGATCGCCAATACTCAACTTAAGGACAACTGGTTGGCTTCTCTATCGTGTCCTTTTCCTCTAGG
TCACGATCATCTCTCCACCGTTGGTAGCCGAGGAGACCTAAATGCGAGTTCGGACTGTGTTATCGGGGTCGATCCCGATCTACCGTTGTTGCTTCGTGGATATTCGGTAA
TAGATTGTATATACTTACTGGGCTGCTTTAGCCATAATGATCGAGGGATTAAAAATAGTAGACAGGTATTTGATTCTCCACACCTACAAGTACTGATTGGTGGAAGGAGA
CGGAAACGTTTAGATGCGAAGTCAATTGTCCAGCTTCTTCATAGCTTCAATGCTCCCATTGGAACTACTGCATATCTGGAGCAGTCGACCCCATATCCACAGGATGGAAA
ACAGGGGTGGTGGAGTGGAGGATTTGGTTATGGATTGTGGATTGGCATATTAGTCGGGCTGGGATATTCTGTTGTTCCTGTGCCATCTCTTGCGTGGAAAAACAAATTTG
AGCTCTCGGGAAAGGATACTTCTAAGCTGAGGTGTCGTGAAGCTATTATGCCTAACGAAACTAACGAATTTCTTTTCCGTCATAGATTTGGTCAAGATCTAGAAGTAATG
ACTAAATCGGTGGCTCCTAGCATTAGTGAGGCTGGACATGAATTTGGAAACACCATCTGTCTTAGAGGAGTCGGTAGCGTCTTAGAAATGGTAAAGGAGTCCCAGCATTC
TTCTTCCTCCCCTGAGTTCAGGTTCAAGTTTGGTATGAATAGAATGAATGATGAGAAATGGATCTGA
Protein sequenceShow/hide protein sequence
MGPDTVNQLNLRLVTWSIFAIALANANSLGSVIGGPSFDLLSCWAFEEMVNGPEPACAETGDRGEEAVELLLLRLSLRPSRRAGSLSRGGRGRTFPSENSSSANAIADRI
DVVEIETTASFEFKLINVLRTGWLTVKLHVLQIHFPELPIGKSRSYLCNIGPFFEVVELMESLPFNAHPLQAQSYCFMNSLSSKLKPELHHFRLLCTSSSSVVHTPEVPT
TRPSSVRKESAGAKLKIANTQLKDNWLASLSCPFPLGHDHLSTVGSRGDLNASSDCVIGVDPDLPLLLRGYSVIDCIYLLGCFSHNDRGIKNSRQVFDSPHLQVLIGGRR
RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKQGWWSGGFGYGLWIGILVGLGYSVVPVPSLAWKNKFELSGKDTSKLRCREAIMPNETNEFLFRHRFGQDLEVM
TKSVAPSISEAGHEFGNTICLRGVGSVLEMVKESQHSSSSPEFRFKFGMNRMNDEKWI