; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025171 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025171
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like
Genome locationtig00003412:1927207..1927713
RNA-Seq ExpressionSgr025171
SyntenySgr025171
Gene Ontology termsGO:0009299 - mRNA transcription (biological process)
GO:0009416 - response to light stimulus (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR006936 - ALOG domain
IPR040222 - ALOG family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589232.1 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10, partial [Cucurbita argyrosperma subsp. sororia]7.2e-7891.19Show/hide
Query:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA
        KDF+EGSSS+SSPQP GS PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQAWGSLDA
Subjt:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA

Query:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK
        LIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT    + K
Subjt:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK

KAG7022931.1 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-7794.08Show/hide
Query:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA
        KDF+EGSSS+SSPQP GS PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQAWGSLDA
Subjt:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA

Query:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
        LIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKK T
Subjt:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT

XP_022930813.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like [Cucurbita moschata]7.2e-7891.19Show/hide
Query:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA
        KDF+EGSSS+SSPQP GS PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQAWGSLDA
Subjt:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA

Query:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK
        LIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT    + K
Subjt:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK

XP_022988968.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like [Cucurbita maxima]2.1e-7791.82Show/hide
Query:  MSIERG-KDFAEGSSSSSSPQPGS-TPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ
        MS E G KDF+EGSSS+SSPQPG   PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQ
Subjt:  MSIERG-KDFAEGSSSSSSPQPGS-TPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ

Query:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
        AWGSLDALIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKK T
Subjt:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT

XP_023530720.1 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like [Cucurbita pepo subsp. pepo]1.5e-7889.76Show/hide
Query:  MSIERG-KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ
        MS E G KDF+EGSSS+SSPQP GS PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQ
Subjt:  MSIERG-KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ

Query:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK
        AWGSLDALIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT    + K
Subjt:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK

TrEMBL top hitse value%identityAlignment
A0A0A0K2D1 ALOG domain-containing protein3.6e-7588.34Show/hide
Query:  MSIER---GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTC
        MSIE     KDF+EGSSSSS   S    +TPSRYESQKRRDWNTFCQYLKNQRP VPLSHC+CN VLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTC
Subjt:  MSIER---GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTC

Query:  PLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
        PLRQAWGSLDALIGRLRAAYEENGGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
Subjt:  PLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT

A0A1S3C1B9 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 104.7e-7585.8Show/hide
Query:  MSIER--GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCP
        MSIE    KDF+EGSSSSS   S    +TPSRYESQKRRDWNTFCQYLKNQRP VPLSHC+CN VLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCP
Subjt:  MSIER--GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCP

Query:  LRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK
        LRQAWGSLDALIGRLRAAYEENGGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT    + K
Subjt:  LRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK

A0A5D3C6X1 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 102.8e-7588.89Show/hide
Query:  MSIER--GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCP
        MSIE    KDF+EGSSSSS   S    +TPSRYESQKRRDWNTFCQYLKNQRP VPLSHC+CN VLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCP
Subjt:  MSIER--GKDFAEGSSSSS---SPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCP

Query:  LRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
        LRQAWGSLDALIGRLRAAYEENGGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
Subjt:  LRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT

A0A6J1EWG4 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like3.5e-7891.19Show/hide
Query:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA
        KDF+EGSSS+SSPQP GS PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQAWGSLDA
Subjt:  KDFAEGSSSSSSPQP-GSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDA

Query:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK
        LIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT    + K
Subjt:  LIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELK

A0A6J1JNV9 protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 10-like1.0e-7791.82Show/hide
Query:  MSIERG-KDFAEGSSSSSSPQPGS-TPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ
        MS E G KDF+EGSSS+SSPQPG   PSRYESQKRRDWNTFCQYLKNQRP VPLSHCSCNHVLEFLRYLDQFGKTKVH+QGCMFYGQPEPPAPCTCPLRQ
Subjt:  MSIERG-KDFAEGSSSSSSPQPGS-TPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQ

Query:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT
        AWGSLDALIGRLRAAYEE+GGS ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKK T
Subjt:  AWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKT

SwissProt top hitse value%identityAlignment
A2XED8 Protein G1-like71.3e-5871.43Show/hide
Query:  SSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYE
        +++PQP +  SRYESQKRRDWNTF QYL+N RP + L+ CS  HV+EFLRYLDQFGKTKVH  GC FYGQP PP PC CPLRQAWGSLDALIGRLRAAYE
Subjt:  SSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYE

Query:  ENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE
        E+GG+ E+NPFA+ A+R+YLR+VR+ QAKARGIPY+KKK+K  +A +
Subjt:  ENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE

Q6ATW6 Protein G1-like89.5e-5771.33Show/hide
Query:  QPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYEENGG
        QP    SRYESQKRRDWNTF QYLKN RP + L+ CS  HV+EFL+YLDQFGKTKVH  GC +YGQP PPAPC CPLRQAWGSLDALIGRLRAAYEE+G 
Subjt:  QPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYEENGG

Query:  SSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE
        + E+NPFA+ A+R+YLR+VR+ QAKARGIPY+KKK+K  + ++
Subjt:  SSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE

Q941W1 Protein G1-like71.3e-5871.43Show/hide
Query:  SSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYE
        +++PQP +  SRYESQKRRDWNTF QYL+N RP + L+ CS  HV+EFLRYLDQFGKTKVH  GC FYGQP PP PC CPLRQAWGSLDALIGRLRAAYE
Subjt:  SSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLRAAYE

Query:  ENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE
        E+GG+ E+NPFA+ A+R+YLR+VR+ QAKARGIPY+KKK+K  +A +
Subjt:  ENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARE

Q9S7R3 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 107.3e-7386.27Show/hide
Query:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL
        ERGK   E S S    +P  TPSRYESQKRRDWNTF QYLKNQRP VP+SHCSCNHVL+FLRYLDQFGKTKVH+ GCMFYGQPEPPAPCTCPLRQAWGSL
Subjt:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL

Query:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        DALIGRLRAAYEENGG  ETNPFASGAIRVYLR+VRECQAKARGIPYKKKKKK
Subjt:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

Q9ZVA0 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 71.1e-5766.87Show/hide
Query:  RGKDFAEGSSSSSSP------QPGSTP-----SRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCT
        +GK  AEGSS   S       QP S P     SRYESQKRRDWNTFCQYL+NQ+P V +S C  NH+L+FL+YLDQFGKTKVH+ GC+F+GQ EP   C 
Subjt:  RGKDFAEGSSSSSSP------QPGSTP-----SRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCT

Query:  CPLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        CPL+QAWGSLDALIGRLRAA+EENGG  E NPFA G IRV+LR+VR+ QAKARG+PYKK+KK+
Subjt:  CPLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

Arabidopsis top hitse value%identityAlignment
AT1G07090.1 Protein of unknown function (DUF640)3.4e-5771.03Show/hide
Query:  GSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLR
        G S  SSP   +TPSRYESQKRRDWNTF QYLKN +P + LS CS  HV+EFL+YLDQFGKTKVH+  C ++G  +PP+PC+CPL+QAWGSLDALIGRLR
Subjt:  GSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRLR

Query:  AAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        AAYEENGG  ++NPFA+ A+R+YLR+VRE QAKARGIPY+KKK+K
Subjt:  AAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

AT1G78815.1 Protein of unknown function (DUF640)8.0e-5966.87Show/hide
Query:  RGKDFAEGSSSSSSP------QPGSTP-----SRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCT
        +GK  AEGSS   S       QP S P     SRYESQKRRDWNTFCQYL+NQ+P V +S C  NH+L+FL+YLDQFGKTKVH+ GC+F+GQ EP   C 
Subjt:  RGKDFAEGSSSSSSP------QPGSTP-----SRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCT

Query:  CPLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        CPL+QAWGSLDALIGRLRAA+EENGG  E NPFA G IRV+LR+VR+ QAKARG+PYKK+KK+
Subjt:  CPLRQAWGSLDALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

AT2G42610.1 Protein of unknown function (DUF640)5.2e-7486.27Show/hide
Query:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL
        ERGK   E S S    +P  TPSRYESQKRRDWNTF QYLKNQRP VP+SHCSCNHVL+FLRYLDQFGKTKVH+ GCMFYGQPEPPAPCTCPLRQAWGSL
Subjt:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL

Query:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        DALIGRLRAAYEENGG  ETNPFASGAIRVYLR+VRECQAKARGIPYKKKKKK
Subjt:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

AT2G42610.2 Protein of unknown function (DUF640)5.2e-7486.27Show/hide
Query:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL
        ERGK   E S S    +P  TPSRYESQKRRDWNTF QYLKNQRP VP+SHCSCNHVL+FLRYLDQFGKTKVH+ GCMFYGQPEPPAPCTCPLRQAWGSL
Subjt:  ERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSL

Query:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK
        DALIGRLRAAYEENGG  ETNPFASGAIRVYLR+VRECQAKARGIPYKKKKKK
Subjt:  DALIGRLRAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKK

AT4G18610.1 Protein of unknown function (DUF640)3.4e-5768.32Show/hide
Query:  KDFAEGSSSSSS-------PQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQA
        KD  +  SSSS+       PQP    SRYESQKRRDWNTF QYLK+Q P + +S     HVL FLRYLDQFGKTKVH Q C+F+GQP+PP PCTCPL+QA
Subjt:  KDFAEGSSSSSS-------PQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQA

Query:  WGSLDALIGRLRAAYEENGGSS-ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVK
        WGSLDALIGRLRAAYEE+GG S +TNPFA+G+IRV+LR+VRE QAKARGIPY+KKK++  K
Subjt:  WGSLDALIGRLRAAYEENGGSS-ETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATTGAAAGAGGCAAAGATTTTGCTGAAGGATCTTCGTCGAGCTCTTCGCCACAGCCTGGATCAACCCCAAGTCGATATGAATCCCAAAAACGAAGGGATTGGAA
CACATTTTGCCAATATTTGAAGAACCAGAGACCCTCAGTTCCACTTTCTCACTGCAGCTGCAATCATGTCTTGGAATTCCTCCGATATCTTGATCAATTTGGGAAAACAA
AAGTTCATCTTCAGGGTTGCATGTTTTATGGACAGCCTGAGCCACCAGCGCCATGTACTTGCCCACTTAGGCAAGCTTGGGGGAGTTTGGATGCTCTTATTGGGAGGTTG
AGAGCTGCCTATGAAGAAAATGGTGGTTCGTCGGAGACAAACCCTTTTGCTAGTGGTGCAATTAGGGTTTATCTCAGGGATGTGAGAGAGTGTCAAGCTAAAGCAAGGGG
AATTCCTTACAAAAAGAAGAAGAAGAAGACCGTCAAAGCAAGGGAACTGAAGAATCAAGCTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTATTGAAAGAGGCAAAGATTTTGCTGAAGGATCTTCGTCGAGCTCTTCGCCACAGCCTGGATCAACCCCAAGTCGATATGAATCCCAAAAACGAAGGGATTGGAA
CACATTTTGCCAATATTTGAAGAACCAGAGACCCTCAGTTCCACTTTCTCACTGCAGCTGCAATCATGTCTTGGAATTCCTCCGATATCTTGATCAATTTGGGAAAACAA
AAGTTCATCTTCAGGGTTGCATGTTTTATGGACAGCCTGAGCCACCAGCGCCATGTACTTGCCCACTTAGGCAAGCTTGGGGGAGTTTGGATGCTCTTATTGGGAGGTTG
AGAGCTGCCTATGAAGAAAATGGTGGTTCGTCGGAGACAAACCCTTTTGCTAGTGGTGCAATTAGGGTTTATCTCAGGGATGTGAGAGAGTGTCAAGCTAAAGCAAGGGG
AATTCCTTACAAAAAGAAGAAGAAGAAGACCGTCAAAGCAAGGGAACTGAAGAATCAAGCTCGATGA
Protein sequenceShow/hide protein sequence
MSIERGKDFAEGSSSSSSPQPGSTPSRYESQKRRDWNTFCQYLKNQRPSVPLSHCSCNHVLEFLRYLDQFGKTKVHLQGCMFYGQPEPPAPCTCPLRQAWGSLDALIGRL
RAAYEENGGSSETNPFASGAIRVYLRDVRECQAKARGIPYKKKKKKTVKARELKNQAR