; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022629 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022629
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein GL2-INTERACTING REPRESSOR 1
Genome locationtig00000289:1751957..1757040
RNA-Seq ExpressionSgr022629
SyntenySgr022629
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016354.1 hypothetical protein SDJN02_21462, partial [Cucurbita argyrosperma subsp. argyrosperma]5.8e-8157.58Show/hide
Query:  LFPICSSNDQQALYLYLKAEDMCEKMTVEVEKPRKETLGL--------SLIPISAKGLSI---ILPINYYGRRGRSMSADVMWLLGPRHIWFLGCISRRG
        L P CSSND Q+L+L    EDM EK  +EVEK +K  + +         +   S   ++I   I+ +  +    +       WL+G R I      S R 
Subjt:  LFPICSSNDQQALYLYLKAEDMCEKMTVEVEKPRKETLGL--------SLIPISAKGLSI---ILPINYYGRRGRSMSADVMWLLGPRHIWFLGCISRRG

Query:  ERAVVL------------------------MAGTVLETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSS
          +V++                        M     ETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS
Subjt:  ERAVVL------------------------MAGTVLETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSS

Query:  NMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMV
         +SMKQSEV D KETKRRYAA MG T NEN AYLDLKLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMV
Subjt:  NMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMV

Query:  TDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        TD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  TDADPRCPKCKNSGLLDIFRGNHAKRSRKN

XP_022939375.1 uncharacterized protein LOC111445310 isoform X3 [Cucurbita moschata]3.5e-7882.05Show/hide
Query:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD
        LETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLD
Subjt:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD

Query:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        LKLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

XP_022939376.1 uncharacterized protein LOC111445310 isoform X4 [Cucurbita moschata]3.5e-7882.05Show/hide
Query:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD
        LETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLD
Subjt:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD

Query:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        LKLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

XP_022993469.1 uncharacterized protein LOC111489470 [Cucurbita maxima]1.9e-7962.46Show/hide
Query:  MCEKMTVEVEKPRKETLGLSLIPISAKGLSIILPINYYGRRGR-SMSADVMWLLGPRHIWFLG----------CISRRGERAVVLMAGTVLETTQYLEHA
        M EK  +EVEK +K            + + ++   ++  R G  S +   + +L  RH+W L            ++RR    V+   G   ETTQYLE +
Subjt:  MCEKMTVEVEKPRKETLGLSLIPISAKGLSIILPINYYGRRGR-SMSADVMWLLGPRHIWFLG----------CISRRGERAVVLMAGTVLETTQYLEHA

Query:  GDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYL
        GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+  SS +SMKQSEV DKKETKRRYAA MG T NEN AYLDLKLSPPGVYL
Subjt:  GDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYL

Query:  RGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        RGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  RGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

XP_038885898.1 uncharacterized protein LOC120076204 [Benincasa hispida]3.2e-7982.23Show/hide
Query:  VLETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYL
        +LETTQYLEHAGDAKD CGG D ES   S E+FSLS+DQLNQDF+KSLVLK+S    S IEL DSSN++MKQSEVPDKKETKRRYA EMG   NENDAYL
Subjt:  VLETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYL

Query:  DLKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSE-NNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        DLKLSPPGVY RGK SNESK SSPRSQDSC+SAEVE NVNSE NNLRVE SPLIVMGCTFCLLYVMVTDADPRCPKCKN GLLD+FRGN  KRSRKN
Subjt:  DLKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSE-NNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

TrEMBL top hitse value%identityAlignment
A0A6J1FFQ0 uncharacterized protein LOC111445310 isoform X41.7e-7882.05Show/hide
Query:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD
        LETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLD
Subjt:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD

Query:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        LKLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

A0A6J1FGZ2 uncharacterized protein LOC111445310 isoform X25.0e-7881.96Show/hide
Query:  ETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDL
        ETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLDL
Subjt:  ETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDL

Query:  KLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        KLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  KLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

A0A6J1FLG8 uncharacterized protein LOC111445310 isoform X15.0e-7881.96Show/hide
Query:  ETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDL
        ETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLDL
Subjt:  ETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDL

Query:  KLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        KLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  KLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

A0A6J1FMI8 uncharacterized protein LOC111445310 isoform X31.7e-7882.05Show/hide
Query:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD
        LETTQYLE +GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+ DSS +SMKQSEV DKKETKRRYAA MG T NEN AYLD
Subjt:  LETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLD

Query:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        LKLSPPGVYLRGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  LKLSPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

A0A6J1K099 uncharacterized protein LOC1114894709.1e-8062.46Show/hide
Query:  MCEKMTVEVEKPRKETLGLSLIPISAKGLSIILPINYYGRRGR-SMSADVMWLLGPRHIWFLG----------CISRRGERAVVLMAGTVLETTQYLEHA
        M EK  +EVEK +K            + + ++   ++  R G  S +   + +L  RH+W L            ++RR    V+   G   ETTQYLE +
Subjt:  MCEKMTVEVEKPRKETLGLSLIPISAKGLSIILPINYYGRRGR-SMSADVMWLLGPRHIWFLG----------CISRRGERAVVLMAGTVLETTQYLEHA

Query:  GDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYL
        GDAKDC GG D ES + S EEFS S+DQLNQ+F+KSLVLKSSS  +SPIE+  SS +SMKQSEV DKKETKRRYAA MG T NEN AYLDLKLSPPGVYL
Subjt:  GDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKLSPPGVYL

Query:  RGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN
        RGKSSNESK SSP+SQDSC+SAEVE NVN ENNL VEGSPLIVMGCTFCLLYVMVTD DPRCP CKNSGLLDIFRGN  KRSRKN
Subjt:  RGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN

SwissProt top hitse value%identityAlignment
Q9FNI1 Protein GL2-INTERACTING REPRESSOR 12.8e-0938.1Show/hide
Query:  MTRNENDAYLDLKLSPPGVYLRG--KSSNESKPSSPRS-QDSCLSAEVELNVNSENNLRVEGSP----LIVMGCTFCLLYVMVTDADPRCPKCKNSGLLD
        M+R      L L LSPP    R   +S + S  +SP S   SC+S+E+      E ++R   SP    ++++GC  CL+YVM+++ DP+CPKCK++ LLD
Subjt:  MTRNENDAYLDLKLSPPGVYLRG--KSSNESKPSSPRS-QDSCLSAEVELNVNSENNLRVEGSP----LIVMGCTFCLLYVMVTDADPRCPKCKNSGLLD

Query:  IFRGN
            N
Subjt:  IFRGN

Q9SRN4 Protein GL2-INTERACTING REPRESSOR 22.8e-0939.22Show/hide
Query:  RNENDAYLDLK--LSPPGVYLRG----KSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFR
        RN+N   L+L+  LSPP          +S N S  +SP    SC+S+E     N E N  +  + ++++GC  CL+YVM++D DP+CPKCK++ LLD  +
Subjt:  RNENDAYLDLK--LSPPGVYLRG----KSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFR

Query:  GN
         N
Subjt:  GN

Arabidopsis top hitse value%identityAlignment
AT3G11600.1 unknown protein2.0e-1039.22Show/hide
Query:  RNENDAYLDLK--LSPPGVYLRG----KSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFR
        RN+N   L+L+  LSPP          +S N S  +SP    SC+S+E     N E N  +  + ++++GC  CL+YVM++D DP+CPKCK++ LLD  +
Subjt:  RNENDAYLDLK--LSPPGVYLRG----KSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFR

Query:  GN
         N
Subjt:  GN

AT3G52561.1 unknown protein1.5e-0538.96Show/hide
Query:  VYLRGKSSNESKPS-SPRSQDSCL--SAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL
        V   G+    S  S +  SQ+SCL  + EV+  V S      E   ++VMGC  C++YVMV     RCPKCK + L+
Subjt:  VYLRGKSSNESKPS-SPRSQDSCL--SAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLL

AT5G06270.1 unknown protein2.0e-1038.1Show/hide
Query:  MTRNENDAYLDLKLSPPGVYLRG--KSSNESKPSSPRS-QDSCLSAEVELNVNSENNLRVEGSP----LIVMGCTFCLLYVMVTDADPRCPKCKNSGLLD
        M+R      L L LSPP    R   +S + S  +SP S   SC+S+E+      E ++R   SP    ++++GC  CL+YVM+++ DP+CPKCK++ LLD
Subjt:  MTRNENDAYLDLKLSPPGVYLRG--KSSNESKPSSPRS-QDSCLSAEVELNVNSENNLRVEGSP----LIVMGCTFCLLYVMVTDADPRCPKCKNSGLLD

Query:  IFRGN
            N
Subjt:  IFRGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATCTTTAGCGGCAATCTCAACTCTGTCCTCTTCCCCCACTTTCTCCTCTTCCCAATTTGCTCATCAAATGATCAGCAAGCGCTCTATCTCTACCTGAAGGCTGA
AGATATGTGCGAGAAAATGACGGTGGAAGTGGAGAAACCAAGAAAAGAAACACTCGGGTTGTCTTTAATTCCAATTTCTGCAAAAGGGTTGTCGATTATTTTACCAATAA
ATTACTACGGAAGGCGAGGCCGATCAATGTCTGCGGACGTGATGTGGCTTTTGGGTCCTCGGCATATATGGTTTCTCGGCTGCATCAGTCGGCGAGGAGAGAGGGCCGTG
GTGCTGATGGCCGGCACCGTACTGGAAACCACTCAATATCTGGAGCATGCTGGTGATGCAAAGGATTGTTGTGGTGGTAGTGACGCAGAGAGCCATGTTCACAGCCTTGA
GGAGTTCTCCTTGAGCATGGATCAACTCAATCAAGACTTTGACAAATCTCTCGTTCTTAAAAGTTCATCGGTTTATGCATCTCCAATTGAGTTAAACGATAGTTCTAATA
TGTCTATGAAGCAATCAGAAGTTCCTGATAAGAAGGAAACTAAAAGAAGGTATGCAGCAGAAATGGGAATGACTCGTAATGAAAATGATGCATATTTGGATCTCAAACTA
TCACCTCCAGGGGTCTACTTAAGAGGCAAATCGTCAAATGAATCGAAACCTTCATCCCCAAGATCTCAAGACTCATGCTTATCTGCAGAGGTCGAGTTAAATGTGAACTC
GGAGAATAACCTCCGAGTTGAAGGTTCGCCATTGATTGTGATGGGATGTACTTTTTGCCTGCTTTACGTGATGGTAACGGATGCAGATCCCAGATGCCCTAAATGCAAAA
ATTCTGGCTTGCTCGACATTTTTCGTGGAAATCATGCAAAGAGATCAAGAAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTATCTTTAGCGGCAATCTCAACTCTGTCCTCTTCCCCCACTTTCTCCTCTTCCCAATTTGCTCATCAAATGATCAGCAAGCGCTCTATCTCTACCTGAAGGCTGA
AGATATGTGCGAGAAAATGACGGTGGAAGTGGAGAAACCAAGAAAAGAAACACTCGGGTTGTCTTTAATTCCAATTTCTGCAAAAGGGTTGTCGATTATTTTACCAATAA
ATTACTACGGAAGGCGAGGCCGATCAATGTCTGCGGACGTGATGTGGCTTTTGGGTCCTCGGCATATATGGTTTCTCGGCTGCATCAGTCGGCGAGGAGAGAGGGCCGTG
GTGCTGATGGCCGGCACCGTACTGGAAACCACTCAATATCTGGAGCATGCTGGTGATGCAAAGGATTGTTGTGGTGGTAGTGACGCAGAGAGCCATGTTCACAGCCTTGA
GGAGTTCTCCTTGAGCATGGATCAACTCAATCAAGACTTTGACAAATCTCTCGTTCTTAAAAGTTCATCGGTTTATGCATCTCCAATTGAGTTAAACGATAGTTCTAATA
TGTCTATGAAGCAATCAGAAGTTCCTGATAAGAAGGAAACTAAAAGAAGGTATGCAGCAGAAATGGGAATGACTCGTAATGAAAATGATGCATATTTGGATCTCAAACTA
TCACCTCCAGGGGTCTACTTAAGAGGCAAATCGTCAAATGAATCGAAACCTTCATCCCCAAGATCTCAAGACTCATGCTTATCTGCAGAGGTCGAGTTAAATGTGAACTC
GGAGAATAACCTCCGAGTTGAAGGTTCGCCATTGATTGTGATGGGATGTACTTTTTGCCTGCTTTACGTGATGGTAACGGATGCAGATCCCAGATGCCCTAAATGCAAAA
ATTCTGGCTTGCTCGACATTTTTCGTGGAAATCATGCAAAGAGATCAAGAAAGAACTAG
Protein sequenceShow/hide protein sequence
MGIFSGNLNSVLFPHFLLFPICSSNDQQALYLYLKAEDMCEKMTVEVEKPRKETLGLSLIPISAKGLSIILPINYYGRRGRSMSADVMWLLGPRHIWFLGCISRRGERAV
VLMAGTVLETTQYLEHAGDAKDCCGGSDAESHVHSLEEFSLSMDQLNQDFDKSLVLKSSSVYASPIELNDSSNMSMKQSEVPDKKETKRRYAAEMGMTRNENDAYLDLKL
SPPGVYLRGKSSNESKPSSPRSQDSCLSAEVELNVNSENNLRVEGSPLIVMGCTFCLLYVMVTDADPRCPKCKNSGLLDIFRGNHAKRSRKN