; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012099 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012099
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationtig00153211:35355..39006
RNA-Seq ExpressionSgr012099
SyntenySgr012099
Gene Ontology termsNA
InterPro domainsIPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.1e-5239.94Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+ ++K+DR ++LLW+ LALPIL+ YKLEG+LTG  PCP  ++ +   + +++ EE              R +N  +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI--
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  K+ EYL +MK++ + L   G+PV  R LISQ+  GLDE +N VI  
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI--

Query:  -------------------EKRLEHQLIQK----------TSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHR
                           EK L+HQ  QK          +  L       ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +R
Subjt:  -------------------EKRLEHQLIQK----------TSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHR

Query:  FNKEFGHP
        FNKEF  P
Subjt:  FNKEFGHP

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]9.6e-4438.11Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+T++K+DR ++LLW+ LALPIL+ YKLEG+LT   PCP  ++ +   + +++ EE              R +NP +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  ++   + ++        + G P ++   + S++          + E
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE

Query:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP
        KRL+HQ  QK +T   TQ+ A        ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +RFNKEF  P
Subjt:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]9.6e-4438.11Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+T++K+DR ++LLW+ LALPIL+ YKLEG+LT   PCP  ++ +   + +++ EE              R +NP +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  ++   + ++        + G P ++   + S++          + E
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE

Query:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP
        KRL+HQ  QK +T   TQ+ A        ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +RFNKEF  P
Subjt:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]8.9e-5849.25Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYL------PNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMI
        F +PPL  LLNQITSIKMDR +FLLWQNLALPILRSYKL  YLTG KPCPP +L       N++ +TSS  + + +LNP YE W  +D+LLLGWLY SM 
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYL------PNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMI

Query:  LDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI---------
         D+A QVMGFST R+LW ++Q++FGV+S+A+ DYL+++FQQT KG+ ++ EYL LMKS ++ L LAG+ V++R L+SQ+  GLDEE+NP++         
Subjt:  LDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI---------

Query:  ------------EKRLEHQLIQKTS-TLGTQATATVHFVG--------TTSSGNFSKGGRGRGRGGRY
                    EKRLE+Q   K+   +    T +V++V          T++GN S G     RGG Y
Subjt:  ------------EKRLEHQLIQKTS-TLGTQATATVHFVG--------TTSSGNFSKGGRGRGRGGRY

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]1.9e-4741.67Show/hide
Query:  TSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSI-----------------------------VEETRSLNPKYETWQSIDQL
        T+IK+D+ ++LLW+NLALPILRSY+LEG+LTG  PCPP +    DQ+T+++                                   +NP YE+   +DQL
Subjt:  TSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSI-----------------------------VEETRSLNPKYETWQSIDQL

Query:  LLGWLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPV
        LLGWLY  M  ++A QVMG+   + LW +IQ++FG++S+A EDYLR++FQQT KG  K+ EYL +MK+ S+ L L G+PV  R L+SQ+  GLDEEFNP 
Subjt:  LLGWLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPV

Query:  IEKRLEHQLIQKTSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSN---NRLICQICGK
        +        I  T+      T  + F    ++ N  +GG GR RG R ++N   NR  CQ+C +
Subjt:  IEKRLEHQLIQKTSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSN---NRLICQICGK

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X14.6e-4438.11Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+T++K+DR ++LLW+ LALPIL+ YKLEG+LT   PCP  ++ +   + +++ EE              R +NP +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  ++   + ++        + G P ++   + S++          + E
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE

Query:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP
        KRL+HQ  QK +T   TQ+ A        ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +RFNKEF  P
Subjt:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X34.6e-4438.11Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+T++K+DR ++LLW+ LALPIL+ YKLEG+LT   PCP  ++ +   + +++ EE              R +NP +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  ++   + ++        + G P ++   + S++          + E
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNP-VTMRTLISQIFAGLDEEFNPVIE

Query:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP
        KRL+HQ  QK +T   TQ+ A        ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +RFNKEF  P
Subjt:  KRLEHQLIQKTSTLG-TQATA-------TVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHP

A0A5A7SIT7 Uncharacterized protein5.5e-5339.94Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG
        F  PPL  +LNQ+ ++K+DR ++LLW+ LALPIL+ YKLEG+LTG  PCP  ++ +   + +++ EE              R +N  +E W + D LLLG
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEE-------------TRSLNPKYETWQSIDQLLLG

Query:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI--
        WLY SM  D+A Q+MGF+ + DLW + Q  FGV+S+A+ED+LR++ Q TRKG  K+ EYL +MK++ + L   G+PV  R LISQ+  GLDE +N VI  
Subjt:  WLYKSMILDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI--

Query:  -------------------EKRLEHQLIQK----------TSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHR
                           EK L+HQ  QK          +  L       ++     S+  F    R    G R   NN   CQ+CGK  HSA +C +R
Subjt:  -------------------EKRLEHQLIQK----------TSTLGTQATATVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHR

Query:  FNKEFGHP
        FNKEF  P
Subjt:  FNKEFGHP

A0A5C7HHE9 Uncharacterized protein6.1e-4434.14Show/hide
Query:  PLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMILDIATQVMGF
        P GN LNQ  +IK+DR +F+LW+ +   I++ ++L+G+L  T+PCPP +LP+    T  + +     NP+YE W   DQLL+GWLY SM  ++A  VMG 
Subjt:  PLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMILDIATQVMGF

Query:  STLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI-------------------
        +T   LW++++ +FG  SK++ + +R   Q TRKG+  + EYL+ MK+ ++ L +AG+P     L + I AGLD E+ P++                   
Subjt:  STLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI-------------------

Query:  --EKRLEHQLIQKTSTLGTQATATVHFVGTTSSGNFSK--------------------------GGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRF
          + +LEH  I   S  G   ++    + T    N                             GGR RGRGGR  +N+R  CQ+CGK  HSA++C  R+
Subjt:  --EKRLEHQLIQKTSTLGTQATATVHFVGTTSSGNFSK--------------------------GGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRF

Query:  NKEFGHPSGQFKLAIS--GSTSAFLASPEVV
        +  +    G    A S   S S F+A+PE V
Subjt:  NKEFGHPSGQFKLAIS--GSTSAFLASPEVV

A0A6J1DCW4 uncharacterized protein LOC1110195984.3e-5849.25Show/hide
Query:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYL------PNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMI
        F +PPL  LLNQITSIKMDR +FLLWQNLALPILRSYKL  YLTG KPCPP +L       N++ +TSS  + + +LNP YE W  +D+LLLGWLY SM 
Subjt:  FGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYL------PNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMI

Query:  LDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI---------
         D+A QVMGFST R+LW ++Q++FGV+S+A+ DYL+++FQQT KG+ ++ EYL LMKS ++ L LAG+ V++R L+SQ+  GLDEE+NP++         
Subjt:  LDIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVI---------

Query:  ------------EKRLEHQLIQKTS-TLGTQATATVHFVG--------TTSSGNFSKGGRGRGRGGRY
                    EKRLE+Q   K+   +    T +V++V          T++GN S G     RGG Y
Subjt:  ------------EKRLEHQLIQKTS-TLGTQATATVHFVG--------TTSSGNFSKGGRGRGRGGRY

SwissProt top hitse value%identityAlignment
O82081 Uclacyanin 11.2e-0434.78Show/hide
Query:  HTIAEPQSQTDFDAC--VKPGFVF-DSIIFIAFDRPGRRYFICTVESHCEAGIKFSIDVLPKPGTTPNA
        H + E  ++ +FD+C  VKP   F +    +    PG+RYFIC +  HC  G+K  ++V+P     P A
Subjt:  HTIAEPQSQTDFDAC--VKPGFVF-DSIIFIAFDRPGRRYFICTVESHCEAGIKFSIDVLPKPGTTPNA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1422.54Show/hide
Query:  LNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMILDIATQVMGFSTLRD
        +N     K+  +++L+W      +   Y+L G+L G+   PP  +           +    +NP Y  W+  D+L+   +  ++ + +   V   +T   
Subjt:  LNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMILDIATQVMGFSTLRD

Query:  LWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVIEK-----------RLEHQLIQKTS
        +W++++KI+   S      LR   +Q  KGTK I +Y+  + +  + L L G P+     + ++   L EE+ PVI++            +  +L+   S
Subjt:  LWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVIEK-----------RLEHQLIQKTS

Query:  TLGTQATATVHFV--------GTTSSGNFSKGGRGRGRGGRYFSNNRL---------------------ICQICGKTRHSAAIC
         +   ++ATV  +         TT++ N + G R      R  +NN                        CQICG   HSA  C
Subjt:  TLGTQATATVHFV--------GTTSSGNFSKGGRGRGRGGRYFSNNRL---------------------ICQICGKTRHSAAIC

Arabidopsis top hitse value%identityAlignment
AT2G32300.1 uclacyanin 18.5e-0634.78Show/hide
Query:  HTIAEPQSQTDFDAC--VKPGFVF-DSIIFIAFDRPGRRYFICTVESHCEAGIKFSIDVLPKPGTTPNA
        H + E  ++ +FD+C  VKP   F +    +    PG+RYFIC +  HC  G+K  ++V+P     P A
Subjt:  HTIAEPQSQTDFDAC--VKPGFVF-DSIIFIAFDRPGRRYFICTVESHCEAGIKFSIDVLPKPGTTPNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCTTCTCAAGGCCCAAGCAGCCTTCAACCAAATGAACCCATTCGGCACTCCACCCCTTGGCAACCTACTAAACCAGATTACCTCCATTAAAATGGACAGATCGGA
TTTCCTCCTCTGGCAAAATCTTGCCCTTCCAATTCTCCGAAGTTACAAGCTAGAAGGATATCTCACGGGCACAAAGCCTTGTCCACCCATGTATCTTCCCAATCTAGATC
AAGCCACCAGCTCCATTGTTGAAGAAACACGATCACTTAATCCAAAATACGAGACGTGGCAGTCCATTGATCAGCTACTACTTGGGTGGCTTTACAAATCGATGATTCTC
GACATTGCAACTCAGGTAATGGGATTCTCAACTTTGAGAGATCTTTGGCAATCCATTCAAAAGATATTTGGTGTCGAATCCAAAGCCCAAGAAGATTACCTGCGTAAGCT
ATTTCAGCAAACACGCAAAGGTACCAAGAAAATCACTGAATATCTTTCATTAATGAAATCTGATTCTGAGGGTCTGACTTTAGCTGGTAATCCTGTGACAATGAGAACTT
TAATATCTCAAATTTTTGCTGGTCTTGATGAGGAATTTAACCCAGTTATTGAGAAACGACTTGAGCACCAACTAATTCAGAAAACTAGCACCTTGGGCACTCAGGCCACA
GCCACTGTCCACTTTGTGGGCACTACCAGCAGTGGCAACTTCTCCAAAGGGGGAAGAGGAAGAGGTCGCGGCGGACGATACTTTTCGAATAATCGGCTGATTTGTCAAAT
CTGTGGAAAGACAAGACATAGTGCAGCGATTTGCCGTCATCGGTTCAACAAGGAATTTGGACATCCTTCCGGTCAGTTTAAGCTAGCAATCAGTGGGTCCACCTCCGCTT
TTCTTGCCTCCCCAGAAGTGGTAGCAATCCTAATTGGATTCAAATCAAGAGCCAATAAGACTCACACCATTGCAGAACCACAATCTCAAACGGATTTTGATGCATGCGTC
AAGCCGGGATTCGTTTTCGATTCTATCATTTTCATTGCCTTTGATCGTCCTGGTCGTCGTTACTTTATCTGCACTGTTGAGAGCCATTGTGAAGCAGGCATAAAATTTTC
TATTGATGTTTTACCAAAACCTGGAACAACGCCAAATGCGGCTGTTAAAGTTGGTGCATTGCCTGCACTTCTCTTCACCACCATCCTGGCTAATTTTCTCTTCTTCATTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCTTCTCAAGGCCCAAGCAGCCTTCAACCAAATGAACCCATTCGGCACTCCACCCCTTGGCAACCTACTAAACCAGATTACCTCCATTAAAATGGACAGATCGGA
TTTCCTCCTCTGGCAAAATCTTGCCCTTCCAATTCTCCGAAGTTACAAGCTAGAAGGATATCTCACGGGCACAAAGCCTTGTCCACCCATGTATCTTCCCAATCTAGATC
AAGCCACCAGCTCCATTGTTGAAGAAACACGATCACTTAATCCAAAATACGAGACGTGGCAGTCCATTGATCAGCTACTACTTGGGTGGCTTTACAAATCGATGATTCTC
GACATTGCAACTCAGGTAATGGGATTCTCAACTTTGAGAGATCTTTGGCAATCCATTCAAAAGATATTTGGTGTCGAATCCAAAGCCCAAGAAGATTACCTGCGTAAGCT
ATTTCAGCAAACACGCAAAGGTACCAAGAAAATCACTGAATATCTTTCATTAATGAAATCTGATTCTGAGGGTCTGACTTTAGCTGGTAATCCTGTGACAATGAGAACTT
TAATATCTCAAATTTTTGCTGGTCTTGATGAGGAATTTAACCCAGTTATTGAGAAACGACTTGAGCACCAACTAATTCAGAAAACTAGCACCTTGGGCACTCAGGCCACA
GCCACTGTCCACTTTGTGGGCACTACCAGCAGTGGCAACTTCTCCAAAGGGGGAAGAGGAAGAGGTCGCGGCGGACGATACTTTTCGAATAATCGGCTGATTTGTCAAAT
CTGTGGAAAGACAAGACATAGTGCAGCGATTTGCCGTCATCGGTTCAACAAGGAATTTGGACATCCTTCCGGTCAGTTTAAGCTAGCAATCAGTGGGTCCACCTCCGCTT
TTCTTGCCTCCCCAGAAGTGGTAGCAATCCTAATTGGATTCAAATCAAGAGCCAATAAGACTCACACCATTGCAGAACCACAATCTCAAACGGATTTTGATGCATGCGTC
AAGCCGGGATTCGTTTTCGATTCTATCATTTTCATTGCCTTTGATCGTCCTGGTCGTCGTTACTTTATCTGCACTGTTGAGAGCCATTGTGAAGCAGGCATAAAATTTTC
TATTGATGTTTTACCAAAACCTGGAACAACGCCAAATGCGGCTGTTAAAGTTGGTGCATTGCCTGCACTTCTCTTCACCACCATCCTGGCTAATTTTCTCTTCTTCATTT
GA
Protein sequenceShow/hide protein sequence
MFLLKAQAAFNQMNPFGTPPLGNLLNQITSIKMDRSDFLLWQNLALPILRSYKLEGYLTGTKPCPPMYLPNLDQATSSIVEETRSLNPKYETWQSIDQLLLGWLYKSMIL
DIATQVMGFSTLRDLWQSIQKIFGVESKAQEDYLRKLFQQTRKGTKKITEYLSLMKSDSEGLTLAGNPVTMRTLISQIFAGLDEEFNPVIEKRLEHQLIQKTSTLGTQAT
ATVHFVGTTSSGNFSKGGRGRGRGGRYFSNNRLICQICGKTRHSAAICRHRFNKEFGHPSGQFKLAISGSTSAFLASPEVVAILIGFKSRANKTHTIAEPQSQTDFDACV
KPGFVFDSIIFIAFDRPGRRYFICTVESHCEAGIKFSIDVLPKPGTTPNAAVKVGALPALLFTTILANFLFFI