CuGenDBv2

Sgr017684 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017684
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: RPA-interacting protein
Genome location: tig00153054:770161..779533
RNA-Seq Expression: Sgr017684
Synteny: Sgr017684
Gene Ontology terms:
  GO:0006606 - protein import into nucleus (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains:
  IPR028155 - RPA-interacting protein, central domain
  IPR028156 - RPA-interacting protein
  IPR028158 - RPA-interacting protein, N-terminal domain
  IPR028159 - RPA-interacting protein, C-terminal domain


Homology
GenBank top hits | e value | % identity
KAG6573010.1 hypothetical protein SDJN03_26897, partial [Cucurbita argyrosperma subsp. sororia] | 6.3e-130 | 81.12
Query:  EEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNLTS
        E+D +SV QT R S+KTHPRYNN+QSWK+KLRENC KRVREGR+RLLWKMR    SP Y HSL++RQQDFI+S+FQDIFSDELKKIKD++ +   KNL S
Subjt:  EEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNLTS

Query:  APESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCT
        APESDDVLWEYEGL DAY GEGEEILLEMQRIFYEDLNVDLKQKESEGP VTWEDEEDEFLA AVYEHMQ+NSEK   K WCPICKQG+LQEN HFIHCT
Subjt:  APESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCT

Query:  YCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEVRTQPLTTFIGVSAAVRRRWVSAML
        +CG+QLNKGNEVTLDLLRSRLAD HAEHLDRGCRLKP FCVESRF+ITALYISC+GC+TFEV+TQPL T IGV+AAVRR W++AML
Subjt:  YCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEVRTQPLTTFIGVSAAVRRRWVSAML

KAG7012195.1 hypothetical protein SDJN02_24947 [Cucurbita argyrosperma subsp. argyrosperma] | 3.7e-130 | 80.9
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        MEE+D +SV QT R S+KTHPRYNN+QSWK+KLRENC KRVREGR+RLLWKMR    SP Y HSL++RQQDFI+S+FQDIFSDELKKIKD++ +   KNL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH
         SAPESDDVLWEYEGL DAY GEGEEILLEMQRIFYEDLNVDLKQKESEGP VTWEDEEDEFLA AVYEHMQ+NSEK   K WCPICKQG+LQEN HFIH
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH

Query:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEVRTQPLTTFIGVSAAVRRRWVSAML
        CT+CG+QLNKGNEVTLDLLR RLAD HAEHLDRGCRLKP FCVESRF+ITALYISC+GC+TFEV+TQPL T IGV+AAVRR W++AML
Subjt:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEVRTQPLTTFIGVSAAVRRRWVSAML

TYJ99094.1 uncharacterized protein E5676_scaffold248G002980 [Cucumis melo var. makuwa] | 2.5e-134 | 72.07
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        ME+++ N V QTRR S+KTHPRYNN QSWKQKLRENC KRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFI+SAFQDIFSDELKKIKD+S++ +N+NL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQK--------------------------------------------ESEGPNVTWE
         S PE+ DVLWEYEG+ DAYEG+GEEILLEMQRIFYEDLNVD++QK                                            ESEGP VTWE
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQK--------------------------------------------ESEGPNVTWE

Query:  DEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISC
        DEEDEFLA AVYEHMQ+++EK+ EKFWCP+CKQGELQEN+HFIHCT CGL+LNKGNEVTLDLLR RLAD HAEHLDRGCRLKP+FCVESRF+ITALYISC
Subjt:  DEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISC

Query:  EGCSTFEVRTQPLTTFIGVSAAVRRRWVSAMLE
        EGC+TFEV+TQPL TFIGV+A VRR+W+S MLE
Subjt:  EGCSTFEVRTQPLTTFIGVSAAVRRRWVSAMLE

XP_022137008.1 uncharacterized protein LOC111008573 isoform X1 [Momordica charantia] | 2.1e-133 | 86.25
Query:  KETVKMEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDG
        K+ VKMEEED NS  Q RRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLP+SSPTYPHSLNNRQQDF++SAFQDIFSDELKKIKDQSL+ 
Subjt:  KETVKMEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDG

Query:  YNKNLTSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQEN
        Y+  L SAPE DDVLWEYEG++DAYEGEGEEILLEMQRIFYEDL+VD++QK SEGP VTWEDEEDEFLA AVYEHMQ+NSEKVFEKFWCPICKQGEL EN
Subjt:  YNKNLTSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQEN

Query:  NHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        N  IHCT+CGLQL+K NEVT+DLLRSRLAD HAEHLDRGCRLKP FCVE+RFDITALYISCEGC TFE+
Subjt:  NHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

XP_038893884.1 uncharacterized protein LOC120082685 isoform X1 [Benincasa hispida] | 1.3e-130 | 85.61
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        ME++D NSV QT R S+KTHPRYNN QSWKQKLRENC KRVREGRSRLLWKMRLPM+SP+YPHSLNNRQQDFI+SAFQDIFSDELKKIKDQS++ YNKNL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH
         S PE+ DVLWEYEG++DAYEGEGEEILLEMQRIFYEDLN+DL+ KESEGP VTWEDEEDEFLA AVYEHM++NSEKV EKFWCPICKQGELQENNHFIH
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH

Query:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        CT+CGL+L KGNEVTLDLLR RLAD HAEHLDRGCRLKP FCVES+F+ITALYISCEGC+TFEV
Subjt:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

TrEMBL top hits | e value | % identity
A0A0A0LTY4 Uncharacterized protein | 1.7e-128 | 84.09
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        MEE+  N V QTRR S+KTHPRYNN QSWKQKLRENC KRVREGRSRLLWKMRLPMSSPTY HSLNNRQQD I+SAFQDIF+DELKKIKD+S++ YN+NL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH
         S PE+ DVLWEYEG+ DAYEG+GEEILLEMQRIFYEDLNVDL+QKESE P VTWEDEEDEFLA AVYEHMQ+++EK+ EKFWCP+CKQGELQENNHFIH
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH

Query:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        CT+CGL+LNKGNEVTLDLLR RLAD HAEHLDRGCRLKP+FCVESRF+ITALYISCEGC+TFEV
Subjt:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

A0A1S3AU18 uncharacterized protein LOC103482820 | 2.0e-129 | 84.09
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        ME+++ N V QTRR S+KTHPRYNN QSWKQKLRENC KRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFI+SAFQDIFSDELKKIKD+S++ +N+NL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH
         S PE+ DVLWEYEG+ DAYEG+GEEILLEMQRIFYEDLNVD++QKESEGP VTWEDEEDEFLA AVYEHMQ+++EK+ EKFWCP+CKQGELQEN+HFIH
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH

Query:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        CT CGL+LNKGNEVTLDLLR RLAD HAEHLDRGCRLKP+FCVESRF+ITALYISCEGC+TFEV
Subjt:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

A0A5D3BJ40 Uncharacterized protein | 1.2e-134 | 72.07
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        ME+++ N V QTRR S+KTHPRYNN QSWKQKLRENC KRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFI+SAFQDIFSDELKKIKD+S++ +N+NL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQK--------------------------------------------ESEGPNVTWE
         S PE+ DVLWEYEG+ DAYEG+GEEILLEMQRIFYEDLNVD++QK                                            ESEGP VTWE
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQK--------------------------------------------ESEGPNVTWE

Query:  DEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISC
        DEEDEFLA AVYEHMQ+++EK+ EKFWCP+CKQGELQEN+HFIHCT CGL+LNKGNEVTLDLLR RLAD HAEHLDRGCRLKP+FCVESRF+ITALYISC
Subjt:  DEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISC

Query:  EGCSTFEVRTQPLTTFIGVSAAVRRRWVSAMLE
        EGC+TFEV+TQPL TFIGV+A VRR+W+S MLE
Subjt:  EGCSTFEVRTQPLTTFIGVSAAVRRRWVSAMLE

A0A6J1C935 uncharacterized protein LOC111008573 isoform X1 | 1.0e-133 | 86.25
Query:  KETVKMEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDG
        K+ VKMEEED NS  Q RRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLP+SSPTYPHSLNNRQQDF++SAFQDIFSDELKKIKDQSL+ 
Subjt:  KETVKMEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDG

Query:  YNKNLTSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQEN
        Y+  L SAPE DDVLWEYEG++DAYEGEGEEILLEMQRIFYEDL+VD++QK SEGP VTWEDEEDEFLA AVYEHMQ+NSEKVFEKFWCPICKQGEL EN
Subjt:  YNKNLTSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQEN

Query:  NHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        N  IHCT+CGLQL+K NEVT+DLLRSRLAD HAEHLDRGCRLKP FCVE+RFDITALYISCEGC TFE+
Subjt:  NHFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

A0A6J1GT42 uncharacterized protein LOC111457192 isoform X1 | 2.2e-120 | 81.82
Query:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL
        MEE+D +SV QT R S+KTHPRYNN+QSWK+KLRENC KRVREGR+RLLWKMR    SP Y HSL++RQQDFI+S+FQDIFSDELKKIKD++ +   KNL
Subjt:  MEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQSLDGYNKNL

Query:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH
         SAPESDDVLWEYEGL DAY GEGEEILLEMQRIFYEDLNVDLKQKESEGP VTWEDEEDEFLA AVYEHMQ+NSEK   K WCPICKQG+LQEN HFIH
Subjt:  TSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHFIH

Query:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        CT+CG+QLNKGNEVTLDLLRSRLAD HAEHLDRGCRL+P FCVESRF+ITALYISC+GC+TFEV
Subjt:  CTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT4G12760.1 unknown protein | 3.6e-67 | 53.01
Query:  EDLNSVAQT--RRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQS--LDGYNKN
        E L   A T  +R S K+ P + +Y   KQK RENC++RVRE R+RLLWK+R+           ++ Q++ I  AFQDI SDELKKI+D S  L G NK 
Subjt:  EDLNSVAQT--RRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELKKIKDQS--LDGYNKN

Query:  LTSAPESDDVLWEY-EGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHF
        LT   + DD+LWEY EGL+  YEG+ EEILLEMQ+IFY+DL  +     S     TWEDEED++LA  V ++M +NSE+   + WCPICK+GEL EN+  
Subjt:  LTSAPESDDVLWEY-EGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENNHF

Query:  IHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV
        I C  C +QLNKG EV L++L+ RLA+AH EHL RGCRLKP F V+S +++ ALYI+CE C TFEV
Subjt:  IHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEV


Sequences
CDS sequence
ATGTCACCGCTTCGAGACGAAGACATTGACGAGGGCAAATTTCGGCATAGCAGTCGAAAGGAAACAGTCAAAATGGAGGAGGAGGATCTGAATTCCGTCGCTCAAACCCG
TCGTCCTTCGCTTAAGACCCATCCTCGTTACAACAATTATCAGTCATGGAAGCAAAAGCTGAGGGAGAACTGTTACAAGAGGGTCCGAGAAGGAAGAAGTCGCTTGCTCT
GGAAGATGAGGCTGCCCATGTCCTCGCCCACTTACCCCCACTCTCTCAATAATCGACAGCAGGATTTTATCCAATCTGCTTTTCAAGACATCTTTTCTGATGAGCTGAAA
AAGATTAAAGACCAATCTTTGGATGGCTATAATAAGAATTTAACCTCTGCCCCCGAGTCTGATGATGTTCTTTGGGAATATGAGGGGCTTCGGGATGCTTATGAAGGTGA
AGGTGAAGAAATATTGTTAGAAATGCAAAGGATTTTTTATGAGGATCTCAACGTGGATCTAAAACAAAAAGAATCAGAAGGCCCTAATGTAACATGGGAAGATGAAGAAG
ACGAGTTCTTGGCCCATGCAGTTTATGAACATATGCAAATTAACAGTGAGAAGGTTTTTGAGAAGTTCTGGTGTCCTATTTGTAAACAAGGAGAGCTGCAAGAGAACAAC
CACTTCATACATTGCACTTATTGTGGACTGCAGCTTAACAAAGGCAATGAGGTTACTCTGGACCTCCTCCGTTCTCGGTTGGCCGATGCGCATGCTGAACACCTCGATCG
GGGTTGCAGATTGAAGCCTAGGTTTTGTGTTGAGAGTAGATTTGACATAACTGCATTATACATCTCTTGTGAAGGTTGCAGCACATTTGAGGTTCGGACTCAGCCGCTCA
CCACCTTCATCGGAGTTAGTGCCGCCGTCCGCCGCCGCTGGGTCTCCGCCATGCTTGAGCTTGATAATGGCGCCACAGATTCTAAGTCCCATTACATTCAACTTCCTTCA
GGACATAGAGAAGAAGAACATTCTGGCAGTTCAGACGCTGAGGAACATGATCATGGGGTCGAGCCTGATGGCCACCACCTCGATCCTCCTCTGCGCCGGCCTCGCCGCCG
TCCTCAGCAGCACTTACAGCATAAAAAAGCCGCTCACCGACGCCGTCTACGGAGCCCATGGAGAGTTCACGGTGGCTCTGAAGTTCGTGACGATCTTGACCATCTTTGTG
TTTTCGTTCTTCTGCCACAGCTTGTCGATCAGGTTTCTGAACCAGGCGAGCCTTCTCATCAGCGCGCCATTGCACCCCTTCTCCGTATTAACGGAGGAGCATCTGGTGGA
TCTTCTGGACAAGGGGTGCCTCCTGAACATCGTCGGAAACAGGCTCTTCTACGCTGCTCTGCCGCTGGTGCTTTGGACATGCGGACCGCTGCTGGTTTTCTTGTGCTTTG
CGGTTATGGATTAAAATGGGCAAAAGGGTCTGTCTATTTCATCCCTATGAGACGGAATTCTGAAGTGCCTCAATCTTCCTTCCTGTTTCTTCATGTAACCCCTACACAGA
AACTCCTTCGAGAACTTATCTTCCACTTTCCTATTCACATCATGAACAAACACGTCCGTTTCCCCCTCCTCCCTGTTCCTCGCCATCATCCCAGCCGTGTAAATCGCCGT
CATCCTCCCCGGCGCCTCCTCGTAATACCCCTACCTTGGGTCGCCGACGGCCGTGCATTCCGGTCCCCGGCCGACGTCCATCAGGCCGTCGGCCTCTTTAACTTTGCTGT
CGTAACTGACGTGGAAGATGGTCCGGCCGCCGTAGTTGAGGGTGCTCCACATGAGGCTGTCGTGGCCGAGGCCGAAGATGAGGAAGTTGCATGGCGACCGCTCGTCCAGA
ACTCTGGCGGCGACGGAGATTTCTTTCAGAGTTTGCTGCGGCGTGATGGTGGAGGTCGAGTAGTGGATTAG
mRNA sequence
ATGTCACCGCTTCGAGACGAAGACATTGACGAGGGCAAATTTCGGCATAGCAGTCGAAAGGAAACAGTCAAAATGGAGGAGGAGGATCTGAATTCCGTCGCTCAAACCCG
TCGTCCTTCGCTTAAGACCCATCCTCGTTACAACAATTATCAGTCATGGAAGCAAAAGCTGAGGGAGAACTGTTACAAGAGGGTCCGAGAAGGAAGAAGTCGCTTGCTCT
GGAAGATGAGGCTGCCCATGTCCTCGCCCACTTACCCCCACTCTCTCAATAATCGACAGCAGGATTTTATCCAATCTGCTTTTCAAGACATCTTTTCTGATGAGCTGAAA
AAGATTAAAGACCAATCTTTGGATGGCTATAATAAGAATTTAACCTCTGCCCCCGAGTCTGATGATGTTCTTTGGGAATATGAGGGGCTTCGGGATGCTTATGAAGGTGA
AGGTGAAGAAATATTGTTAGAAATGCAAAGGATTTTTTATGAGGATCTCAACGTGGATCTAAAACAAAAAGAATCAGAAGGCCCTAATGTAACATGGGAAGATGAAGAAG
ACGAGTTCTTGGCCCATGCAGTTTATGAACATATGCAAATTAACAGTGAGAAGGTTTTTGAGAAGTTCTGGTGTCCTATTTGTAAACAAGGAGAGCTGCAAGAGAACAAC
CACTTCATACATTGCACTTATTGTGGACTGCAGCTTAACAAAGGCAATGAGGTTACTCTGGACCTCCTCCGTTCTCGGTTGGCCGATGCGCATGCTGAACACCTCGATCG
GGGTTGCAGATTGAAGCCTAGGTTTTGTGTTGAGAGTAGATTTGACATAACTGCATTATACATCTCTTGTGAAGGTTGCAGCACATTTGAGGTTCGGACTCAGCCGCTCA
CCACCTTCATCGGAGTTAGTGCCGCCGTCCGCCGCCGCTGGGTCTCCGCCATGCTTGAGCTTGATAATGGCGCCACAGATTCTAAGTCCCATTACATTCAACTTCCTTCA
GGACATAGAGAAGAAGAACATTCTGGCAGTTCAGACGCTGAGGAACATGATCATGGGGTCGAGCCTGATGGCCACCACCTCGATCCTCCTCTGCGCCGGCCTCGCCGCCG
TCCTCAGCAGCACTTACAGCATAAAAAAGCCGCTCACCGACGCCGTCTACGGAGCCCATGGAGAGTTCACGGTGGCTCTGAAGTTCGTGACGATCTTGACCATCTTTGTG
TTTTCGTTCTTCTGCCACAGCTTGTCGATCAGGTTTCTGAACCAGGCGAGCCTTCTCATCAGCGCGCCATTGCACCCCTTCTCCGTATTAACGGAGGAGCATCTGGTGGA
TCTTCTGGACAAGGGGTGCCTCCTGAACATCGTCGGAAACAGGCTCTTCTACGCTGCTCTGCCGCTGGTGCTTTGGACATGCGGACCGCTGCTGGTTTTCTTGTGCTTTG
CGGTTATGGATTAAAATGGGCAAAAGGGTCTGTCTATTTCATCCCTATGAGACGGAATTCTGAAGTGCCTCAATCTTCCTTCCTGTTTCTTCATGTAACCCCTACACAGA
AACTCCTTCGAGAACTTATCTTCCACTTTCCTATTCACATCATGAACAAACACGTCCGTTTCCCCCTCCTCCCTGTTCCTCGCCATCATCCCAGCCGTGTAAATCGCCGT
CATCCTCCCCGGCGCCTCCTCGTAATACCCCTACCTTGGGTCGCCGACGGCCGTGCATTCCGGTCCCCGGCCGACGTCCATCAGGCCGTCGGCCTCTTTAACTTTGCTGT
CGTAACTGACGTGGAAGATGGTCCGGCCGCCGTAGTTGAGGGTGCTCCACATGAGGCTGTCGTGGCCGAGGCCGAAGATGAGGAAGTTGCATGGCGACCGCTCGTCCAGA
ACTCTGGCGGCGACGGAGATTTCTTTCAGAGTTTGCTGCGGCGTGATGGTGGAGGTCGAGTAGTGGATTAG
Protein sequence
MSPLRDEDIDEGKFRHSSRKETVKMEEEDLNSVAQTRRPSLKTHPRYNNYQSWKQKLRENCYKRVREGRSRLLWKMRLPMSSPTYPHSLNNRQQDFIQSAFQDIFSDELK
KIKDQSLDGYNKNLTSAPESDDVLWEYEGLRDAYEGEGEEILLEMQRIFYEDLNVDLKQKESEGPNVTWEDEEDEFLAHAVYEHMQINSEKVFEKFWCPICKQGELQENN
HFIHCTYCGLQLNKGNEVTLDLLRSRLADAHAEHLDRGCRLKPRFCVESRFDITALYISCEGCSTFEVRTQPLTTFIGVSAAVRRRWVSAMLELDNGATDSKSHYIQLPS
GHREEEHSGSSDAEEHDHGVEPDGHHLDPPLRRPRRRPQQHLQHKKAAHRRRLRSPWRVHGGSEVRDDLDHLCVFVLLPQLVDQVSEPGEPSHQRAIAPLLRINGGASGG
SSGQGVPPEHRRKQALLRCSAAGALDMRTAAGFLVLCGYGLKWAKGSVYFIPMRRNSEVPQSSFLFLHVTPTQKLLRELIFHFPIHIMNKHVRFPLLPVPRHHPSRVNRR
HPPRRLLVIPLPWVADGRAFRSPADVHQAVGLFNFAVVTDVEDGPAAVVEGAPHEAVVAEAEDEEVAWRPLVQNSGGDGDFFQSLLRRDGGGRVVD