; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017872 (gene) of Snake gourd v1 genome

Gene IDTan0017872
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionZyxin-like
Genome locationLG10:1129966..1132425
RNA-Seq ExpressionTan0017872
SyntenyTan0017872
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133795.1 DNA-directed RNA polymerase II subunit RPB1 [Cucumis sativus]5.6e-6963.21Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK
        MAGR FGR LYRFSS NRP AP  NTS QD  QYDGRRYPS+ +DTS E         LR    PL  SPT+S+KKA SPPSS PSYRA   R I SP+K
Subjt:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK

Query:  PIDEYPNYKPTTRSRSPEVEPKSMVY--KAIEKVTKSDHHVGSGKTTTSSHK--QQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--
         +DEYP YKP T+ RSPE + K  ++    +EKVTKSD +  S K T SSHK   QPNAINI GEN+GAVMEIVE SKREGGH+IKK KET RGILN+  
Subjt:  PIDEYPNYKPTTRSRSPEVEPKSMVY--KAIEKVTKSDHHVGSGKTTTSSHK--QQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--

Query:  ITNDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDD-KKQQHKKY
        + NDQNNE SK          +SMP +TFLNSNFQSVNNSLLY A+L HRDPGLHL+FSRNPTG+R   DD KKQ H +Y
Subjt:  ITNDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDD-KKQQHKKY

XP_008437848.1 PREDICTED: uncharacterized protein LOC103483156 [Cucumis melo]7.0e-7263.18Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK
        MAGR FGR +YRFSS NRP AP  NTS QD  QYD R+YPSAT+DTS E         LR    PL  SPT+S+KKA SPPSSPP YR    R I SP K
Subjt:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK

Query:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT
         +DEYP YKP T+ RSPE + K  ++K   +EKVTKSD +    K  +S   QQPNAINI GEN+GAVMEIVE SKREGGH+IKK KET RGILN+  + 
Subjt:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT

Query:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        NDQNNE SK          +SMP +TFLNSNFQSVNNSLLY A+L +RDPGLHLSFSRNPTGDRF  DDKKQ H KY
Subjt:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

XP_022147459.1 serine/arginine repetitive matrix protein 1-like [Momordica charantia]6.3e-8165.82Show/hide
Query:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPL--------LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPID
        MAGR+GRS YRFSSVNRP APGGNTS+QD  QYDGR+YPSAT+D+S E R    P          SPT+SVKKAASPPSSPP YRA   R + SP + +D
Subjt:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPL--------LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPID

Query:  EYPNYKPTTRSRSPEVEPKSMVYKAIEK-VTKSDHHVGSGKTTTSSHKQQ--PNAINISGENLGAVMEIVES-KREGGHVIKKKETVRGILNSITNDQNN
        EYP YKPTT+ RSPE + K ++YKAIEK  TKSD +  +GKTT+S  +QQ  PNAINISGENLGAVMEIV+S KREGGH+I+KKE+    L+S  NDQNN
Subjt:  EYPNYKPTTRSRSPEVEPKSMVYKAIEK-VTKSDHHVGSGKTTTSSHKQQ--PNAINISGENLGAVMEIVES-KREGGHVIKKKETVRGILNSITNDQNN

Query:  EDSKDHK---AEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        E SKD K   A    + +S+PA+TFLNSNFQSVNNSLLY ASLAHRDPGLHL+F+RNP GDRF  DDKK  H KY
Subjt:  EDSKDHK---AEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

XP_038879892.1 uncharacterized protein At1g10890-like isoform X1 [Benincasa hispida]9.2e-7263.04Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAP-GGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLL----------SPTFSVKKAASPPSSPPSYRASTVRMIRSPS
        MAGR FGR LYRFSS NRP AP   NTS QD  QYDGR+Y SAT+DTS E R    P L          SPT+S+KKA SPP SPP YRA   R+I SP 
Subjt:  MAGR-FGRSLYRFSSVNRPAAP-GGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLL----------SPTFSVKKAASPPSSPPSYRASTVRMIRSPS

Query:  KPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREG-GHVIKKKETVRGILN--SITN
        K +DEY  YKPTT+ RSPE + K  + K ++KVTKSD H  S KT +S   QQPNAINI G+N+GAVMEIVE SKREG GHVIKKKET R +LN  +  N
Subjt:  KPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREG-GHVIKKKETVRGILN--SITN

Query:  DQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        DQ NE SK          +SMP STFLNSNFQSVNNSLL+ A+LAHRDPGLHL+FS NPTGDR T DDKKQ H KY
Subjt:  DQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

XP_038879901.1 uncharacterized protein LOC120071611 isoform X2 [Benincasa hispida]8.9e-6761.23Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAP-GGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLL----------SPTFSVKKAASPPSSPPSYRASTVRMIRSPS
        MAGR FGR LYRFSS NRP AP   NTS QD  QYDGR+Y SAT+DTS E R    P L          SPT+S+KKA SPP SPP YRA   R+I SP 
Subjt:  MAGR-FGRSLYRFSSVNRPAAP-GGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLL----------SPTFSVKKAASPPSSPPSYRASTVRMIRSPS

Query:  KPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREG-GHVIKKKETVRGILN--SITN
        K +DEY        S+SPE + K  + K ++KVTKSD H  S KT +S   QQPNAINI G+N+GAVMEIVE SKREG GHVIKKKET R +LN  +  N
Subjt:  KPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREG-GHVIKKKETVRGILN--SITN

Query:  DQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        DQ NE SK          +SMP STFLNSNFQSVNNSLL+ A+LAHRDPGLHL+FS NPTGDR T DDKKQ H KY
Subjt:  DQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

TrEMBL top hitse value%identityAlignment
A0A0A0L3R5 Uncharacterized protein2.7e-6963.21Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK
        MAGR FGR LYRFSS NRP AP  NTS QD  QYDGRRYPS+ +DTS E         LR    PL  SPT+S+KKA SPPSS PSYRA   R I SP+K
Subjt:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK

Query:  PIDEYPNYKPTTRSRSPEVEPKSMVY--KAIEKVTKSDHHVGSGKTTTSSHK--QQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--
         +DEYP YKP T+ RSPE + K  ++    +EKVTKSD +  S K T SSHK   QPNAINI GEN+GAVMEIVE SKREGGH+IKK KET RGILN+  
Subjt:  PIDEYPNYKPTTRSRSPEVEPKSMVY--KAIEKVTKSDHHVGSGKTTTSSHK--QQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--

Query:  ITNDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDD-KKQQHKKY
        + NDQNNE SK          +SMP +TFLNSNFQSVNNSLLY A+L HRDPGLHL+FSRNPTG+R   DD KKQ H +Y
Subjt:  ITNDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDD-KKQQHKKY

A0A1S3AVL1 uncharacterized protein LOC1034831563.4e-7263.18Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK
        MAGR FGR +YRFSS NRP AP  NTS QD  QYD R+YPSAT+DTS E         LR    PL  SPT+S+KKA SPPSSPP YR    R I SP K
Subjt:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK

Query:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT
         +DEYP YKP T+ RSPE + K  ++K   +EKVTKSD +    K  +S   QQPNAINI GEN+GAVMEIVE SKREGGH+IKK KET RGILN+  + 
Subjt:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT

Query:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        NDQNNE SK          +SMP +TFLNSNFQSVNNSLLY A+L +RDPGLHLSFSRNPTGDRF  DDKKQ H KY
Subjt:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

A0A5A7U5I8 Zyxin-like3.4e-7263.18Show/hide
Query:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK
        MAGR FGR +YRFSS NRP AP  NTS QD  QYD R+YPSAT+DTS E         LR    PL  SPT+S+KKA SPPSSPP YR    R I SP K
Subjt:  MAGR-FGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDE---------LRPTRSPL-LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSK

Query:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT
         +DEYP YKP T+ RSPE + K  ++K   +EKVTKSD +    K  +S   QQPNAINI GEN+GAVMEIVE SKREGGH+IKK KET RGILN+  + 
Subjt:  PIDEYPNYKPTTRSRSPEVEPKSMVYK--AIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKK-KETVRGILNS--IT

Query:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        NDQNNE SK          +SMP +TFLNSNFQSVNNSLLY A+L +RDPGLHLSFSRNPTGDRF  DDKKQ H KY
Subjt:  NDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

A0A6J1D126 serine/arginine repetitive matrix protein 1-like3.1e-8165.82Show/hide
Query:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPL--------LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPID
        MAGR+GRS YRFSSVNRP APGGNTS+QD  QYDGR+YPSAT+D+S E R    P          SPT+SVKKAASPPSSPP YRA   R + SP + +D
Subjt:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPL--------LSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPID

Query:  EYPNYKPTTRSRSPEVEPKSMVYKAIEK-VTKSDHHVGSGKTTTSSHKQQ--PNAINISGENLGAVMEIVES-KREGGHVIKKKETVRGILNSITNDQNN
        EYP YKPTT+ RSPE + K ++YKAIEK  TKSD +  +GKTT+S  +QQ  PNAINISGENLGAVMEIV+S KREGGH+I+KKE+    L+S  NDQNN
Subjt:  EYPNYKPTTRSRSPEVEPKSMVYKAIEK-VTKSDHHVGSGKTTTSSHKQQ--PNAINISGENLGAVMEIVES-KREGGHVIKKKETVRGILNSITNDQNN

Query:  EDSKDHK---AEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY
        E SKD K   A    + +S+PA+TFLNSNFQSVNNSLLY ASLAHRDPGLHL+F+RNP GDRF  DDKK  H KY
Subjt:  EDSKDHK---AEGQKALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY

A0A6J1INF1 uncharacterized protein LOC111479075 isoform X11.9e-5454.75Show/hide
Query:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLLSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPIDEYPNYKPT
        MA RFGRSLYRFSSVNRP AP               RYPS       E RP+ SP   P     + ASPPSS         R+I SP  P+ +YP     
Subjt:  MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLLSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPIDEYPNYKPT

Query:  TRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKKKETVRGILNS---ITNDQNNEDSKDHKA
           +SPE + K+MV+K +EK  KS+ ++ SG+T  +   QQPNAINI+GEN+GAVMEIVE SK EGGHV+KKKET RG++++     NDQ+ + SK HKA
Subjt:  TRSRSPEVEPKSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVE-SKREGGHVIKKKETVRGILNS---ITNDQNNEDSKDHKA

Query:  EGQK--ALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQH
          QK  A +S+P +TFLN+NFQSVNNSLL+ ASLAHRDPGLHL+FSRN TGDRFT DDKKQ H
Subjt:  EGQK--ALNSMPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63310.1 unknown protein1.2e-0529.37Show/hide
Query:  IEKVTKSDHHVGSGKTTTSSHKQQPNAINI---SGENLGAVMEIVESKREGGHVIKKKETVRGILNSITNDQNNEDSKDHKAEGQKALNSMPASTFLNSN
        ++ +  S  H+G  K   S + ++ + I +   SG NLGA M+       G                   DQN +   D              ST++NSN
Subjt:  IEKVTKSDHHVGSGKTTTSSHKQQPNAINI---SGENLGAVMEIVESKREGGHVIKKKETVRGILNSITNDQNNEDSKDHKAEGQKALNSMPASTFLNSN

Query:  FQSVNNSLLYGASLAHRDPGLHLSFS
        FQ+VNNS++ GA     DPG+HL  S
Subjt:  FQSVNNSLLYGASLAHRDPGLHLSFS

AT2G46630.1 unknown protein7.4e-1126.02Show/hide
Query:  KKAASPPSSPPSYRASTVRMIRSPSKPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTK-------------SDHHVGS--------------------
        +K  SPPS   S R++T   +++ S    E     P+ R  SP   P S+++   E   K             ++HH  +                    
Subjt:  KKAASPPSSPPSYRASTVRMIRSPSKPIDEYPNYKPTTRSRSPEVEPKSMVYKAIEKVTK-------------SDHHVGS--------------------

Query:  ----GKTTTSSHKQQPNA----------INISGENLGAVMEIVES---KREGGHVIKKKETVRGILNSITNDQNNEDSKDHKAEGQKAL---------NS
            G      H+Q  ++          I I+GEN GAVMEI+ S    + GG          G        Q++  S   + EG+K           ++
Subjt:  ----GKTTTSSHKQQPNA----------INISGENLGAVMEIVES---KREGGHVIKKKETVRGILNSITNDQNNEDSKDHKAEGQKAL---------NS

Query:  MPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTH
        +P   F+NSN Q +NNS++Y ++ +H DPG+HL  SR P  D   H
Subjt:  MPASTFLNSNFQSVNNSLLYGASLAHRDPGLHLSFSRNPTGDRFTH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGTCGTTTTGGTCGTTCGTTATACCGTTTTTCTTCCGTTAACCGACCCGCTGCCCCCGGCGGCAATACTTCAAGTCAAGATTTGAATCAGTATGATGGTCGGCG
GTATCCTTCAGCCACCAAAGACACGTCGGATGAACTCCGACCGACTCGCTCACCGCTGCTTTCGCCTACGTTCTCTGTCAAGAAGGCTGCTTCGCCGCCATCTTCTCCTC
CGTCTTACAGAGCTTCGACTGTACGAATGATTCGCTCGCCATCGAAGCCTATAGACGAATACCCTAATTACAAACCTACTACTCGATCTAGGTCACCTGAAGTCGAGCCT
AAATCGATGGTCTACAAAGCCATCGAGAAGGTGACGAAGTCGGACCATCACGTCGGGTCGGGGAAGACGACGACCTCGTCCCACAAGCAGCAGCCAAATGCAATAAACAT
ATCAGGAGAAAATCTTGGGGCAGTAATGGAAATTGTTGAGTCAAAACGGGAAGGGGGGCATGTCATAAAGAAGAAAGAGACAGTTAGAGGAATATTAAACAGCATTACCA
ATGACCAAAACAACGAAGATTCTAAGGACCACAAGGCCGAAGGCCAAAAAGCACTTAATTCTATGCCAGCAAGCACTTTTTTGAACAGCAATTTTCAGAGCGTAAATAAT
TCTCTTCTTTACGGTGCTTCTTTGGCTCACCGTGACCCCGGTTTGCACCTCTCTTTCTCCCGGAACCCGACCGGCGACCGGTTCACTCATGATGACAAGAAGCAACAGCA
CAAGAAGTACTAG
mRNA sequenceShow/hide mRNA sequence
GGCTCTTCTTATAAATAACCCTAAAACACTCACATCACATTCATCATCTGAGTTTTTAAATTAAATTTTTCCATATATCATTTTCTATTTTATTTCCGACCAAACAAACC
TTTTTTTTCTTGTTTCATTATGGCCGGTCGTTTTGGTCGTTCGTTATACCGTTTTTCTTCCGTTAACCGACCCGCTGCCCCCGGCGGCAATACTTCAAGTCAAGATTTGA
ATCAGTATGATGGTCGGCGGTATCCTTCAGCCACCAAAGACACGTCGGATGAACTCCGACCGACTCGCTCACCGCTGCTTTCGCCTACGTTCTCTGTCAAGAAGGCTGCT
TCGCCGCCATCTTCTCCTCCGTCTTACAGAGCTTCGACTGTACGAATGATTCGCTCGCCATCGAAGCCTATAGACGAATACCCTAATTACAAACCTACTACTCGATCTAG
GTCACCTGAAGTCGAGCCTAAATCGATGGTCTACAAAGCCATCGAGAAGGTGACGAAGTCGGACCATCACGTCGGGTCGGGGAAGACGACGACCTCGTCCCACAAGCAGC
AGCCAAATGCAATAAACATATCAGGAGAAAATCTTGGGGCAGTAATGGAAATTGTTGAGTCAAAACGGGAAGGGGGGCATGTCATAAAGAAGAAAGAGACAGTTAGAGGA
ATATTAAACAGCATTACCAATGACCAAAACAACGAAGATTCTAAGGACCACAAGGCCGAAGGCCAAAAAGCACTTAATTCTATGCCAGCAAGCACTTTTTTGAACAGCAA
TTTTCAGAGCGTAAATAATTCTCTTCTTTACGGTGCTTCTTTGGCTCACCGTGACCCCGGTTTGCACCTCTCTTTCTCCCGGAACCCGACCGGCGACCGGTTCACTCATG
ATGACAAGAAGCAACAGCACAAGAAGTACTAGAGATAAAGAAGCAAGAGAGAATTTAATGATGGATAATTAATAAAACAATAAGATTTATATATTATTATTATTATTATC
TTATGAGAATAAAATGAGATGATTTATGTAACAGAAAAATGTCAGTATTATTTAAGTTTTATATGTGGTTTCATTGGTCATTAATTTATGTATGTTGAATATATAATATA
AAGTTTGGATTTTCTTTTGGTTTTTGTGGCCAAAATTAATTTTAATCAAGT
Protein sequenceShow/hide protein sequence
MAGRFGRSLYRFSSVNRPAAPGGNTSSQDLNQYDGRRYPSATKDTSDELRPTRSPLLSPTFSVKKAASPPSSPPSYRASTVRMIRSPSKPIDEYPNYKPTTRSRSPEVEP
KSMVYKAIEKVTKSDHHVGSGKTTTSSHKQQPNAINISGENLGAVMEIVESKREGGHVIKKKETVRGILNSITNDQNNEDSKDHKAEGQKALNSMPASTFLNSNFQSVNN
SLLYGASLAHRDPGLHLSFSRNPTGDRFTHDDKKQQHKKY