CuGenDBv2

Tan0009775 (gene) of Snake gourd v1 genome

Gene ID: Tan0009775
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG05:19711469..19712472
RNA-Seq Expression: Tan0009775
Synteny: Tan0009775
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
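
The genome location field packs a linkage group and a coordinate range into one string. The following is a minimal sketch of how it could be split and measured. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("LG05:19711469..19712472")
print(chrom, start, end, span_length(start, end))
# prints: LG05 19711469 19712472 1004
```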


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo] (e-value: 5.4e-52; identity: 40.13%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC
        MA  SR+ KHTWTK E+ + VE LV L+   GWRS  G            M+AEKL  + + E +TIDC V++LKK YHAI EM   +CSGFGWNEEF+C
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC

Query:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR
        + AE+++FD+W+KSH  AKG+ +K FP+YDDL+YVFGKDRATG  +ET   + S+ +    + I L       G+    + P +   G    PD    +R
Subjt:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR

Query:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS
               R+    SKRKR S + E ++V+R+ M+     ++ +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + 
Subjt:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS

Query:  RRTYCMRLL
        +  YC  LL
Subjt:  RRTYCMRLL

XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida] (e-value: 1.9e-60; identity: 48.62%)
Query:  ARLVESLVSLIHKGWRSIMGP------SMLAEKLQNSC--LEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM
        A+L+E L+ L+  GWRSIMG       + L+E     C  L QNTI+CKVR+LKKQY+AI+EMLS   SGF WNEEFKCV+ E+E+F+ WV+SH NAKGM
Subjt:  ARLVESLVSLIHKGWRSIMGP------SMLAEKLQNSC--LEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGM

Query:  RNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMRNTSGMSSRSIGSKRKRSSF
         NKPFPHYDDL+              TP            E  ++ S                  + +D++ + PT  S   TS     S GSKRKRSSF
Subjt:  RNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMRNTSGMSSRSIGSKRKRSSF

Query:  QTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRTYCMRLLGR
        Q E+ID++R+T++M + HM +L SWQK+KYELE  R+KEVV+ +Y I+GL E  +V LIDL+V+DIQKTDCFL VP  + + YC+RLLGR
Subjt:  QTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRTYCMRLLGR

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] (e-value: 2.5e-73; identity: 51.13%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHKGWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVE
        M G S+ SKH W+KVEDARLVE+L+ L+  GWRS  G            +L EK+    L +NTI+CKVR+LKKQY+A++EMLS   SGF WNEEFKCV+
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHKGWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVE

Query:  AEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMR
         E+E+FD WV+SH NAKGM  KPFPHYDDL+ VFGKDRA      TP                         E R  E+P    + +D++ + P   S  
Subjt:  AEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMR

Query:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR
          S  +  S GSKRKRSSFQ E+ID+V++T++MQ+ HM +L SWQ EKYELE    KEVV+ +Y I+ L E+D+V LIDL+V+DIQKTDCFL VP  +R+
Subjt:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR

Query:  TYCMRLLGR
         YC+RLLGR
Subjt:  TYCMRLLGR

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] (e-value: 3.3e-65; identity: 47.68%)
Query:  RSSKHTWTKVEDARLVESLVSLIHKGWRSIMGPSMLA----------EKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEV
        + SKH W+KVEDA+ VE+L+ L+  GWRS  G   L           EK+    L QNTI+CKVR+LKKQ +A++EMLS   SGF WNEEFKCV+ E+E+
Subjt:  RSSKHTWTKVEDARLVESLVSLIHKGWRSIMGPSMLA----------EKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEV

Query:  FDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMRNTSGMSS
        FD WV+SH NAKGM NKPFPHYDDL+ VFGK +A G  +E P  M ++A  + E+EIRLGSQD                        TP           
Subjt:  FDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMRNTSGMSS

Query:  RSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRTYCMRLL
                                  ++ HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V LIDLLV+DIQKT+CFL VP  +R+ YC+RLL
Subjt:  RSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRTYCMRLL

Query:  GR
        GR
Subjt:  GR

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] (e-value: 8.6e-74; identity: 51.13%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHKGWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVE
        MAG+ + SKH W+KVED +LVE+L+ L+  GWRS  G            +L EK+    L QNTI+CKVR+LKKQY+A++EMLS   SGFGWNEEFKCV+
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHKGWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVE

Query:  AEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMR
         EKE+FD WV+SH NAKGM NK F HYDDL+ VFGKDRA      TP                         E    E+P    + +D++ + P   S  
Subjt:  AEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPT--SMR

Query:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR
          S ++  S GSKRKR SFQ E+ID++R+T++MQ+ HM +L SWQKEKYELE  RRKEVV+ +Y I+GL E D+V  IDLLV+DIQKTDCFL VP  +R+
Subjt:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR

Query:  TYCMRLLGR
         YC+ LL R
Subjt:  TYCMRLLGR

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953 (e-value: 2.6e-52; identity: 40.13%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC
        MA  SR+ KHTWTK E+ + VE LV L+   GWRS  G            M+AEKL  + + E +TIDC V++LKK YHAI EM   +CSGFGWNEEF+C
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC

Query:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR
        + AE+++FD+W+KSH  AKG+ +K FP+YDDL+YVFGKDRATG  +ET   + S+ +    + I L       G+    + P +   G    PD    +R
Subjt:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR

Query:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS
               R+    SKRKR S + E ++V+R+ M+     ++ +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + 
Subjt:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS

Query:  RRTYCMRLL
        +  YC  LL
Subjt:  RRTYCMRLL

A0A5A7U0H7 Retrotransposon protein (e-value: 2.6e-52; identity: 40.13%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC
        MA  SR+ KHTWTK E+ + VE LV L+   GWRS  G            M+AEKL  + + E +TIDC V++LKK YHAI EM   +CSGFGWNEEF+C
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCL-EQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKC

Query:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR
        + AE+++FD+W+KSH  AKG+ +K FP+YDDL+YVFGKDRATG  +ET   + S+ +    + I L       G+    + P +   G    PD    +R
Subjt:  VEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMR

Query:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS
               R+    SKRKR S + E ++V+R+ M+     ++ +  W KEK  +E   R +VV  L  I  L   DR  L+ +L   ++  + FL +P + 
Subjt:  NTSGMSSRSIG--SKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQS

Query:  RRTYCMRLL
        +  YC  LL
Subjt:  RRTYCMRLL

A0A5A7UME4 Retrotransposon protein (e-value: 2.6e-44; identity: 38.89%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCV
        M  +SR  KHTWTK E+A LVE LV L++  GWRS  G            M+A K+  S +  +TID +++ +K+ +HA+ EM    CSGFGWN+E KC+
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMRN
         AEKEVFD W  SH  AKG+ NK F HYD+L+YVFGKDRATG  AE+  ++ S+     + E      D    +   M +PG+ ++  DDL +T T+   
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMRN

Query:  TSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRT
         S   + S GSKRKR    T+  D+VRT ++     + ++  W   + +     R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   +  
Subjt:  TSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRT

Query:  YCMRLL
        YC  +L
Subjt:  YCMRLL

A0A5D3CBF7 Retrotransposon protein (e-value: 3.4e-44; identity: 39.09%)
Query:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCV
        M  +SR  KHTWTK E+A LVE LV L++  GWRS  G            M+A K+  S +  +TID +++ +K+ +HA+ EM    CSGFGWN+E KC+
Subjt:  MAGTSRSSKHTWTKVEDARLVESLVSLIHK-GWRSIMG----------PSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCV

Query:  EAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFM-GGEQRTMENPGIGDIGEDDLPDTPTSMR
         AEKEVFD W  SH  AKG+ NK F HYD+L+YVFGKDRATG  AE+  ++ S+     +     G+ D M   +   M +PG+ ++  DDL +T T+  
Subjt:  EAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFM-GGEQRTMENPGIGDIGEDDLPDTPTSMR

Query:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR
          S   + S GSKRKR    T+  D+VRT ++     + ++  W   + +     R+E+V  L  I  LT  DR  L+ +L+ ++     FL+VP   + 
Subjt:  NTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRR

Query:  TYCMRLL
         YC  +L
Subjt:  TYCMRLL

A0A6J1DW73 uncharacterized protein LOC111025018 (e-value: 9.4e-50; identity: 50.91%)
Query:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGED
        GFGWN++ KC+EAEKEVFD WVKSH NAKG+RNKP PHYDDL   FGKDRATG   + P++MASSAA  + E+    +QDF   +          D  E+
Subjt:  GFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGED

Query:  DLPDTPTSMRNTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDC
        DLP+TPTS + T G SS   GSKRKRS + +E++DVVRT M MQT H++K+ +W  +K E + ARRK V D L QI  L  +D V L+ +L+++++K+  
Subjt:  DLPDTPTSMRNTSGMSSRSIGSKRKRSSFQTELIDVVRTTMDMQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDC

Query:  FLQVPPQSRRTYCMRLLGRT
        FL+VP + ++ +CM+LLG++
Subjt:  FLQVPPQSRRTYCMRLLGRT

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G30140.1 unknown protein (e-value: 1.8e-08; identity: 28.78%)
Query:  GTSRSSKHTWTKVEDARLVESLVSLIHKGWR---SIMGPSMLAEKLQNSCLEQ-------NTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAE
        G  +   + WT  E     + L+ LI + WR    I+G   +  KL  +  ++            +++ LK  Y +  + L    SGFGW+ E K   A 
Subjt:  GTSRSSKHTWTKVEDARLVESLVSLIHKGWR---SIMGPSMLAEKLQNSCLEQ-------NTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAE

Query:  KEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATG
         EV+  ++K+H N K M+ +   H++DL  +FG   ATG
Subjt:  KEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFGKDRATG

AT4G02210.1 unknown protein (e-value: 1.3e-06; identity: 30.88%)
Query:  KVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFG
        + ++L++Q++AI  +L +   GF W+ E + V A+  V+  ++K+H +A+    +P P+Y DL  + G
Subjt:  KVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFG

AT4G02210.2 unknown protein (e-value: 1.3e-06; identity: 30.88%)
Query:  KVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFG
        + ++L++Q++AI  +L +   GF W+ E + V A+  V+  ++K+H +A+    +P P+Y DL  + G
Subjt:  KVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFG
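
The percent identity reported for each hit can be related to the middle line of its alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts letters in the middle line and divides by the alignment length. That denominator is an assumption about how this page computes the figure, and `percent_identity` is a hypothetical helper name. The strings are copied from the AT4G02210.1 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline letter equals the query residue.

    ASSUMPTION: identity = identical columns / alignment length (gaps included).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)


# Query line and middle line of the AT4G02210.1 alignment, as shown in this record.
QUERY = "KVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMRNKPFPHYDDLAYVFG"
MIDLINE = "+ ++L++Q++AI  +L +   GF W+ E + V A+  V+  ++K+H +A+    +P P+Y DL  + G"

print(round(percent_identity(QUERY, MIDLINE), 2))
# prints: 30.88
```

Here 21 of the 68 aligned columns are identical, which agrees with the 30.88 listed for this hit.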


Sequences
CDS sequence
ATGGCAGGTACTTCACGAAGCTCTAAACATACGTGGACGAAGGTGGAGGATGCGAGATTGGTGGAGTCACTTGTCTCTTTAATACACAAAGGTTGGCGATCGATAATGGG
ACCTTCAATGCTAGCTGAGAAATTACAAAACTCATGCCTAGAACAAAACACAATCGATTGCAAGGTTAGAACTCTCAAGAAACAATACCATGCTATTACAGAGATGCTTA
GTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTAGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGAGCCATACAAATGCGAAAGGGATGAGG
AATAAACCATTTCCACACTATGATGACCTCGCCTATGTCTTTGGAAAGGATAGAGCTACAGGAATGGGTGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGCAGAGCA
AATGGAGGAGGAGATTCGGTTGGGATCACAAGACTTCATGGGAGGGGAACAACGAACAATGGAGAATCCAGGAATTGGTGACATAGGGGAAGATGATTTGCCAGACACAC
CTACCAGTATGCGTAATACATCTGGCATGTCTTCTAGAAGTATTGGGAGCAAAAGAAAACGATCATCCTTCCAAACTGAATTAATTGATGTTGTGCGCACAACAATGGAT
ATGCAAACCAATCACATGCAAAAACTTCTATCCTGGCAGAAGGAGAAGTATGAGTTGGAGGCTGCACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATT
GACTGAGCATGATCGTGTCGCCTTGATAGACTTGCTTGTGAGTGATATCCAAAAGACTGACTGTTTTCTACAGGTTCCACCTCAATCGAGGAGGACATATTGCATGCGTC
TACTGGGAAGGACTGGATGA
mRNA sequence
ATGGCAGGTACTTCACGAAGCTCTAAACATACGTGGACGAAGGTGGAGGATGCGAGATTGGTGGAGTCACTTGTCTCTTTAATACACAAAGGTTGGCGATCGATAATGGG
ACCTTCAATGCTAGCTGAGAAATTACAAAACTCATGCCTAGAACAAAACACAATCGATTGCAAGGTTAGAACTCTCAAGAAACAATACCATGCTATTACAGAGATGCTTA
GTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTAGAGGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGAGCCATACAAATGCGAAAGGGATGAGG
AATAAACCATTTCCACACTATGATGACCTCGCCTATGTCTTTGGAAAGGATAGAGCTACAGGAATGGGTGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGCAGAGCA
AATGGAGGAGGAGATTCGGTTGGGATCACAAGACTTCATGGGAGGGGAACAACGAACAATGGAGAATCCAGGAATTGGTGACATAGGGGAAGATGATTTGCCAGACACAC
CTACCAGTATGCGTAATACATCTGGCATGTCTTCTAGAAGTATTGGGAGCAAAAGAAAACGATCATCCTTCCAAACTGAATTAATTGATGTTGTGCGCACAACAATGGAT
ATGCAAACCAATCACATGCAAAAACTTCTATCCTGGCAGAAGGAGAAGTATGAGTTGGAGGCTGCACGAAGGAAGGAAGTAGTCGATCTCTTGTATCAGATAGAAGGATT
GACTGAGCATGATCGTGTCGCCTTGATAGACTTGCTTGTGAGTGATATCCAAAAGACTGACTGTTTTCTACAGGTTCCACCTCAATCGAGGAGGACATATTGCATGCGTC
TACTGGGAAGGACTGGATGA
Protein sequence
MAGTSRSSKHTWTKVEDARLVESLVSLIHKGWRSIMGPSMLAEKLQNSCLEQNTIDCKVRTLKKQYHAITEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKSHTNAKGMR
NKPFPHYDDLAYVFGKDRATGMGAETPMEMASSAAEQMEEEIRLGSQDFMGGEQRTMENPGIGDIGEDDLPDTPTSMRNTSGMSSRSIGSKRKRSSFQTELIDVVRTTMD
MQTNHMQKLLSWQKEKYELEAARRKEVVDLLYQIEGLTEHDRVALIDLLVSDIQKTDCFLQVPPQSRRTYCMRLLGRTG
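
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code, which this record does not state explicitly. `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (ASSUMED to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace ignored) and drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 12 codons and last 7 codons of the CDS listed in this record.
print(translate("ATGGCAGGTACTTCACGAAGCTCTAAACATACGTGG"))  # prints: MAGTSRSSKHTW
print(translate("CTACTGGGAAGGACTGGATGA"))                 # prints: LLGRTG
```

Pasting the full CDS into `translate` allows the same comparison to be made against the complete protein sequence.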