CuGenDBv2

Tan0020624 (gene) of Snake gourd v1 genome

Gene ID: Tan0020624
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Glial fibrillary acidic protein-like
Genome location: LG04:27406558..27408469
RNA-Seq Expression: Tan0020624
Synteny: Tan0020624
Gene Ontology terms: NA
InterPro domains: NA
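
The genome location string in this record can be split into its parts programmatically. The following is a minimal sketch, assuming the `sequence:start..end` layout seen here and assuming 1-based inclusive coordinates (the record does not state the convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string like 'LG04:27406558..27408469' into (sequence id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG04:27406558..27408469")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span works out to 1912 bp.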


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa] | e value: 3.2e-56 | % identity: 30.32
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A   P EDSWE V Q     Q  S + WP+  EV LP+ CQL  + NN+EELK LWE L  E RA F   YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLE-----------------------------------------IS
         HFWDPVLKCF+F   DLTPTIEEYQALI +P   G+K++ + R+ TLQRSLSKF+                                          I+
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLE-----------------------------------------IS

Query:  LLHIGIQAFP----------------LSKLGSPL------TYRC---------GQ-FPFVPLLGLW----------------------------------
        L   G   FP                + +  +P+      T+R          G+ F   P+L +W                                  
Subjt:  LLHIGIQAFP----------------LSKLGSPL------TYRC---------GQ-FPFVPLLGLW----------------------------------

Query:  --------------GSIAYSPLLVLRQ-------------------------------AINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMP
                      G IAYSPLLVLRQ                               A+  WK ++K+K+++HCEGT  QY++WRA R    + ++PM 
Subjt:  --------------GSIAYSPLLVLRQ-------------------------------AINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMP

Query:  IDLEETLKI-DHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAE
          L   +++  + D EKELK  +E N VL  ENE+LR EVK W+ Q+ +  R L+E ++                                         
Subjt:  IDLEETLKI-DHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAE

Query:  AQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL
                                        K  ET + +    E+LQ   +EYKSQ ++AE +N  LQ  + S E QL++ R A EV+ +D+A+L
Subjt:  AQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL

KAA0046606.1 glial fibrillary acidic protein-like [Cucumis melo var. makuwa] | e value: 6.6e-54 | % identity: 35.49
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A+  P EDSWE V Q     Q  S   WP+  EV LP+ CQL  + NN+EELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLEISLLHIGIQAFPLSK-LGSPLTYRCGQFPFVPLLGLWGSIAYS
         HFWDPVLKCF+F   DLTPTIEEYQALI +P   G+ ++ +DR+ TLQRSLSKF+        I A  L K + +     C    ++  L     +  +
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLEISLLHIGIQAFPLSK-LGSPLTYRCGQFPFVPLLGLWGSIAYS

Query:  PLLVLRQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVD--VSP-MPIDLEETLKIDHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTS
         L ++   I       ++K           Y +   V+I+  ++  V+P +PI  E    ++H  ++ + K                   + +WI   +S
Subjt:  PLLVLRQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVD--VSP-MPIDLEETLKIDHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTS

Query:  AKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQ
          R L+E +   +++LELE+EN SLN E +Q+RKKN+ L R I  L  E EA+K                                 T + +    E+LQ
Subjt:  AKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQ

Query:  KLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL
           +EYKSQ ++AE +N  LQ  + S E QL++ R A +V+ +D+A+L
Subjt:  KLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL

KAA0065295.1 uncharacterized protein E6C27_scaffold1023G00080 [Cucumis melo var. makuwa] | e value: 2.9e-41 | % identity: 55.77
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A+  P EDSW+ V Q     Q  S + WP+  EV LP+ CQLE + NN+EELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFL
         HFWDPVLK F+F   DLTPTIEEYQALI +P   G+K++ +DR+ TLQRSLSKF+
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFL

KAA0066094.1 girdin-like [Cucumis melo var. makuwa] | e value: 3.0e-46 | % identity: 30.48
Query:  AEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQALVHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFT
        +++ +   CQL    N++  LK +WE L+ + R  FS  YG+IA+LMY  VN  AL+A+++FWDP   CF+F  CDL PTIEEYQA++ +P      V+ 
Subjt:  AEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQALVHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFT

Query:  HDRRSTLQRSLSKFLE-----------------------------------------ISLLHIGIQAFPLSK---LGSPLTYRCGQFPFVPLLGLWGSIA
         + + T +R+LSKFLE                                         ++L   G   FP ++    G  + YRCG F  VPLLG WG + 
Subjt:  HDRRSTLQRSLSKFLE-----------------------------------------ISLLHIGIQAFPLSK---LGSPLTYRCGQFPFVPLLGLWGSIA

Query:  YSPLLVL-------------------------------RQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMPIDLEETLKIDHVD--LEK
        Y+PLLVL                               RQA+  WK IRK+K+  H EG    YE W+A R    +D+S   ++  +    +  +  +EK
Subjt:  YSPLLVL-------------------------------RQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMPIDLEETLKIDHVD--LEK

Query:  ELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLN
         ++L +E+N +L +ENE+LR E   W+  +T  + +LE+ +  L+ Q +LE++  +L+ E+ ++ K NR L  E   LQ    +Q  +IKDL+       
Subjt:  ELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLN

Query:  NAMKNFQD---ILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQ
        N  + F +    LN  + K +  I+ LE +   L++  D    +  +      +L+
Subjt:  NAMKNFQD---ILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQ

TYK28912.1 hypothetical protein E5676_scaffold3695G00150 [Cucumis melo var. makuwa] | e value: 6.2e-36 | % identity: 52.7
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDT+      +  P ED WE V Q     Q  S V WP+  EVQLP+ CQL  + NN++ELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTL
         HFWDP+LKCF F   DLTPTIEEYQALI +P   G+K + +DR+ TL
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTL
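
For readers who want to inspect the alignment blocks above, the following is a rough sketch of how an identity percentage could be estimated from the middle line of one block. It assumes the usual BLAST-style convention that a letter in the middle line marks an identical residue, while `+` and spaces mark similar residues, mismatches, or gaps. The function name `percent_identity` is hypothetical, and the result for a single block need not match the % identity reported for a hit, which the database presumably computes over the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns marked identical in the midline.

    Assumes letters in the midline mark identical residues, while '+' and
    spaces mark conservative substitutions, mismatches, or gaps.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Extraction may strip trailing spaces from the midline, so pad it back.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# Toy five-column example (not taken from the record).
print(percent_identity("MVDTH", "MV+ H"))
```

Both arguments are expected without the `Query:` label or the leading indentation that aligns the middle line under it.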

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SUT0 Reverse transcriptase | e value: 1.5e-56 | % identity: 30.32
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A   P EDSWE V Q     Q  S + WP+  EV LP+ CQL  + NN+EELK LWE L  E RA F   YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLE-----------------------------------------IS
         HFWDPVLKCF+F   DLTPTIEEYQALI +P   G+K++ + R+ TLQRSLSKF+                                          I+
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLE-----------------------------------------IS

Query:  LLHIGIQAFP----------------LSKLGSPL------TYRC---------GQ-FPFVPLLGLW----------------------------------
        L   G   FP                + +  +P+      T+R          G+ F   P+L +W                                  
Subjt:  LLHIGIQAFP----------------LSKLGSPL------TYRC---------GQ-FPFVPLLGLW----------------------------------

Query:  --------------GSIAYSPLLVLRQ-------------------------------AINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMP
                      G IAYSPLLVLRQ                               A+  WK ++K+K+++HCEGT  QY++WRA R    + ++PM 
Subjt:  --------------GSIAYSPLLVLRQ-------------------------------AINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMP

Query:  IDLEETLKI-DHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAE
          L   +++  + D EKELK  +E N VL  ENE+LR EVK W+ Q+ +  R L+E ++                                         
Subjt:  IDLEETLKI-DHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAE

Query:  AQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL
                                        K  ET + +    E+LQ   +EYKSQ ++AE +N  LQ  + S E QL++ R A EV+ +D+A+L
Subjt:  AQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL

A0A5A7TXA1 Glial fibrillary acidic protein-like | e value: 3.2e-54 | % identity: 35.49
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A+  P EDSWE V Q     Q  S   WP+  EV LP+ CQL  + NN+EELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLEISLLHIGIQAFPLSK-LGSPLTYRCGQFPFVPLLGLWGSIAYS
         HFWDPVLKCF+F   DLTPTIEEYQALI +P   G+ ++ +DR+ TLQRSLSKF+        I A  L K + +     C    ++  L     +  +
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLEISLLHIGIQAFPLSK-LGSPLTYRCGQFPFVPLLGLWGSIAYS

Query:  PLLVLRQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVD--VSP-MPIDLEETLKIDHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTS
         L ++   I       ++K           Y +   V+I+  ++  V+P +PI  E    ++H  ++ + K                   + +WI   +S
Subjt:  PLLVLRQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVD--VSP-MPIDLEETLKIDHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTS

Query:  AKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQ
          R L+E +   +++LELE+EN SLN E +Q+RKKN+ L R I  L  E EA+K                                 T + +    E+LQ
Subjt:  AKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQ

Query:  KLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL
           +EYKSQ ++AE +N  LQ  + S E QL++ R A +V+ +D+A+L
Subjt:  KLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMEDHARL

A0A5A7VFL0 Girdin-like | e value: 1.4e-46 | % identity: 30.48
Query:  AEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQALVHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFT
        +++ +   CQL    N++  LK +WE L+ + R  FS  YG+IA+LMY  VN  AL+A+++FWDP   CF+F  CDL PTIEEYQA++ +P      V+ 
Subjt:  AEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQALVHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFT

Query:  HDRRSTLQRSLSKFLE-----------------------------------------ISLLHIGIQAFPLSK---LGSPLTYRCGQFPFVPLLGLWGSIA
         + + T +R+LSKFLE                                         ++L   G   FP ++    G  + YRCG F  VPLLG WG + 
Subjt:  HDRRSTLQRSLSKFLE-----------------------------------------ISLLHIGIQAFPLSK---LGSPLTYRCGQFPFVPLLGLWGSIA

Query:  YSPLLVL-------------------------------RQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMPIDLEETLKIDHVD--LEK
        Y+PLLVL                               RQA+  WK IRK+K+  H EG    YE W+A R    +D+S   ++  +    +  +  +EK
Subjt:  YSPLLVL-------------------------------RQAINVWKDIRKLKNIQHCEGTMPQYEDWRAVRIWAKVDVSPMPIDLEETLKIDHVD--LEK

Query:  ELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLN
         ++L +E+N +L +ENE+LR E   W+  +T  + +LE+ +  L+ Q +LE++  +L+ E+ ++ K NR L  E   LQ    +Q  +IKDL+       
Subjt:  ELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRKKNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLN

Query:  NAMKNFQD---ILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQ
        N  + F +    LN  + K +  I+ LE +   L++  D    +  +      +L+
Subjt:  NAMKNFQD---ILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQ

A0A5A7VHI3 Uncharacterized protein | e value: 1.4e-41 | % identity: 55.77
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDTH     A+  P EDSW+ V Q     Q  S + WP+  EV LP+ CQLE + NN+EELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFL
         HFWDPVLK F+F   DLTPTIEEYQALI +P   G+K++ +DR+ TLQRSLSKF+
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFL

A0A5D3DZ69 Uncharacterized protein | e value: 3.0e-36 | % identity: 52.7
Query:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL
        MVDT+      +  P ED WE V Q     Q  S V WP+  EVQLP+ CQL  + NN++ELK LWE L  E RA F+  YG+I DL+Y  +N + LQAL
Subjt:  MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQAL

Query:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTL
         HFWDP+LKCF F   DLTPTIEEYQALI +P   G+K + +DR+ TL
Subjt:  VHFWDPVLKCFSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTL

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTAGACACTCATTTGCCCGGGGCATTCGCCAAATCCACCCCTGAGGAAGATTCGTGGGAAGTAGTGTTCCAGGAGTTTGGGCAGTTGCAACATTCATCAGAGGTTAT
TTGGCCAAGGGTTGCAGAGGTCCAACTCCCTCAGGATTGTCAGCTAGAGCTCGTTCTTAACAATGTGGAAGAATTAAAAGGTTTATGGGAAGGTTTGAGTATGGAAAGCA
GAGCTAGTTTTAGTGTAGTGTATGGGAATATCGCTGATCTCATGTATGCAGACGTCAACTTGAACGCTTTGCAAGCTTTAGTTCATTTCTGGGATCCCGTGCTTAAATGT
TTCTCATTTAAGATGTGTGATTTGACTCCCACGATCGAGGAGTATCAGGCTTTGATACGTGTACCAGCCAGTACAGGAAGTAAAGTCTTCACTCATGATCGAAGGTCGAC
TTTACAAAGATCTTTATCCAAGTTTTTGGAGATTTCATTGCTGCACATTGGGATCCAAGCTTTCCCTCTATCGAAGCTTGGAAGTCCCTTAACATATCGATGTGGGCAAT
TTCCATTTGTACCACTTTTGGGACTGTGGGGGAGTATTGCTTACTCGCCTCTGTTGGTACTACGACAGGCAATCAATGTTTGGAAAGATATAAGGAAATTGAAAAATATC
CAACATTGTGAAGGAACCATGCCACAATACGAAGACTGGAGAGCAGTTAGAATTTGGGCAAAAGTTGATGTTTCACCAATGCCAATTGATTTAGAGGAAACTTTGAAGAT
AGACCATGTTGATTTAGAAAAAGAATTGAAGCTTGCAAAAGAAAGAAACTCTGTGCTCATGAAGGAAAATGAGGAATTAAGGGCTGAAGTTAAATTATGGATTGGACAAT
CCACAAGCGCGAAAAGACAGTTGGAAGAAGTTCAACAGCGTTTACAGAGGCAGCTTGAATTAGAGAGGGAAAACGGTTCTCTAAACGCAGAAGTCGTCCAACTACGGAAG
AAAAATAGGGGCTTGTGGAGAGAAATAGAAGTCCTACAAGGTGAGGCAGAGGCTCAGAAATTACACATCAAAGACCTGAAGCAAGAGGTATGGAAGCTCAACAATGCCAT
GAAAAACTTCCAAGATATACTTAATGAACAAGTTACAAAGTCACAAGAGACAATTGTGTCTTTGGAAAAAGAAAAGGAACTTTTGCAAAAGCTGACAGATGAGTATAAGA
GCCAATTTGTTGATGCAGAACGGAGAAACACACTGCTTCAAGATACTATTGCAAGTTTGGAGCACCAGTTGGTAGTATATCGAAATGCGAACGAAGTAGTGATGGAAGAT
CATGCACGATTAAAATAA
mRNA sequence
ATGGTAGACACTCATTTGCCCGGGGCATTCGCCAAATCCACCCCTGAGGAAGATTCGTGGGAAGTAGTGTTCCAGGAGTTTGGGCAGTTGCAACATTCATCAGAGGTTAT
TTGGCCAAGGGTTGCAGAGGTCCAACTCCCTCAGGATTGTCAGCTAGAGCTCGTTCTTAACAATGTGGAAGAATTAAAAGGTTTATGGGAAGGTTTGAGTATGGAAAGCA
GAGCTAGTTTTAGTGTAGTGTATGGGAATATCGCTGATCTCATGTATGCAGACGTCAACTTGAACGCTTTGCAAGCTTTAGTTCATTTCTGGGATCCCGTGCTTAAATGT
TTCTCATTTAAGATGTGTGATTTGACTCCCACGATCGAGGAGTATCAGGCTTTGATACGTGTACCAGCCAGTACAGGAAGTAAAGTCTTCACTCATGATCGAAGGTCGAC
TTTACAAAGATCTTTATCCAAGTTTTTGGAGATTTCATTGCTGCACATTGGGATCCAAGCTTTCCCTCTATCGAAGCTTGGAAGTCCCTTAACATATCGATGTGGGCAAT
TTCCATTTGTACCACTTTTGGGACTGTGGGGGAGTATTGCTTACTCGCCTCTGTTGGTACTACGACAGGCAATCAATGTTTGGAAAGATATAAGGAAATTGAAAAATATC
CAACATTGTGAAGGAACCATGCCACAATACGAAGACTGGAGAGCAGTTAGAATTTGGGCAAAAGTTGATGTTTCACCAATGCCAATTGATTTAGAGGAAACTTTGAAGAT
AGACCATGTTGATTTAGAAAAAGAATTGAAGCTTGCAAAAGAAAGAAACTCTGTGCTCATGAAGGAAAATGAGGAATTAAGGGCTGAAGTTAAATTATGGATTGGACAAT
CCACAAGCGCGAAAAGACAGTTGGAAGAAGTTCAACAGCGTTTACAGAGGCAGCTTGAATTAGAGAGGGAAAACGGTTCTCTAAACGCAGAAGTCGTCCAACTACGGAAG
AAAAATAGGGGCTTGTGGAGAGAAATAGAAGTCCTACAAGGTGAGGCAGAGGCTCAGAAATTACACATCAAAGACCTGAAGCAAGAGGTATGGAAGCTCAACAATGCCAT
GAAAAACTTCCAAGATATACTTAATGAACAAGTTACAAAGTCACAAGAGACAATTGTGTCTTTGGAAAAAGAAAAGGAACTTTTGCAAAAGCTGACAGATGAGTATAAGA
GCCAATTTGTTGATGCAGAACGGAGAAACACACTGCTTCAAGATACTATTGCAAGTTTGGAGCACCAGTTGGTAGTATATCGAAATGCGAACGAAGTAGTGATGGAAGAT
CATGCACGATTAAAATAA
Protein sequence
MVDTHLPGAFAKSTPEEDSWEVVFQEFGQLQHSSEVIWPRVAEVQLPQDCQLELVLNNVEELKGLWEGLSMESRASFSVVYGNIADLMYADVNLNALQALVHFWDPVLKC
FSFKMCDLTPTIEEYQALIRVPASTGSKVFTHDRRSTLQRSLSKFLEISLLHIGIQAFPLSKLGSPLTYRCGQFPFVPLLGLWGSIAYSPLLVLRQAINVWKDIRKLKNI
QHCEGTMPQYEDWRAVRIWAKVDVSPMPIDLEETLKIDHVDLEKELKLAKERNSVLMKENEELRAEVKLWIGQSTSAKRQLEEVQQRLQRQLELERENGSLNAEVVQLRK
KNRGLWREIEVLQGEAEAQKLHIKDLKQEVWKLNNAMKNFQDILNEQVTKSQETIVSLEKEKELLQKLTDEYKSQFVDAERRNTLLQDTIASLEHQLVVYRNANEVVMED
HARLK
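
The CDS and protein sequences listed above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last six codons of the CDS shown in this record.
print(translate("ATGGTAGACACTCATTTGCCCGGG"))  # MVDTHLPG
print(translate("CATGCACGATTAAAATAA"))        # HARLK
```

Passing the full wrapped CDS as one string and comparing the result with the protein sequence, after joining its lines, would confirm that the two listings agree.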