CuGenDBv2

Tan0007243 (gene) of Snake gourd v1 genome

Gene ID: Tan0007243
Organism: Trichosanthes anguina (Snake gourd v1)
Description: sp110 nuclear body protein-like
Genome location: LG03:78008863..78009717
RNA-Seq Expression: Tan0007243
Synteny: Tan0007243
Gene Ontology terms: NA
InterPro domains: NA
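
As a quick consistency check on the genome location, the following is a minimal sketch that parses the `LG03:78008863..78009717` string and computes the span. It assumes the coordinates are 1-based and inclusive, which is the usual reading of the `start..end` notation but is not stated on this page; the helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("LG03:78008863..78009717")
print(chrom, span_length(start, end))
```

Under that assumption the span is 855 bp, the same length as the CDS sequence listed for this gene.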


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594621.1 hypothetical protein SDJN03_11174, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.2e-86 | % identity: 67.44
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI
        MGCCVSSGKS +S HK D    A   KIFGP TDNGSRE PS+MEEETVKEVLSET +LKP P+ PP K+CPPEE+     VGDKV     EIEKKL EI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI

Query:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV
        PINGI QQASEF EISN  +  AT+NFTD MDGG EVHQ VLK+ P    NQSI  +V LKR+LSPNKTLNRRSD+ PVRRN+ VGSARLVQ +D SPA+
Subjt:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV

Query:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF
          RGLR EP  +D DEN GRRSRSPATA  DSGGSRS      SVRK  KS+      A   APA S+KVVEENNI D    TQIESLENPLVSLECFIF
Subjt:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF

Query:  L
        L
Subjt:  L

KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.5e-86 | % identity: 67.44
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI
        MGCCVSSGKS +S HK D    A   KIFGP TDNGSRE PS+MEEETVKEVLSET +LKP P+ PP K+CPPEE+     VGDKV     EIEKKL EI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI

Query:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV
        PINGI QQASEF EISN  +  AT+NFTD MDGG EVHQ VLK+ P    NQSI  +V LKR+LSPNKTLNRRSD+ PVRRN+ VGSARLVQ RD SPA+
Subjt:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV

Query:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF
          RGLR EP ++D DEN GRRSRSPATA  DSGGSRS      SVRK  KS+      A   APA S+KVVEENNI +    TQIESLENPLVSLECFIF
Subjt:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF

Query:  L
        L
Subjt:  L

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata] | e value: 5.5e-86 | % identity: 67.44
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPP-KNCPPEEE----TVGDKVKNEGDEIEKKLREI
        MGCCVSSGKS +S HK D    A   KIFGP TDNGSRE PS+MEEETVKEVLSET +LKP  + PP KNCPPEE+     VGDKV     EIEKKL EI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPP-KNCPPEEE----TVGDKVKNEGDEIEKKLREI

Query:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV
        PINGI QQ SEF EISN  +  AT+NFTD MDGG EVHQ VLK+ P    NQSI  +V LKR+LSPNKTLNRRSD+ PVRRN+ VGSARLVQ RD SPA+
Subjt:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV

Query:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF
          RGLR EP ++D DEN GRRSRSPATA  DSGGSRS      SVRK  KS+      A   APA S+KVVEENNI D    TQIESLENPLVSLECFIF
Subjt:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF

Query:  L
        L
Subjt:  L

XP_023518159.1 uncharacterized protein LOC111781703 [Cucurbita pepo subsp. pepo] | e value: 1.9e-83 | % identity: 67.57
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI
        MGCCVSSGKS +S HK D    A   KIFGP TDNGSRE PS+MEEETVKEVLSET +LKP  + PP KNCPPEE+     VGDKV     EIEKKL EI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKP-PLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREI

Query:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV
        PINGIAQQASEF EISN  +  AT+NFTDTMDGG EVHQ VLK+ P    NQSI  DV LKR+LSPNKTLNRRSD+ PVRRN+ VGSARLVQ RD SPA+
Subjt:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV

Query:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLE
          RGLR EP ++D DEN GRRSRSPATA  D GGSRS      SVRK  KS+      A   A A S+KVVEENNI D    TQIESLENPLVSLE
Subjt:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLE

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida] | e value: 7.4e-83 | % identity: 65.53
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI
        MGCC+SSGKS NS +K  RNS            DNGSR+ PS+MEEETVKEVLSETPSLKPP  PP KN PPEE+ V   +K  G+EIEKKL EI INGI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI

Query:  AQQASEFYEISNLKELLATSN--FTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR
        A+  SEFYEIS+  E ++ S    T+ MDGGGE+HQ VLKSSPVKL K+QSI  D E+KRE+S N+TL RRSD+ PVRRN  +GS R+V NRD++PA+AR
Subjt:  AQQASEFYEISNLKELLATSN--FTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR

Query:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL
        R LRAEPPRRD DEN  RRSRSPATA SD  GSRS      SVRK  KS+    A A S+KVVEENNIID K N+QIESLENPLVSLECFIFL
Subjt:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLE9 Uncharacterized protein | e value: 1.9e-76 | % identity: 61.77
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI
        MGCC+SS +S +S +K   NS             N SR+ PS+MEEETVKEVLSETP+LKP   P   N  PE++   +  K  GDEIEKKL EIPINGI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI

Query:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR
         +Q SEFYEIS++ + ++ S   FTD  DGGGEVHQ VLKSSPVKL KNQS+  DVELKRE+  ++TL RRSD+ PVRRN  VGS R+V NRD+SPA+AR
Subjt:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR

Query:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL
        RGLRAEPPRRD DEN  RRS SP+TA SDS G RS      S RK  KS+      A S+KVVEENNI+D K NTQIESLENPLVSLECFIFL
Subjt:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC103485312 | e value: 3.1e-79 | % identity: 62.8
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI
        MGCC+SS +S NS +K            F P + N +R+ PS+MEEETVKEVLSETP+LKP   PP KNCPPEE+   +  K  GDE EKKL EIPINGI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI

Query:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR
         +Q SEFYEIS++ + ++ S   FTD  DGGGEVHQ  LKSSPVKL KNQS+  DVELKRE+  ++TL RRSD+ PVRRN  VGS R+V NRD+SPA+AR
Subjt:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR

Query:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL
        RGLRAEPPRRD DEN  RRS+SP+TA SDS G RS      S RK  KS+      A S+KVVEENNI+D K NTQIESLENPLVSLECFIFL
Subjt:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein | e value: 3.1e-79 | % identity: 62.8
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI
        MGCC+SS +S NS +K            F P + N +R+ PS+MEEETVKEVLSETP+LKP   PP KNCPPEE+   +  K  GDE EKKL EIPINGI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGI

Query:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR
         +Q SEFYEIS++ + ++ S   FTD  DGGGEVHQ  LKSSPVKL KNQS+  DVELKRE+  ++TL RRSD+ PVRRN  VGS R+V NRD+SPA+AR
Subjt:  AQQASEFYEISNLKELLATS--NFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVAR

Query:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL
        RGLRAEPPRRD DEN  RRS+SP+TA SDS G RS      S RK  KS+      A S+KVVEENNI+D K NTQIESLENPLVSLECFIFL
Subjt:  RGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC111012433 | e value: 3.4e-49 | % identity: 61.93
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREIP
        MGCCVSSG   NS HK DRNS A   KI+       SRE PS+MEEETVKEVL+ETP+LKPP  PPPKN PP+E+     V DKVK E +EIEKK+R IP
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEE----TVGDKVKNEGDEIEKKLREIP

Query:  INGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVS
         N +A+ A EF EIS+  E L+ + FTD MD G EVHQRV ++SPVKL KNQS   DV  KRE+ PN+ LNRRSD+ PVRRN  VGSARL QNRD++
Subjt:  INGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKL-KNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVS

A0A6J1EF08 uncharacterized protein LOC111433567 | e value: 2.6e-86 | % identity: 67.44
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPP-KNCPPEEE----TVGDKVKNEGDEIEKKLREI
        MGCCVSSGKS +S HK D    A   KIFGP TDNGSRE PS+MEEETVKEVLSET +LKP  + PP KNCPPEE+     VGDKV     EIEKKL EI
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPP-KNCPPEEE----TVGDKVKNEGDEIEKKLREI

Query:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV
        PINGI QQ SEF EISN  +  AT+NFTD MDGG EVHQ VLK+ P    NQSI  +V LKR+LSPNKTLNRRSD+ PVRRN+ VGSARLVQ RD SPA+
Subjt:  PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAV

Query:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF
          RGLR EP ++D DEN GRRSRSPATA  DSGGSRS      SVRK  KS+      A   APA S+KVVEENNI D    TQIESLENPLVSLECFIF
Subjt:  ARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRS------SVRKPNKSA------ATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIF

Query:  L
        L
Subjt:  L

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G11125.1 unknown protein | e value: 5.1e-05 | % identity: 28.3
Query:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEET-VKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEG--------DEIEKK
        MGCC+SS               A   K   P +   +   PS ++EET VKEVLSET      LL    +    E+T   K++ E         D  ++ 
Subjt:  MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEET-VKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEG--------DEIEKK

Query:  LREIPINGIAQQASEFYEISNLKE-LLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNT--TVGSARLVQN
        +   P     ++ SE  E  +L E LL+  N  D  +        V + SP K +N+ +                NRR+D  P +RN     GS RLV +
Subjt:  LREIPINGIAQQASEFYEISNLKE-LLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNT--TVGSARLVQN

Query:  RDVSPAVARRGLRAEPPRRDADENPGRRSRSPA---------TAHSDSGGSRSSVRKPNKSAATATAPAGSKKVVEENN------IIDRKSNTQIESLEN
           +              RD+ E   RRSRSPA         T    S  S  ++R+ ++S              +E N       I   ++   +S EN
Subjt:  RDVSPAVARRGLRAEPPRRDADENPGRRSRSPA---------TAHSDSGGSRSSVRKPNKSAATATAPAGSKKVVEENN------IIDRKSNTQIESLEN

Query:  PLVSLECFIFL
        PLVSLECFIFL
Subjt:  PLVSLECFIFL

AT1G61170.1 unknown protein | e value: 2.4e-07 | % identity: 31.02
Query:  CCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEET-VKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREI------
        CCVSSG +       DR +            +N S +  + +EEET VKEVLSET    P       +      T  D VK +  E E+K          
Subjt:  CCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEET-VKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREI------

Query:  -----PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVH----QRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLV
             P +   ++ SE  EI +L  L  + + T  M+G  E H    QR  + SP K + Q           ++ N    RR+D+ P +RN   G+    
Subjt:  -----PINGIAQQASEFYEISNLKELLATSNFTDTMDGGGEVH----QRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLV

Query:  QNRDVSPAVARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRSSVRKPNKSAATATAPAGSKKVVEENNIIDRKSN-----TQIESLENPLVSLECF
                    G R     RD  E  GRRSRSPAT  S    ++SS     K+     +P G  ++    N +D++ +     T  E LENPLVSLECF
Subjt:  QNRDVSPAVARRGLRAEPPRRDADENPGRRSRSPATAHSDSGGSRSSVRKPNKSAATATAPAGSKKVVEENNIIDRKSN-----TQIESLENPLVSLECF

Query:  IFL
        IFL
Subjt:  IFL


Sequences
CDS sequence
ATGGGTTGCTGTGTTAGTTCGGGGAAATCGATAAATTCAACACACAAATTAGATCGGAATTCTACGGCAGGCACCGTTAAGATTTTTGGGCCGTTCACGGACAATGGAAG
CAGAGAGGCGCCGTCTGCCATGGAGGAAGAAACCGTCAAAGAAGTGCTCTCTGAAACGCCTTCTCTGAAACCACCACTGCTGCCACCTCCGAAGAACTGTCCACCTGAAG
AAGAAACAGTCGGGGATAAGGTCAAGAATGAGGGAGATGAGATCGAGAAGAAGCTTCGTGAAATTCCCATTAATGGAATTGCACAACAAGCTTCTGAATTCTATGAAATT
TCCAATCTGAAAGAGTTACTCGCCACCTCCAATTTCACCGATACAATGGACGGTGGCGGAGAGGTTCATCAGAGGGTTTTGAAATCATCGCCCGTGAAATTGAAGAATCA
ATCCATTTGCAGGGACGTTGAGTTAAAAAGAGAATTGTCGCCGAACAAGACACTGAACCGAAGATCCGACGAGTTGCCGGTCCGACGAAACACCACCGTCGGGTCGGCGA
GATTGGTTCAGAACAGGGACGTAAGTCCCGCAGTGGCGCGGCGAGGATTGAGGGCAGAGCCTCCCCGGAGAGACGCAGACGAGAATCCCGGCAGGAGATCCCGGTCCCCG
GCCACCGCACATTCCGACAGCGGAGGGTCTAGATCGTCGGTGAGAAAGCCCAATAAGTCGGCGGCGACGGCGACAGCACCGGCGGGCAGTAAAAAAGTAGTGGAAGAAAA
CAACATCATTGATAGAAAGAGCAACACTCAGATTGAGTCACTGGAGAATCCTCTGGTTTCATTGGAGTGCTTCATCTTCCTCTGA
mRNA sequence
ATGGGTTGCTGTGTTAGTTCGGGGAAATCGATAAATTCAACACACAAATTAGATCGGAATTCTACGGCAGGCACCGTTAAGATTTTTGGGCCGTTCACGGACAATGGAAG
CAGAGAGGCGCCGTCTGCCATGGAGGAAGAAACCGTCAAAGAAGTGCTCTCTGAAACGCCTTCTCTGAAACCACCACTGCTGCCACCTCCGAAGAACTGTCCACCTGAAG
AAGAAACAGTCGGGGATAAGGTCAAGAATGAGGGAGATGAGATCGAGAAGAAGCTTCGTGAAATTCCCATTAATGGAATTGCACAACAAGCTTCTGAATTCTATGAAATT
TCCAATCTGAAAGAGTTACTCGCCACCTCCAATTTCACCGATACAATGGACGGTGGCGGAGAGGTTCATCAGAGGGTTTTGAAATCATCGCCCGTGAAATTGAAGAATCA
ATCCATTTGCAGGGACGTTGAGTTAAAAAGAGAATTGTCGCCGAACAAGACACTGAACCGAAGATCCGACGAGTTGCCGGTCCGACGAAACACCACCGTCGGGTCGGCGA
GATTGGTTCAGAACAGGGACGTAAGTCCCGCAGTGGCGCGGCGAGGATTGAGGGCAGAGCCTCCCCGGAGAGACGCAGACGAGAATCCCGGCAGGAGATCCCGGTCCCCG
GCCACCGCACATTCCGACAGCGGAGGGTCTAGATCGTCGGTGAGAAAGCCCAATAAGTCGGCGGCGACGGCGACAGCACCGGCGGGCAGTAAAAAAGTAGTGGAAGAAAA
CAACATCATTGATAGAAAGAGCAACACTCAGATTGAGTCACTGGAGAATCCTCTGGTTTCATTGGAGTGCTTCATCTTCCTCTGA
Protein sequence
MGCCVSSGKSINSTHKLDRNSTAGTVKIFGPFTDNGSREAPSAMEEETVKEVLSETPSLKPPLLPPPKNCPPEEETVGDKVKNEGDEIEKKLREIPINGIAQQASEFYEI
SNLKELLATSNFTDTMDGGGEVHQRVLKSSPVKLKNQSICRDVELKRELSPNKTLNRRSDELPVRRNTTVGSARLVQNRDVSPAVARRGLRAEPPRRDADENPGRRSRSP
ATAHSDSGGSRSSVRKPNKSAATATAPAGSKKVVEENNIIDRKSNTQIESLENPLVSLECFIFL
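
The relationship between the CDS and protein sequences above can be spot-checked with a short translation. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 18 nucleotides of the CDS listed above.
head = "ATGGGTTGCTGTGTTAGTTCGGGGAAATCG"
tail = "TGCTTCATCTTCCTCTGA"

print(translate(head))  # MGCCVSSGKS
print(translate(tail))  # CFIFL*
```

The two fragments translate to the start (`MGCCVSSGKS`) and end (`CFIFL`, followed by the stop codon) of the protein sequence shown above.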