; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007431 (gene) of Snake gourd v1 genome

Gene IDTan0007431
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationLG01:100234919..100238942
RNA-Seq ExpressionTan0007431
SyntenyTan0007431
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649926.1 hypothetical protein Csa_011922 [Cucumis sativus]3.6e-6355.19Show/hide
Query:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSL----EFCPCFEPEGRFKSRRN
        DQL NFEAA    +  +  KK+VRRRR S RRLY ++PL+MAEARREIVTALKLHRA STKE A++QQQKQDQ+ KQS     +F  CFE EGR KSRRN
Subjt:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSL----EFCPCFEPEGRFKSRRN

Query:  PRIYP----GCFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAATHQEVPKSISLSEEEGKL
        PRIYP     C FY+ENGS  VAPP   ++LN       FDDDFK++DT         SSF SLS  PP  SYICP++S   THQE+PKS+SL EEEG L
Subjt:  PRIYP----GCFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAATHQEVPKSISLSEEEGKL

Query:  MASDLFWSNNVPTGETEKEIQ-----------------------GAVKNDINHSFNKAMEFPDWLSVNDDF-LQHG-----EEDYIQYPDLSRMDIGEIE
        MASD+FW NN PTG +EK++Q                        A++ D  HS + AMEFPDWLS+NDDF LQ+      EEDY+Q PDLS  D  +IE
Subjt:  MASDLFWSNNVPTGETEKEIQ-----------------------GAVKNDINHSFNKAMEFPDWLSVNDDF-LQHG-----EEDYIQYPDLSRMDIGEIE

Query:  DVDGDWLA
        D+D +WLA
Subjt:  DVDGDWLA

KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia]1.2e-9063.38Show/hide
Query:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR
        DQLCNFEA + PQPQ       KK+VRRRRQSRRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R KSRR
Subjt:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR

Query:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLMAS
        NPRIYP C FY ENGS F+APP V+QSL+ D   +++       DTSSVVCN+++ SFYSLS LPP SYICP+  YAA THQEVPKSISLSEEEG+LMAS
Subjt:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLMAS

Query:  DLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFLQ------HGEEDYIQYPDLSRM
        DLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFLQ         EDY+Q PDLS M
Subjt:  DLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFLQ------HGEEDYIQYPDLSRM

Query:  DIGEIEDVDGDWLA
        DIGEIEDVDGDWLA
Subjt:  DIGEIEDVDGDWLA

KAG7037674.1 hypothetical protein SDJN02_01304, partial [Cucurbita argyrosperma subsp. argyrosperma]8.4e-8961.99Show/hide
Query:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF
        DQLCNFEA + PQPQ           KK+VRRRRQSRRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R 
Subjt:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF

Query:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS---SFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEE
        KSRRNPRIYP C FY ENGS F+APP V+QSL+ D   +++       DTSSVVCN++++   SFYSLS LPP SYICP+  YAA THQEVPKSISLSEE
Subjt:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS---SFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEE

Query:  EGKLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQ
        EG+LMASDLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFL      Q   EDY+Q
Subjt:  EGKLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQ

Query:  YPDLSRMDIGEIEDVDGDWLA
         PDLS MDIGEIEDVDGDWLA
Subjt:  YPDLSRMDIGEIEDVDGDWLA

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata]5.8e-9063.29Show/hide
Query:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR
        DQLCNFEA + PQPQ       KK+VRRRRQSRRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R KSRR
Subjt:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR

Query:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVC--NDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLM
        NPRIYP C FY ENGSDF+APP V+QSL+ D   +++       DTSSVVC  N+++ SFYSLS LPP SYICP+  YAA THQEVPKSISLSEEEG+LM
Subjt:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVC--NDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLM

Query:  ASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYPDLS
        ASDLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFL      Q   EDY+Q PDLS
Subjt:  ASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYPDLS

Query:  RMDIGEIEDVDGDWLA
         MDIGEIEDVDGDWLA
Subjt:  RMDIGEIEDVDGDWLA

XP_022981721.1 uncharacterized protein LOC111480786 [Cucurbita maxima]2.1e-8761.13Show/hide
Query:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF
        DQLCNFEA + PQPQ           KK+VRRRR++RRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R 
Subjt:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF

Query:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS-SFYSLSVL-PPSYICPSVSYAA-THQEVPKSISLSEEEG
        KSRRNPRIYP C FY +NGSDF+APP V+QSL+ D   +++       DTSSVVCN++++ SFYSLS L P SYICP+  YAA TH+EVPKSISLSEEEG
Subjt:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS-SFYSLSVL-PPSYICPSVSYAA-THQEVPKSISLSEEEG

Query:  KLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYP
        +LMASDLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFL      Q   EDY+Q P
Subjt:  KLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYP

Query:  DLSRMDIGEIEDVDGDWLA
        DLS MDIGEIEDVDGDWLA
Subjt:  DLSRMDIGEIEDVDGDWLA

TrEMBL top hitse value%identityAlignment
A0A1S4DZY0 uncharacterized protein LOC1034937171.6e-6154.19Show/hide
Query:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSLEFCP----CFEPEGRFKSRRN
        DQL NFEAA    +  +  KK+VRRRR S RRLY ++PL+MAEARREIVTALKLHRA STKE A++QQQKQDQ+ KQS    P    CFE EGR KS+RN
Subjt:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSLEFCP----CFEPEGRFKSRRN

Query:  PRIYPG----CFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAAT-HQEVPKSISLSEEEGK
        PRIYP     C FY+ENGS FVAPP   ++LN       FDDDFK++DT         SSF SLS  PP  SYICP+VS   T HQE PKS+SL EEEG 
Subjt:  PRIYPG----CFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAAT-HQEVPKSISLSEEEGK

Query:  LMASDLFWSNNVPTGETEKEIQ------------------------GAVKNDINHSFNKAMEFPDWLSVNDDFLQH------GEEDYIQYPDLSRMDIGE
        LMASD+FW NN PTG  EK++Q                         A++ D +HS + AM FPDW+S+NDD LQ        EED +Q PDLS  DIG+
Subjt:  LMASDLFWSNNVPTGETEKEIQ------------------------GAVKNDINHSFNKAMEFPDWLSVNDDFLQH------GEEDYIQYPDLSRMDIGE

Query:  IEDVDGDWLA
        IED+  +WLA
Subjt:  IEDVDGDWLA

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X21.6e-6154.19Show/hide
Query:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSLEFCP----CFEPEGRFKSRRN
        DQL NFEAA    +  +  KK+VRRRR S RRLY ++PL+MAEARREIVTALKLHRA STKE A++QQQKQDQ+ KQS    P    CFE EGR KS+RN
Subjt:  DQLCNFEAA----QNPQPQKKRVRRRRQS-RRLYNQIPLNMAEARREIVTALKLHRA-STKE-AKQQQQKQDQQIKQSLEFCP----CFEPEGRFKSRRN

Query:  PRIYPG----CFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAAT-HQEVPKSISLSEEEGK
        PRIYP     C FY+ENGS FVAPP   ++LN       FDDDFK++DT         SSF SLS  PP  SYICP+VS   T HQE PKS+SL EEEG 
Subjt:  PRIYPG----CFFYMENGSDFVAPPLVSQSLN-------FDDDFKSMDTSSVVCNDSHSSFYSLSVLPP--SYICPSVSYAAT-HQEVPKSISLSEEEGK

Query:  LMASDLFWSNNVPTGETEKEIQ------------------------GAVKNDINHSFNKAMEFPDWLSVNDDFLQH------GEEDYIQYPDLSRMDIGE
        LMASD+FW NN PTG  EK++Q                         A++ D +HS + AM FPDW+S+NDD LQ        EED +Q PDLS  DIG+
Subjt:  LMASDLFWSNNVPTGETEKEIQ------------------------GAVKNDINHSFNKAMEFPDWLSVNDDFLQH------GEEDYIQYPDLSRMDIGE

Query:  IEDVDGDWLA
        IED+  +WLA
Subjt:  IEDVDGDWLA

A0A6J1BUG5 uncharacterized protein LOC1110054593.4e-5953.56Show/hide
Query:  QKKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSLEFCPCFEPEGRFKSRRNPRIYPGCFFYMENGSDF--------
        QKK+VRRRRQSRRLY + PLNMAEARREI TALKLHRAST+E +QQ QKQ           P FEPEGR KSRRNPRIYPGC  Y++N SDF        
Subjt:  QKKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSLEFCPCFEPEGRFKSRRNPRIYPGCFFYMENGSDF--------

Query:  ---------VAPPLVSQSLNFDD------------DFKSMDTSSVVCNDSHSSFYSLSVL-PPSYICPSVSYAATHQEVPKSISLSEEEGKLMASDLFWS
                 V P  +SQ+LN +             D  ++DT+SVV N+SH SF SLS L P SY+CP VS AAT QEV +S++LS E GKL+AS     
Subjt:  ---------VAPPLVSQSLNFDD------------DFKSMDTSSVVCNDSHSSFYSLSVL-PPSYICPSVSYAATHQEVPKSISLSEEEGKLMASDLFWS

Query:  NNVPTGETEKEIQGAVKN-------------------DINHSFNKAMEFPDWLSVNDDFLQ-----HGEEDYIQYPDLSRMDIGEIEDVDGDWLA
        +N P+G   K+ QGAV+                    D +HSF+KA+EFPDWLS+NDDFLQ     H  EDY+Q PDLS M+IGEIEDVDGDWLA
Subjt:  NNVPTGETEKEIQGAVKN-------------------DINHSFNKAMEFPDWLSVNDDFLQ-----HGEEDYIQYPDLSRMDIGEIEDVDGDWLA

A0A6J1FRD8 uncharacterized protein LOC1114462252.8e-9063.29Show/hide
Query:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR
        DQLCNFEA + PQPQ       KK+VRRRRQSRRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R KSRR
Subjt:  DQLCNFEAAQNPQPQ-------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRFKSRR

Query:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVC--NDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLM
        NPRIYP C FY ENGSDF+APP V+QSL+ D   +++       DTSSVVC  N+++ SFYSLS LPP SYICP+  YAA THQEVPKSISLSEEEG+LM
Subjt:  NPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVC--NDSHSSFYSLSVLPP-SYICPSVSYAA-THQEVPKSISLSEEEGKLM

Query:  ASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYPDLS
        ASDLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFL      Q   EDY+Q PDLS
Subjt:  ASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYPDLS

Query:  RMDIGEIEDVDGDWLA
         MDIGEIEDVDGDWLA
Subjt:  RMDIGEIEDVDGDWLA

A0A6J1IXC1 uncharacterized protein LOC1114807861.0e-8761.13Show/hide
Query:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF
        DQLCNFEA + PQPQ           KK+VRRRR++RRLY Q+PLNMAEARREIVTALKLHRASTKEAK+QQQKQDQQIK SL     +F PCFEPE R 
Subjt:  DQLCNFEAAQNPQPQ-----------KKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSL-----EFCPCFEPEGRF

Query:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS-SFYSLSVL-PPSYICPSVSYAA-THQEVPKSISLSEEEG
        KSRRNPRIYP C FY +NGSDF+APP V+QSL+ D   +++       DTSSVVCN++++ SFYSLS L P SYICP+  YAA TH+EVPKSISLSEEEG
Subjt:  KSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSM-------DTSSVVCNDSHS-SFYSLSVL-PPSYICPSVSYAA-THQEVPKSISLSEEEG

Query:  KLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYP
        +LMASDLFWSNN PTGE+EKEI GAV+ +                                      +AMEFPDWLS+NDDFL      Q   EDY+Q P
Subjt:  KLMASDLFWSNNVPTGETEKEIQGAVKNDINH--------------------------------SFNKAMEFPDWLSVNDDFL------QHGEEDYIQYP

Query:  DLSRMDIGEIEDVDGDWLA
        DLS MDIGEIEDVDGDWLA
Subjt:  DLSRMDIGEIEDVDGDWLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein2.4e-1736.23Show/hide
Query:  QPQKKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSLEFCPCFEPEGRFKSRRNPRIYPGCFFYMENGSDFVAP--P
        Q  KK+VRRR  + R Y +  LNMAEARREIVTALK HRAS ++A +    Q     Q L     F P         P   P  F +     +F+ P  P
Subjt:  QPQKKRVRRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSLEFCPCFEPEGRFKSRRNPRIYPGCFFYMENGSDFVAP--P

Query:  LVSQSLNFDDDFKSMDTSSVVCNDSHSSFYSLS-----VLPPSYICPS---VSYAATHQEVPKSISLSEEEGKLMASDLFWSNNV-----PTGETEKEIQ
        L   +LNF D    + TSS   + S SS  S S       P  Y  PS       AT    P+  S S  E  ++ S  +WS  +     P  + E E  
Subjt:  LVSQSLNFDDDFKSMDTSSVVCNDSHSSFYSLS-----VLPPSYICPS---VSYAATHQEVPKSISLSEEEGKLMASDLFWSNNV-----PTGETEKEIQ

Query:  GAVKNDINHSFNKAMEFPDWLSVNDDFLQHGEEDYIQY------PDLSRMDIGEIEDVDG-DWLA
          V++D+   F+  MEFP WL+  ++ L H       Y      P LS M+IGEIE +DG DWLA
Subjt:  GAVKNDINHSFNKAMEFPDWLSVNDDFLQHGEEDYIQY------PDLSRMDIGEIEDVDG-DWLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAGAAGAAGCGAGAGAGCAAGGGCGGTGAAGAATATGGGAGAGTGGTGGGCGAGATCAGAATCAGATAATCCAATCATTTGTTTGATCTTTCTTCCAGTTCC
ATTGCACCTGTTTCTCTGCCCTATGCGGCGCCCATGGATTTTTGGAAAATGTGGCGCCTCTCTCTCTCTCTCTCTCTTTCTCTGCCTTTTGCCTTTACCACAATCCGAAG
AGCCTTCTGAGCCAATGACGTCGGCCATTGCAGCAAAATCTCATACAGATCAAGACCAACTCTGCAACTTTGAAGCTGCTCAAAATCCACAACCACAGAAGAAACGAGTT
AGAAGGAGACGTCAAAGCCGGCGGCTTTACAATCAAATCCCTCTCAATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTCAAGCTCCACAGAGCATCAACTAAAGA
AGCCAAACAACAGCAACAAAAACAGGACCAACAGATTAAACAATCATTGGAATTCTGTCCTTGTTTTGAACCTGAAGGAAGATTCAAATCCAGGAGAAATCCCAGGATAT
ACCCTGGTTGTTTCTTTTATATGGAAAATGGGTCTGATTTTGTTGCTCCTCCACTCGTTTCTCAGAGTCTCAATTTTGATGATGATTTCAAATCTATGGATACTAGTTCA
GTTGTTTGTAACGACAGCCATTCTTCATTTTATTCACTTTCAGTCTTGCCCCCTTCATATATTTGTCCCTCTGTTTCTTATGCTGCTACTCATCAGGAAGTTCCTAAATC
AATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATCTGTTTTGGTCCAATAATGTTCCAACTGGAGAAACTGAAAAAGAGATTCAAGGGGCAGTGAAGAATG
ATATTAACCATAGTTTTAATAAAGCTATGGAATTTCCAGATTGGTTGAGTGTAAATGATGACTTTTTGCAGCATGGAGAGGAGGATTACATTCAATATCCTGACCTGTCT
CGCATGGATATTGGGGAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
mRNA sequenceShow/hide mRNA sequence
TGAAATACGTTTTCAAAAAACAAGCTAAGTTTTCAAAATTGAGTCTCCGTTTGATAACCATTTGGTTTTTAATTTTTAGTTTTTGAAAATTAAAAAAAGTAGTTTTCGAA
AACTTGATTTTGTTTTTAAAATTTAGATAGAAACTTAAAAGAAAAACTTGGGTGTTGCTTTCCAAACTTAAATGGTGTCTCAAATAAGATGAAAATCATGGTAGGGAAAT
AAAGAGAAATTATGAAAAAATAAGTATAAATTTTAAAAATAAAAATAAAAATTATCAAACGGAGCCTTAACAAATAGATTTTAAAACTTTTTGTTTTTAGAAGTTATGAC
AAAATTCATGATAAAGAAATTATTGGAAAGCAAATATAAATTTCAAGAAACAATAAAAAAAAGAGTCTTAGTAATAAAAATATGAAAAGTAGTATAATTGCATGTAGTTG
TTTTCACGAGATTAGAGGTTGAAATTCTTATACTACGAATTTCGTATACTAAAACAAAAAAAAGAAGGGAATGGAAGAGAGAAGAAGCGAGAGAGCAAGGGCGGTGAAGA
ATATGGGAGAGTGGTGGGCGAGATCAGAATCAGATAATCCAATCATTTGTTTGATCTTTCTTCCAGTTCCATTGCACCTGTTTCTCTGCCCTATGCGGCGCCCATGGATT
TTTGGAAAATGTGGCGCCTCTCTCTCTCTCTCTCTCTTTCTCTGCCTTTTGCCTTTACCACAATCCGAAGAGCCTTCTGAGCCAATGACGTCGGCCATTGCAGCAAAATC
TCATACAGATCAAGACCAACTCTGCAACTTTGAAGCTGCTCAAAATCCACAACCACAGAAGAAACGAGTTAGAAGGAGACGTCAAAGCCGGCGGCTTTACAATCAAATCC
CTCTCAATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTCAAGCTCCACAGAGCATCAACTAAAGAAGCCAAACAACAGCAACAAAAACAGGACCAACAGATTAAA
CAATCATTGGAATTCTGTCCTTGTTTTGAACCTGAAGGAAGATTCAAATCCAGGAGAAATCCCAGGATATACCCTGGTTGTTTCTTTTATATGGAAAATGGGTCTGATTT
TGTTGCTCCTCCACTCGTTTCTCAGAGTCTCAATTTTGATGATGATTTCAAATCTATGGATACTAGTTCAGTTGTTTGTAACGACAGCCATTCTTCATTTTATTCACTTT
CAGTCTTGCCCCCTTCATATATTTGTCCCTCTGTTTCTTATGCTGCTACTCATCAGGAAGTTCCTAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCT
GATCTGTTTTGGTCCAATAATGTTCCAACTGGAGAAACTGAAAAAGAGATTCAAGGGGCAGTGAAGAATGATATTAACCATAGTTTTAATAAAGCTATGGAATTTCCAGA
TTGGTTGAGTGTAAATGATGACTTTTTGCAGCATGGAGAGGAGGATTACATTCAATATCCTGACCTGTCTCGCATGGATATTGGGGAGATTGAAGATGTGGATGGAGATT
GGTTAGCATGATTGATTCTTGGATTTATCTCATACCACGCTCTTGAATAATCAGAAAAGAAATGAAGAATGTTTTTCCCAAACGTGATTCCCTCCCCCCACTTACTTAAT
CATCATTTTTGCTATTAGCTTTTAGTA
Protein sequenceShow/hide protein sequence
MEERRSERARAVKNMGEWWARSESDNPIICLIFLPVPLHLFLCPMRRPWIFGKCGASLSLSLFLCLLPLPQSEEPSEPMTSAIAAKSHTDQDQLCNFEAAQNPQPQKKRV
RRRRQSRRLYNQIPLNMAEARREIVTALKLHRASTKEAKQQQQKQDQQIKQSLEFCPCFEPEGRFKSRRNPRIYPGCFFYMENGSDFVAPPLVSQSLNFDDDFKSMDTSS
VVCNDSHSSFYSLSVLPPSYICPSVSYAATHQEVPKSISLSEEEGKLMASDLFWSNNVPTGETEKEIQGAVKNDINHSFNKAMEFPDWLSVNDDFLQHGEEDYIQYPDLS
RMDIGEIEDVDGDWLA