; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022102 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022102
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function, DUF601
Genome locationchr7:18363516..18366332
RNA-Seq ExpressionLag0022102
SyntenyLag0022102
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG53676.1 hypothetical protein EZV62_018932 [Acer yangbiense]1.8e-1829.77Show/hide
Query:  ASEGSVTSPDVEESYSDDGP-------SSSGCFVDPEISDSSDGEPPTHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPP-DGEVA
        A E S+   D   S +++         S+  C    E S S D      S      + S++T   +E  R KY IP+++ LRLP   +   +PP + EVA
Subjt:  ASEGSVTSPDVEESYSDDGP-------SSSGCFVDPEISDSSDGEPPTHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPP-DGEVA

Query:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYAS--TKKGTLISGPTSVKKWK
           A F+FGV LP   FL+  L     APAQL PN W  LIG + +W       L T  +F++L+ +   P +   +Y ++   K+  +   P+S K WK
Subjt:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYAS--TKKGTLISGPTSVKKWK

Query:  NGWFFVSGNWLERTEDGCF-FGVPMRFGEYV---------------PRNVRRS---PTAKKFAKYVLTLEKINRHGPFLVDQSVLEASGLARRRTISSE
        N WFF SG+W  +  +  F   +P RF   V                RN++ +   P   +  K +LT E + RH  F +      + G  ++R I  E
Subjt:  NGWFFVSGNWLERTEDGCF-FGVPMRFGEYV---------------PRNVRRS---PTAKKFAKYVLTLEKINRHGPFLVDQSVLEASGLARRRTISSE

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.4e-1838.51Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       FY  + K  G ++ GPTS+K W  
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN

Query:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFAKYVLTLEKINRH---GPFLVDQSVLEASGL
         WF+ SG WL + E G  FF VP RFG  V        T   F       E+  R    G  + D+ +LE SGL
Subjt:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFAKYVLTLEKINRH---GPFLVDQSVLEASGL

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]2.4e-1838.51Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       FY  + K  G ++ GPTS+K W  
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN

Query:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFAKYVLTLEKINRH---GPFLVDQSVLEASGL
         WF+ SG WL + E G  FF VP RFG  V        T   F       E+  R    G  + D+ +LE SGL
Subjt:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFAKYVLTLEKINRH---GPFLVDQSVLEASGL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.6e-2837.18Show/hide
Query:  DSSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL + E G  FF VP RFG  V        T 
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  KKFAKYVLTLEKINRH---GPFLVDQSVLEASGL
          F       E+  R    G  + D+ +LE SGL
Subjt:  KKFAKYVLTLEKINRH---GPFLVDQSVLEASGL

XP_034205351.1 uncharacterized protein LOC117619489 [Prunus dulcis]1.8e-1836.48Show/hide
Query:  LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMT
        + + +T   LE LR +Y IPD + LRL   DE    PPDG V  +   FK G RLPL  ++  FL   GLAP Q+ PN +   +  + LW   G G   +
Subjt:  LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMT

Query:  VDDFLSLHTINRNPAFGDLFYYAS--TKKGTLISGPTSVKKWKNGWFFVSGNWLERTED
           F + + + ++P     +Y  +    +G LI+  +S K WKN +FF SGNW  R  D
Subjt:  VDDFLSLHTINRNPAFGDLFYYAS--TKKGTLISGPTSVKKWKNGWFFVSGNWLERTED

TrEMBL top hitse value%identityAlignment
A0A2N9FHJ6 Uncharacterized protein7.7e-2328.96Show/hide
Query:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL
        D  PR C ++ +   +  LSF   AT+      MA EGS    V S D+ E  SD    +      P IS SS+G  P    T    LS  + AD ++  
Subjt:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  V   +G      +RR    +     VL  L     H    +   +
Subjt:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV

Query:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDYGSG
        L +           +S +E+  + M  ++  R + +  + +        +DEAP VT    +R A  + +     RS +    P+   P  ++PE   + 
Subjt:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDYGSG

Query:  GNEV
          EV
Subjt:  GNEV

A0A2N9H8T4 Uncharacterized protein5.0e-2229.68Show/hide
Query:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL
        D  P  C ++ +   +  LSF   AT+      MA EGS    V S D+ E  SD    +      P IS SS+G  P    T    LS  + AD ++  
Subjt:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  V   +G      +RR    +     VL  L     H    +   +
Subjt:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV

Query:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVT-----VETSRRPAAATR
        L +           +S +E+  + M  ++  R + +  + +        +DEAP VT      ETS + A   R
Subjt:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVT-----VETSRRPAAATR

A0A2N9HSS2 Uncharacterized protein1.0e-2228.99Show/hide
Query:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL
        D  PR C ++ +   +  LSF   AT+      MA EGS    V S D+ E  SD    +      P IS SS+G  P    T    LS  + AD ++  
Subjt:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAK----KFAKYVLTLEKINRHGPFLVD
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  +P+R    VP +  RS   K       + +  L     H    + 
Subjt:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAK----KFAKYVLTLEKINRHGPFLVD

Query:  QSVLEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDY
          +L +           +S +E+  + M  ++  R + +  + +        +DEAP VT    +R A  + +     RS +    P+   P  ++PE  
Subjt:  QSVLEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDY

Query:  GSGGNEV
         +   EV
Subjt:  GSGGNEV

A0A2N9IMR5 Uncharacterized protein7.7e-2328.96Show/hide
Query:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL
        D  PR C ++ +   +  LSF   AT+      MA EGS    V S D+ E  SD    +      P IS SS+G  P    T    LS  + AD ++  
Subjt:  DGTPRSCHVKLLVRAILLLSF---ATY------MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPP----THSSDLSSSLTADRLEFL

Query:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR
        RRKY IP+DV LR+P +DE   +   G+VAFY A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   +
Subjt:  RRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINR

Query:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV
         PA    +  +  ++G   ++  P+S ++WK+ + FV G NW  L+  +D  F  V   +G      +RR    +     VL  L     H    +   +
Subjt:  NPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVL-TLEKINRHGPFLVDQSV

Query:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDYGSG
        L +           +S +E+  + M  ++  R + +  + +        +DEAP VT    +R A  + +     RS +    P+   P  ++PE   + 
Subjt:  LEASGLA---RRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDYGSG

Query:  GNEV
          EV
Subjt:  GNEV

A0A6J1DXS5 uncharacterized protein LOC1110255024.6e-2837.18Show/hide
Query:  DSSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL + E G  FF VP RFG  V        T 
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  KKFAKYVLTLEKINRH---GPFLVDQSVLEASGL
          F       E+  R    G  + D+ +LE SGL
Subjt:  KKFAKYVLTLEKINRH---GPFLVDQSVLEASGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G46696.1 Protein of unknown function, DUF6012.7e-0434.83Show/hide
Query:  DGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN
        DG+   + + LS   T+ RL  LR  + IP  + L  P      ENPP G    +   F   G+  PLP  L D +   G+A  QL PN
Subjt:  DGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN

AT1G51172.1 unknown protein1.8e-0826.47Show/hide
Query:  SSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        S DG+   + + LS   T  RL  LR  + IP  + L  P    + E+PP G    +   F + G+  PLP  L D +   G+A  QL PN    ++   
Subjt:  SSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--SGPTSVKKWKNGWFFVSGNWLERTEDGCFFGVPMRFGEYVPRNVRRS--PTAKK
        TL      G  + + DFL L+ + ++    +  ++ S +KG  +    P   + W+  +FF   N L   E    F       ++  R V+ +  P ++ 
Subjt:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--SGPTSVKKWKNGWFFVSGNWLERTEDGCFFGVPMRFGEYVPRNVRRS--PTAKK

Query:  FAKY
        FA +
Subjt:  FAKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGACACCAACAATGAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAAGGTTTTCCCCTTTC
CGCGTCGTCCACCGCACGTAGCCTCCTGCTACTGGGCCCAAGGAGATTGACCCGGGCAACCCAAGCGGCCAGTAGGGTTAATTGGCCTGGTCGGCCTCGGCATTGGGCCG
AGGCCGACCAGACCCTCGGCCTCGGCATTGGGCTGAGGTCGAGGTGTCATTCCACCTCTTGCGGGCTTTCATCTCCTCGAGTCGATCTCAGCTGGTTGGAGAGCATAACT
CAATTTTGGCAGTTCATCAGAGAATGCGCAACAAGTTCGGATTCACCCTCGGAGTTTGTCAAATGGGAGTTCTTATCAAACCCCTCTAGTTGGACAAATATCAGAGTGGT
TGGATTGCAATTGAGTCAATCTTGTTGGGGCGGTGAGGTTAGTAGAAAAGGGGAAAGAGACAAATGCTACAAAAGAGAGAGGAGAGAGAAAGAGACGTGGACCGAGGCTC
ATTTGCCTCGGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGCCGACGTCACTTGAC
GGGACCCCACGGTCCTGTCATATCAAGGAGAGGAGAGAGAAAAATACATGGACCGAGGCTCATCTGCCTCGGTCTGGCGTGTCAGGCTGCGCACCTGACACGTCTGGCGT
GCCTGGTTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGCCGATGTCATCTGACGGGACCCCACGGTCCTGTCATGTCAAGCTTCTTGTGAGAGCTATCCTTT
TACTTTCTTTTGCTACGTATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGAATCTTATTCCGATGACGGTCCTTCAAGCTCGGGCTGCTTTGTGGACCCG
GAGATTTCGGATAGCAGTGATGGGGAGCCCCCTACACACTCATCGGACTTATCATCCTCGTTGACCGCAGACCGCTTAGAGTTCTTGCGGCGCAAGTATGATATTCCCGA
CGATGTGCATCTTCGGCTCCCCAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAGGTTGCTTTTTACCATGCCATGTTTAAGTTTGGGGTTCGCTTGCCGCTGC
CATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAACGGGTGGTGCCACCTCATCGGTTGCTTCACCCTTTGGGCGATGCACGGT
GGGGGATCTCTAATGACCGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCGCTTTTGGTGACCTTTTTTATTACGCAAGTACCAAAAAAGGCACCTTAAT
CAGCGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTTGTTAGTGGCAATTGGCTTGAAAGAACGGAAGACGGTTGTTTTTTCGGGGTTCCAATGAGGTTTG
GAGAATATGTGCCTCGCAACGTTCGACGCTCCCCGACCGCTAAGAAGTTTGCCAAATACGTCCTGACCCTCGAAAAGATTAACCGCCACGGTCCCTTTTTAGTCGATCAA
AGTGTCCTCGAAGCGTCTGGGCTAGCCAGGCGCCGCACCATCAGTTCTGAAGAAATGGCCTTCCGTGGAATGTACGATTCTCAGTGGAAAAGACGTGAAGCACGTAACAG
GGTTGGAACCTCCCGGGCCTCTGTGGACCTAACCGAGGATGAGGCTCCACGGGTTACCGTCGAGACCTCTCGTCGTCCTGCAGCTGCTACCCGCAGAACCCGGTATCAGA
CGCGCTCCTCGGTCACCGAGACAGATCCTAGCACAGGCATCCCGGTCTTTGCCCTTCCCGAGGACTACGGGAGCGGCGGCAATGAGGTAGAGGTCCTAACCCAAAACTTC
ATGTGCTGGCAAGGGTTGCCCTTCCCGAGGACTTTTCGACAAGTTGATGCACCGGGGCACGCCCTAGGAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGACACCAACAATGAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAAGGTTTTCCCCTTTC
CGCGTCGTCCACCGCACGTAGCCTCCTGCTACTGGGCCCAAGGAGATTGACCCGGGCAACCCAAGCGGCCAGTAGGGTTAATTGGCCTGGTCGGCCTCGGCATTGGGCCG
AGGCCGACCAGACCCTCGGCCTCGGCATTGGGCTGAGGTCGAGGTGTCATTCCACCTCTTGCGGGCTTTCATCTCCTCGAGTCGATCTCAGCTGGTTGGAGAGCATAACT
CAATTTTGGCAGTTCATCAGAGAATGCGCAACAAGTTCGGATTCACCCTCGGAGTTTGTCAAATGGGAGTTCTTATCAAACCCCTCTAGTTGGACAAATATCAGAGTGGT
TGGATTGCAATTGAGTCAATCTTGTTGGGGCGGTGAGGTTAGTAGAAAAGGGGAAAGAGACAAATGCTACAAAAGAGAGAGGAGAGAGAAAGAGACGTGGACCGAGGCTC
ATTTGCCTCGGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGTCTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGCCGACGTCACTTGAC
GGGACCCCACGGTCCTGTCATATCAAGGAGAGGAGAGAGAAAAATACATGGACCGAGGCTCATCTGCCTCGGTCTGGCGTGTCAGGCTGCGCACCTGACACGTCTGGCGT
GCCTGGTTGGCTGCAGGCACGCCTCGGCCTCGGCAAAAAGCCGAGGCCGATGTCATCTGACGGGACCCCACGGTCCTGTCATGTCAAGCTTCTTGTGAGAGCTATCCTTT
TACTTTCTTTTGCTACGTATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGAATCTTATTCCGATGACGGTCCTTCAAGCTCGGGCTGCTTTGTGGACCCG
GAGATTTCGGATAGCAGTGATGGGGAGCCCCCTACACACTCATCGGACTTATCATCCTCGTTGACCGCAGACCGCTTAGAGTTCTTGCGGCGCAAGTATGATATTCCCGA
CGATGTGCATCTTCGGCTCCCCAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAGGTTGCTTTTTACCATGCCATGTTTAAGTTTGGGGTTCGCTTGCCGCTGC
CATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAACGGGTGGTGCCACCTCATCGGTTGCTTCACCCTTTGGGCGATGCACGGT
GGGGGATCTCTAATGACCGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCGCTTTTGGTGACCTTTTTTATTACGCAAGTACCAAAAAAGGCACCTTAAT
CAGCGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTTGTTAGTGGCAATTGGCTTGAAAGAACGGAAGACGGTTGTTTTTTCGGGGTTCCAATGAGGTTTG
GAGAATATGTGCCTCGCAACGTTCGACGCTCCCCGACCGCTAAGAAGTTTGCCAAATACGTCCTGACCCTCGAAAAGATTAACCGCCACGGTCCCTTTTTAGTCGATCAA
AGTGTCCTCGAAGCGTCTGGGCTAGCCAGGCGCCGCACCATCAGTTCTGAAGAAATGGCCTTCCGTGGAATGTACGATTCTCAGTGGAAAAGACGTGAAGCACGTAACAG
GGTTGGAACCTCCCGGGCCTCTGTGGACCTAACCGAGGATGAGGCTCCACGGGTTACCGTCGAGACCTCTCGTCGTCCTGCAGCTGCTACCCGCAGAACCCGGTATCAGA
CGCGCTCCTCGGTCACCGAGACAGATCCTAGCACAGGCATCCCGGTCTTTGCCCTTCCCGAGGACTACGGGAGCGGCGGCAATGAGGTAGAGGTCCTAACCCAAAACTTC
ATGTGCTGGCAAGGGTTGCCCTTCCCGAGGACTTTTCGACAAGTTGATGCACCGGGGCACGCCCTAGGAGTATAG
Protein sequenceShow/hide protein sequence
MPKSGQALTCPSRDTNNEFVEIWRRRDLKRKKGFPLSASSTARSLLLLGPRRLTRATQAASRVNWPGRPRHWAEADQTLGLGIGLRSRCHSTSCGLSSPRVDLSWLESIT
QFWQFIRECATSSDSPSEFVKWEFLSNPSSWTNIRVVGLQLSQSCWGGEVSRKGERDKCYKRERREKETWTEAHLPRSGVSGGAPDTSGVPVWLQARLGLGKKPRPTSLD
GTPRSCHIKERREKNTWTEAHLPRSGVSGCAPDTSGVPGWLQARLGLGKKPRPMSSDGTPRSCHVKLLVRAILLLSFATYMASEGSVTSPDVEESYSDDGPSSSGCFVDP
EISDSSDGEPPTHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHG
GGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLISGPTSVKKWKNGWFFVSGNWLERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFAKYVLTLEKINRHGPFLVDQ
SVLEASGLARRRTISSEEMAFRGMYDSQWKRREARNRVGTSRASVDLTEDEAPRVTVETSRRPAAATRRTRYQTRSSVTETDPSTGIPVFALPEDYGSGGNEVEVLTQNF
MCWQGLPFPRTFRQVDAPGHALGV