; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036311 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036311
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPMD domain-containing protein
Genome locationchr3:43798750..43803154
RNA-Seq ExpressionLag0036311
SyntenyLag0036311
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025968.1 hypothetical protein E6C27_scaffold34G002830 [Cucumis melo var. makuwa]9.1e-3734.05Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +R C+L  RCG+  +I +YSP RF  QFGFYQDIPN LG   P     N+L  WRICT+R TLS++YL A + +P   VTQR+  WW  KH  Y E+ I 
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPK----RLSSSAIIIGKGLKSPINHQY---------AKMNFLMEFP-----AHHNFLNSLHHCPPSFVSPDVFDSIAARVGNSKA
         LV SA P PS+P LPK     L    I + + + + +  +           + +  +E P     AH   L  L +   S   P V DS    +G SK 
Subjt:  KLVDSASPLPSKPKLPK----RLSSSAIIIGKGLKSPINHQY---------AKMNFLMEFP-----AHHNFLNSLHHCPPSFVSPDVFDSIAARVGNSKA

Query:  PTDRVVIQSCHP--VIDEI-------------------------PEQKKTTTHAAAS-----------------EISNYCADDVISNYRKQSALALWESI
        P  +   QS  P  +++EI                         P QK ++THA                    E S +  ++V+SN+ +++AL +WE I
Subjt:  PTDRVVIQSCHP--VIDEI-------------------------PEQKKTTTHAAAS-----------------EISNYCADDVISNYRKQSALALWESI

Query:  HQKIIRTSFDKRPRLKQEAVKILHTI
          KI+RT F+  PRL+ E   +L  I
Subjt:  HQKIIRTSFDKRPRLKQEAVKILHTI

KAA0050733.1 uncharacterized protein E6C27_scaffold560G00370 [Cucumis melo var. makuwa]4.1e-5330.07Show/hide
Query:  MVYFTEHLDPEKNCLVILKD---------------------------KIN-LLKMAWSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ
        MVYFTE+ D  K CL+ILKD                           K N +L   WS+E+ ++ +S L +WFLE  +H+K+P ENPESTLG ++I D +
Subjt:  MVYFTEHLDPEKNCLVILKD---------------------------KIN-LLKMAWSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ

Query:  VRWGSP--------------------------------------------DREDEACPKVVSTFLLRTITLHL---------------------------
        +RWG+                                             DR  +        +   T TLH                            
Subjt:  VRWGSP--------------------------------------------DREDEACPKVVSTFLLRTITLHL---------------------------

Query:  -NNAIAFKFLS-----------NRGFLSGVFKVASLMAEGYVLSLAVLFWPIF---------IMDSTR--------FTVHL----LLLDFQRLVSLCTVS
         +      FLS              F   VFKVASLMAEGY  SLA+   P+          I DST         F +H     L L F       T S
Subjt:  -NNAIAFKFLS-----------NRGFLSGVFKVASLMAEGYVLSLAVLFWPIF---------IMDSTR--------FTVHL----LLLDFQRLVSLCTVS

Query:  IIRSCFLS----------SRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLA
        +   C +           +   + M I   +PCRF RQFGFYQD+P DL ++IP+ANLANV   W IC +  TLSQVYLPA    P   +T  Y++WWLA
Subjt:  IIRSCFLS----------SRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLA

Query:  KHGNYLEEGIQKLVDSASPLPSKPKLPKRL--SSSAIIIGKGLKSPINHQYAKMNFLMEFPAHHNFL--------------------------------N
        KHG+YL+EG+Q L+D  +P   K K  K++  +    I     K P++ Q+           H + L                                 
Subjt:  KHGNYLEEGIQKLVDSASPLPSKPKLPKRL--SSSAIIIGKGLKSPINHQYAKMNFLMEFPAHHNFL--------------------------------N

Query:  SLHHCPPSFVSPDV---FDSIAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPR
        S+    P    PDV        +   NSK P    ++ +C  V    P++   T     SEIS++CAD++IS+ R+++A+ LWES+ QKIIRT F++   
Subjt:  SLHHCPPSFVSPDV---FDSIAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPR

Query:  LKQEAVKILHTI
        L+ E  KI   I
Subjt:  LKQEAVKILHTI

XP_031739998.1 uncharacterized protein LOC116403342 [Cucumis sativus]3.3e-4736.21Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        IRSCFLSS+CGS  VIE YSPCRFSRQFGFYQD+P DLG+EIPEAN  NV   W IC +  TLSQVYLP  A  P T VT  Y+ WWLAKHG+YL+EG+Q
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRL--------------------------SSSAIIIGKGLKSPINH-------QYAKMNFLMEFPAH-----------------
         L+D  +P   K K  K++                           +  ++    L +   H          K N L     H                 
Subjt:  KLVDSASPLPSKPKLPKRL--------------------------SSSAIIIGKGLKSPINH-------QYAKMNFLMEFPAH-----------------

Query:  ------------------------HNF--------LNSLHHCPPSFVSPDVFDS--IAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISN
                                H+         + +L  C  + V P++ D   + +  GNSK P    ++ +C PVI   P++ + T     SEIS+
Subjt:  ------------------------HNF--------LNSLHHCPPSFVSPDVFDS--IAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISN

Query:  YCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI
        +CAD +IS+ R+Q+A+ LWE++ QKIIRT F++   L+ E  KI   I
Subjt:  YCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI

XP_031745312.1 uncharacterized protein LOC116405439 [Cucumis sativus]6.5e-4333.16Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        IRSCFLSS+CGS  VIE YSPCRFSRQFGFYQD+P DLG+EIPEAN  NV   W IC +  TLSQVYLP  A  P T VT  Y+ WWLAKHG+YL+EG+Q
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRL--------------------------SSSAIIIGKGLKSPINH-------QYAKMNFLM----------------------
         L+D  +P   K K  K++                           +  ++    L +   H          K N L                       
Subjt:  KLVDSASPLPSKPKLPKRL--------------------------SSSAIIIGKGLKSPINH-------QYAKMNFLM----------------------

Query:  --------------EFPAHHNFLNSLHHCPPSFVS---------------------------------------------PDVFDS--IAARVGNSKAPT
                      E P            P   VS                                             P++ D   + +  GNSK P 
Subjt:  --------------EFPAHHNFLNSLHHCPPSFVS---------------------------------------------PDVFDS--IAARVGNSKAPT

Query:  DRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI
           ++ +C PVI   P++ + T     SEIS++CAD +IS+ R+Q+A+ LWE++ QKIIRT F++   L+ E  KI   I
Subjt:  DRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI

XP_031745312.1 uncharacterized protein LOC116405439 [Cucumis sativus]2.0e-1241.9Show/hide
Query:  MVYFTEHLDPEKNCLVILKDKINLLKMA----------------------------WSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ
        MV+FTE+ D  K CL+ILKD    L+                              WS+ER ++ +SSLK+WFLE  +H+K P E+PESTLGRRII D +
Subjt:  MVYFTEHLDPEKNCLVILKDKINLLKMA----------------------------WSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ

Query:  VRWGS
        +RWG+
Subjt:  VRWGS

XP_031745312.1 uncharacterized protein LOC116405439 [Cucumis sativus]5.3e-3734.04Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +RSC+LSSRC +  +I +Y+P RF RQFGFYQD+PN +G   P   L N+L  WRICT+R TLS++YLPA + +P   VTQR+  WW  KHG Y E+   
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFPAHHNFLNSLHHCPPS
         LV S  P PS+P+LPK   S+  + GK +                              K P+       +H   +    +E P     L+ L+     
Subjt:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFPAHHNFLNSLHHCPPS

Query:  FVSPD---------VFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKT---------TTHAAAS--------EISNYCADDVISNYRKQSALALW
        F+ PD           DS    VG SK P ++   QS  P  +++EI  +K T         ++   AS        E S +  + ++SN+ +++AL +W
Subjt:  FVSPD---------VFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKT---------TTHAAAS--------EISNYCADDVISNYRKQSALALW

Query:  ESIHQKIIRTSFDKRPRLKQEAVKILHTI
        E I  KII+T F+  PRL+ E   +L  I
Subjt:  ESIHQKIIRTSFDKRPRLKQEAVKILHTI

TrEMBL top hitse value%identityAlignment
A0A5A7SIG6 PMD domain-containing protein4.4e-3734.05Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +R C+L  RCG+  +I +YSP RF  QFGFYQDIPN LG   P     N+L  WRICT+R TLS++YL A + +P   VTQR+  WW  KH  Y E+ I 
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPK----RLSSSAIIIGKGLKSPINHQY---------AKMNFLMEFP-----AHHNFLNSLHHCPPSFVSPDVFDSIAARVGNSKA
         LV SA P PS+P LPK     L    I + + + + +  +           + +  +E P     AH   L  L +   S   P V DS    +G SK 
Subjt:  KLVDSASPLPSKPKLPK----RLSSSAIIIGKGLKSPINHQY---------AKMNFLMEFP-----AHHNFLNSLHHCPPSFVSPDVFDSIAARVGNSKA

Query:  PTDRVVIQSCHP--VIDEI-------------------------PEQKKTTTHAAAS-----------------EISNYCADDVISNYRKQSALALWESI
        P  +   QS  P  +++EI                         P QK ++THA                    E S +  ++V+SN+ +++AL +WE I
Subjt:  PTDRVVIQSCHP--VIDEI-------------------------PEQKKTTTHAAAS-----------------EISNYCADDVISNYRKQSALALWESI

Query:  HQKIIRTSFDKRPRLKQEAVKILHTI
          KI+RT F+  PRL+ E   +L  I
Subjt:  HQKIIRTSFDKRPRLKQEAVKILHTI

A0A5A7TX42 Uncharacterized protein2.2e-3632.29Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +RSC+LSSRC +  +I +YSP RF RQFGFYQD+PND+G   P   L N+L  WRICT+R TLS++YLPAR+ +P   VTQR+  WW  KHG Y E+   
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFP------AHHNFLNSL
         LV SA P  S+P+LPK   S+  + GK +                              K P+       +H   +    +E P        ++ L  L
Subjt:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFP------AHHNFLNSL

Query:  --HHCPPSFVSPDVFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPE-------------------------QKKTTTHAAAS----------------
               S   P   DS    VG S+ P ++   QS  P  +++EI                           QK ++ HA                   
Subjt:  --HHCPPSFVSPDVFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPE-------------------------QKKTTTHAAAS----------------

Query:  -EISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI
         E S +  + V+SN+ +++AL +WE I  KI+RT F+  PRL+ E   +L  I
Subjt:  -EISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTI

A0A5A7U4C3 Uncharacterized protein2.0e-5330.07Show/hide
Query:  MVYFTEHLDPEKNCLVILKD---------------------------KIN-LLKMAWSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ
        MVYFTE+ D  K CL+ILKD                           K N +L   WS+E+ ++ +S L +WFLE  +H+K+P ENPESTLG ++I D +
Subjt:  MVYFTEHLDPEKNCLVILKD---------------------------KIN-LLKMAWSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQ

Query:  VRWGSP--------------------------------------------DREDEACPKVVSTFLLRTITLHL---------------------------
        +RWG+                                             DR  +        +   T TLH                            
Subjt:  VRWGSP--------------------------------------------DREDEACPKVVSTFLLRTITLHL---------------------------

Query:  -NNAIAFKFLS-----------NRGFLSGVFKVASLMAEGYVLSLAVLFWPIF---------IMDSTR--------FTVHL----LLLDFQRLVSLCTVS
         +      FLS              F   VFKVASLMAEGY  SLA+   P+          I DST         F +H     L L F       T S
Subjt:  -NNAIAFKFLS-----------NRGFLSGVFKVASLMAEGYVLSLAVLFWPIF---------IMDSTR--------FTVHL----LLLDFQRLVSLCTVS

Query:  IIRSCFLS----------SRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLA
        +   C +           +   + M I   +PCRF RQFGFYQD+P DL ++IP+ANLANV   W IC +  TLSQVYLPA    P   +T  Y++WWLA
Subjt:  IIRSCFLS----------SRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLA

Query:  KHGNYLEEGIQKLVDSASPLPSKPKLPKRL--SSSAIIIGKGLKSPINHQYAKMNFLMEFPAHHNFL--------------------------------N
        KHG+YL+EG+Q L+D  +P   K K  K++  +    I     K P++ Q+           H + L                                 
Subjt:  KHGNYLEEGIQKLVDSASPLPSKPKLPKRL--SSSAIIIGKGLKSPINHQYAKMNFLMEFPAHHNFL--------------------------------N

Query:  SLHHCPPSFVSPDV---FDSIAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPR
        S+    P    PDV        +   NSK P    ++ +C  V    P++   T     SEIS++CAD++IS+ R+++A+ LWES+ QKIIRT F++   
Subjt:  SLHHCPPSFVSPDV---FDSIAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPR

Query:  LKQEAVKILHTI
        L+ E  KI   I
Subjt:  LKQEAVKILHTI

A0A5A7U8L3 PMD domain-containing protein3.7e-3633.85Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +RSC+LSSRC +  +I +YS  RF RQFGFYQD+PND+G   P   L N+L   RICT+  TLS++YLPAR+ +P   VTQ++  WW  KHG Y E+   
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFP------AHHNFLNSL
         LV S  P PS+P+LPK   S+  + GK +                              K P+       +H   +    +E P        ++ L  L
Subjt:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFP------AHHNFLNSL

Query:  --HHCPPSFVSPDVFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKTTTH-----------AAASEISNYCADDVISNYRKQSALALWESIHQKI
               S   P   DS    VG SK   ++   QS HP  +++EI   K T              A + E S +  + V+SN+ +++AL +WE I  KI
Subjt:  --HHCPPSFVSPDVFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKTTTH-----------AAASEISNYCADDVISNYRKQSALALWESIHQKI

Query:  IRTSFDKRPRLKQEAVKILHTI
        +RT F+  PRL+ E   +   I
Subjt:  IRTSFDKRPRLKQEAVKILHTI

A0A5A7UGA8 Uncharacterized protein2.6e-3734.04Show/hide
Query:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ
        +RSC+LSSRC +  +I +Y+P RF RQFGFYQD+PN +G   P   L N+L  WRICT+R TLS++YLPA + +P   VTQR+  WW  KHG Y E+   
Subjt:  IRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWRICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQ

Query:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFPAHHNFLNSLHHCPPS
         LV S  P PS+P+LPK   S+  + GK +                              K P+       +H   +    +E P     L+ L+     
Subjt:  KLVDSASPLPSKPKLPKRLSSSAIIIGKGL------------------------------KSPI-------NHQYAKMNFLMEFPAHHNFLNSLHHCPPS

Query:  FVSPD---------VFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKT---------TTHAAAS--------EISNYCADDVISNYRKQSALALW
        F+ PD           DS    VG SK P ++   QS  P  +++EI  +K T         ++   AS        E S +  + ++SN+ +++AL +W
Subjt:  FVSPD---------VFDSIAARVGNSKAPTDRVVIQSCHP--VIDEIPEQKKT---------TTHAAAS--------EISNYCADDVISNYRKQSALALW

Query:  ESIHQKIIRTSFDKRPRLKQEAVKILHTI
        E I  KII+T F+  PRL+ E   +L  I
Subjt:  ESIHQKIIRTSFDKRPRLKQEAVKILHTI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGACAAGAGTCTCGACGGCAGGCGGAAGAAGGTCGTCGGCACGGTGGATGTGGGTCGTCGGCGCGATGCGAGAGGTAGAGATTGGGATACACTTTGTGTTGGGGG
GTGGAAAATGGTGGGAAATGGAGGATTAAGTCTTTCAAGGGGGAAATGGTGGGAAGGGAGGGTTGTGGAAGATGAAGATGGACGAACGGGCTGGTGCATGAAGGCGGTTC
TGGTGGGAGGGATTTTGGAACTAGCAAATTTATTTGACAATCTCTTGGGGCACAACATTAATCTTTGTCCACCATTTGAAAGAAAACTTATCTCCATTATGGTGTACTTT
ACCGAGCATCTTGATCCTGAGAAAAATTGCTTGGTCATCTTGAAGGATAAGATCAACCTGTTGAAAATGGCATGGTCAGAAGAAAGACTTTTGAGTCACGACTCTTCATT
GAAGGCGTGGTTTTTAGAGTATCCAGTGCATAGCAAGGTGCCGGGTGAAAATCCAGAATCAACTTTAGGTCGTCGAATTATTGGAGACGAACAAGTTCGTTGGGGTTCGC
CTGATCGAGAGGATGAAGCTTGTCCAAAAGTTGTGAGTACCTTTTTGCTGCGTACTATCACATTGCATCTCAACAACGCGATCGCCTTCAAATTCCTATCAAATCGTGGA
TTTCTTTCTGGAGTTTTTAAAGTCGCCAGTCTTATGGCTGAAGGTTACGTTTTGAGTCTTGCTGTCCTGTTTTGGCCAATATTTATCATGGACTCGACCAGATTCACAGT
GCATCTCCTTCTACTGGATTTTCAAAGGCTTGTTTCCCTTTGCACTGTGTCCATCATTCGGTCGTGTTTCTTGTCTTCACGCTGTGGATCGTTGATGGTTATTGAAGCTT
ATAGTCCTTGTCGATTTAGTCGACAATTTGGATTCTATCAAGACATACCGAATGACTTGGGTAAAGAAATTCCTGAGGCAAATTTAGCGAATGTGTTACGTCTTTGGAGG
ATATGCACTCAAAGAGAAACATTATCCCAAGTATACCTTCCTGCTCGTGCAACGAAACCACACACTCAAGTTACTCAACGCTACAGAAGTTGGTGGTTAGCCAAGCATGG
GAATTATCTTGAAGAAGGCATACAAAAGTTGGTGGACAGCGCTTCTCCTCTTCCCTCTAAGCCCAAGTTGCCTAAAAGGTTGAGCAGTAGCGCGATCATCATTGGAAAAG
GACTAAAAAGTCCAATCAACCATCAGTATGCGAAGATGAATTTTTTGATGGAGTTCCCAGCGCATCACAATTTCCTGAACTCCCTGCACCACTGTCCTCCTTCTTTTGTG
AGTCCAGACGTCTTTGACTCTATAGCTGCGCGTGTGGGCAATTCCAAAGCGCCAACGGATAGAGTCGTGATTCAATCTTGTCATCCTGTAATTGATGAAATTCCTGAACA
AAAGAAGACAACAACCCATGCAGCTGCGTCTGAGATCTCTAATTACTGTGCTGACGATGTTATTTCCAACTATAGGAAACAGTCTGCCCTTGCCCTGTGGGAGAGCATAC
ACCAGAAGATCATACGCACTTCGTTCGACAAAAGACCTCGCCTTAAACAAGAGGCAGTCAAAATACTTCATACAATTTTGAGATTCGTGTACCGGCTTGGATTCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGACAAGAGTCTCGACGGCAGGCGGAAGAAGGTCGTCGGCACGGTGGATGTGGGTCGTCGGCGCGATGCGAGAGGTAGAGATTGGGATACACTTTGTGTTGGGGG
GTGGAAAATGGTGGGAAATGGAGGATTAAGTCTTTCAAGGGGGAAATGGTGGGAAGGGAGGGTTGTGGAAGATGAAGATGGACGAACGGGCTGGTGCATGAAGGCGGTTC
TGGTGGGAGGGATTTTGGAACTAGCAAATTTATTTGACAATCTCTTGGGGCACAACATTAATCTTTGTCCACCATTTGAAAGAAAACTTATCTCCATTATGGTGTACTTT
ACCGAGCATCTTGATCCTGAGAAAAATTGCTTGGTCATCTTGAAGGATAAGATCAACCTGTTGAAAATGGCATGGTCAGAAGAAAGACTTTTGAGTCACGACTCTTCATT
GAAGGCGTGGTTTTTAGAGTATCCAGTGCATAGCAAGGTGCCGGGTGAAAATCCAGAATCAACTTTAGGTCGTCGAATTATTGGAGACGAACAAGTTCGTTGGGGTTCGC
CTGATCGAGAGGATGAAGCTTGTCCAAAAGTTGTGAGTACCTTTTTGCTGCGTACTATCACATTGCATCTCAACAACGCGATCGCCTTCAAATTCCTATCAAATCGTGGA
TTTCTTTCTGGAGTTTTTAAAGTCGCCAGTCTTATGGCTGAAGGTTACGTTTTGAGTCTTGCTGTCCTGTTTTGGCCAATATTTATCATGGACTCGACCAGATTCACAGT
GCATCTCCTTCTACTGGATTTTCAAAGGCTTGTTTCCCTTTGCACTGTGTCCATCATTCGGTCGTGTTTCTTGTCTTCACGCTGTGGATCGTTGATGGTTATTGAAGCTT
ATAGTCCTTGTCGATTTAGTCGACAATTTGGATTCTATCAAGACATACCGAATGACTTGGGTAAAGAAATTCCTGAGGCAAATTTAGCGAATGTGTTACGTCTTTGGAGG
ATATGCACTCAAAGAGAAACATTATCCCAAGTATACCTTCCTGCTCGTGCAACGAAACCACACACTCAAGTTACTCAACGCTACAGAAGTTGGTGGTTAGCCAAGCATGG
GAATTATCTTGAAGAAGGCATACAAAAGTTGGTGGACAGCGCTTCTCCTCTTCCCTCTAAGCCCAAGTTGCCTAAAAGGTTGAGCAGTAGCGCGATCATCATTGGAAAAG
GACTAAAAAGTCCAATCAACCATCAGTATGCGAAGATGAATTTTTTGATGGAGTTCCCAGCGCATCACAATTTCCTGAACTCCCTGCACCACTGTCCTCCTTCTTTTGTG
AGTCCAGACGTCTTTGACTCTATAGCTGCGCGTGTGGGCAATTCCAAAGCGCCAACGGATAGAGTCGTGATTCAATCTTGTCATCCTGTAATTGATGAAATTCCTGAACA
AAAGAAGACAACAACCCATGCAGCTGCGTCTGAGATCTCTAATTACTGTGCTGACGATGTTATTTCCAACTATAGGAAACAGTCTGCCCTTGCCCTGTGGGAGAGCATAC
ACCAGAAGATCATACGCACTTCGTTCGACAAAAGACCTCGCCTTAAACAAGAGGCAGTCAAAATACTTCATACAATTTTGAGATTCGTGTACCGGCTTGGATTCTCTTAA
Protein sequenceShow/hide protein sequence
MWDKSLDGRRKKVVGTVDVGRRRDARGRDWDTLCVGGWKMVGNGGLSLSRGKWWEGRVVEDEDGRTGWCMKAVLVGGILELANLFDNLLGHNINLCPPFERKLISIMVYF
TEHLDPEKNCLVILKDKINLLKMAWSEERLLSHDSSLKAWFLEYPVHSKVPGENPESTLGRRIIGDEQVRWGSPDREDEACPKVVSTFLLRTITLHLNNAIAFKFLSNRG
FLSGVFKVASLMAEGYVLSLAVLFWPIFIMDSTRFTVHLLLLDFQRLVSLCTVSIIRSCFLSSRCGSLMVIEAYSPCRFSRQFGFYQDIPNDLGKEIPEANLANVLRLWR
ICTQRETLSQVYLPARATKPHTQVTQRYRSWWLAKHGNYLEEGIQKLVDSASPLPSKPKLPKRLSSSAIIIGKGLKSPINHQYAKMNFLMEFPAHHNFLNSLHHCPPSFV
SPDVFDSIAARVGNSKAPTDRVVIQSCHPVIDEIPEQKKTTTHAAASEISNYCADDVISNYRKQSALALWESIHQKIIRTSFDKRPRLKQEAVKILHTILRFVYRLGFS