CuGenDBv2

Lag0025304 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025304
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Plus3 domain-containing protein
Genome location: chr10:11038636..11041695
RNA-Seq Expression: Lag0025304
Synteny: Lag0025304
Gene Ontology terms: NA
InterPro domains: NA
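
The genome location above can be split into its parts and turned into a span length. The snippet below is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:11038636..11041695")
print(chrom, span_length(start, end))  # chr10 3060 (under the inclusive assumption)
```

Under that assumption the gene spans 3060 bp on chr10; with half-open coordinates the figure would be one less.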


Homology
GenBank top hits (e value, % identity, alignment)
GFY88797.1 hypothetical protein Acr_06g0007370 [Actinidia rufa]; e value: 1.5e-18; identity: 35.56%
Query:  LTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDF
        +T   L  LR  Y  P  V +RLP  GE   +   GEVAFY A F  G+R P+   ++  L    + PAQL PN W  +     LW  Y     +++ +F
Subjt:  LTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDF

Query:  LSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP
         +L ++N NP  D   L++ A  KK  L   P++VK WK+ +FFVSG+  E  EE    G   VP  +G  V ++    P
Subjt:  LSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP

GFZ18333.1 hypothetical protein Acr_27g0000720 [Actinidia rufa]; e value: 6.9e-19; identity: 34.02%
Query:  HSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPL----PLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAM
        H  + S  +T   L  LR  Y  P  V +RLP+ G+   +   GEVAFY A F  G+R PL     L LQ + +C    PAQL PN W  +     LW +
Subjt:  HSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPL----PLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAM

Query:  YGGGSLMTVDDFLSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP------TA
        Y     +++ +F +L ++N NP  D   L++ A  KK  L   P++VK WK+ +FFVSG+  E  EE    G   VP  +G  +P     +P        
Subjt:  YGGGSLMTVDDFLSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP------TA

Query:  RRFVKYVLSLEKINRHD-PFLVDQS----VFEASGLARRRT
        + F +   S+E+  R   P L+D      VF +SG    RT
Subjt:  RRFVKYVLSLEKINRHD-PFLVDQS----VFEASGLARRRT

TXG53676.1 hypothetical protein EZV62_018932 [Acer yangbiense]; e value: 2.0e-18; identity: 27.59%
Query:  ASEGSITSPDAGESYSDDGP-------SSSSCFVDPEISDSSDGEPPAHS----SDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPP-DGEVA
        A E SI   D   S +++         S+  C    E S S D      S      + S++T   +E  R KY IP+++ LRLP  G+   +PP + EVA
Subjt:  ASEGSITSPDAGESYSDDGP-------SSSSCFVDPEISDSSDGEPPAHS----SDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPP-DGEVA

Query:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASA---KKCTLISRPTSVKKW
           A F+FGV LP   FL+  L     APAQL PN W  LIG + +W       L T  +F++L+ +   P +   +YY SA   K+  +   P+S K W
Subjt:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASA---KKCTLISRPTSVKKW

Query:  KNGWFFVSGNWLEKTEEGCF-FGVPMRFGEYV---------------PRNVRRS---PTARRFVKYVLSLEKINRHDPFLVDQSVFEASGLARRRTINSE
        KN WFF SG+W  +  E  F   +P RF   V                RN++ +   P   R  K +L+ E + RH  F +      + G  ++R I  E
Subjt:  KNGWFFVSGNWLEKTEEGCF-FGVPMRFGEYV---------------PRNVRRS---PTARRFVKYVLSLEKINRHDPFLVDQSVFEASGLARRRTINSE

Query:  -EMAFRRMYYSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPAPSTRRTRYQMPSSVTETDLSTGIPVFALPED-----YESGGNEAE-----I
          +       ++R R E+    G     SV ++ + +   +  TS    P       QMP  V     S G       +D      E    EA+      
Subjt:  -EMAFRRMYYSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPAPSTRRTRYQMPSSVTETDLSTGIPVFALPED-----YESGGNEAE-----I

Query:  LTQNFMCWQGLQ----------SRRPEGELGAFTKATASAISLASKINELQ-LNSVPRSELIQAQERLNEANRLLEEVRMKLKSRDAELESTKAQLMEAK
        LT  +  +   Q                 L    KA  SA+    ++  LQ  N +    L Q  ER  +    L+E R    S   ++ + KA ++EAK
Subjt:  LTQNFMCWQGLQ----------SRRPEGELGAFTKATASAISLASKINELQ-LNSVPRSELIQAQERLNEANRLLEEVRMKLKSRDAELESTKAQLMEAK

Query:  AHLASADYLAD
        +  A+ +  AD
Subjt:  AHLASADYLAD

TXG53679.1 hypothetical protein EZV62_018935 [Acer yangbiense]; e value: 1.5e-18; identity: 33.52%
Query:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFT
        SS+GE       + S++T   LE L   Y IP+++ LRLP       +PP G V  +   F+FG++LP   FL+  L     APAQL+PN W  LIG + 
Subjt:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFT

Query:  LWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYAS--AKKCTLISRPTSVKKWKNGWFFVSGNW-LEKTEEGCFFGVPMRF
        +W       L T  +F  L+ +  +P     +  ++   K+  +I  P+S K WK  WFF SG+W  +  E+     +P  F
Subjt:  LWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYAS--AKKCTLISRPTSVKKWKNGWFFVSGNW-LEKTEEGCFFGVPMRF

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value: 6.4e-25; identity: 33.33%
Query:  DSSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF
        + SD        +  S +  + L  LR  + IP+++ LRLP  GE  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+ PNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF

Query:  TLWAMYGGGS----LMTVDDFLSLHTINRNPAFDNLFYYASAKKC-TLISRPTSVKKWKNGWFFVSGNWLEKTEEG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K    ++  PTS+K W   WF+ SG WL K E G  FF VP RFG  V        T 
Subjt:  TLWAMYGGGS----LMTVDDFLSLHTINRNPAFDNLFYYASAKKC-TLISRPTSVKKWKNGWFFVSGNWLEKTEEG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  RRF--VKYVLSLEKINRHDPFLVDQSVFEASGL-----ARRRTINSEEMAFRRMY--YSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPA
          F  +KY        R    LV   +   SGL     A R   +S   +   M   ++   +R+++ RA   +AA    ++   P VV   S  PA
Subjt:  RRF--VKYVLSLEKINRHDPFLVDQSVFEASGL-----ARRRTINSEEMAFRRMY--YSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPA

TrEMBL top hits (e value, % identity, alignment)
A0A5C7HA73 Plus3 domain-containing protein; e value: 7.4e-19; identity: 33.52%
Query:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFT
        SS+GE       + S++T   LE L   Y IP+++ LRLP       +PP G V  +   F+FG++LP   FL+  L     APAQL+PN W  LIG + 
Subjt:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFT

Query:  LWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYAS--AKKCTLISRPTSVKKWKNGWFFVSGNW-LEKTEEGCFFGVPMRF
        +W       L T  +F  L+ +  +P     +  ++   K+  +I  P+S K WK  WFF SG+W  +  E+     +P  F
Subjt:  LWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYAS--AKKCTLISRPTSVKKWKNGWFFVSGNW-LEKTEEGCFFGVPMRF

A0A5C7HBX5 Plus3 domain-containing protein; e value: 9.7e-19; identity: 27.59%
Query:  ASEGSITSPDAGESYSDDGP-------SSSSCFVDPEISDSSDGEPPAHS----SDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPP-DGEVA
        A E SI   D   S +++         S+  C    E S S D      S      + S++T   +E  R KY IP+++ LRLP  G+   +PP + EVA
Subjt:  ASEGSITSPDAGESYSDDGP-------SSSSCFVDPEISDSSDGEPPAHS----SDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPP-DGEVA

Query:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASA---KKCTLISRPTSVKKW
           A F+FGV LP   FL+  L     APAQL PN W  LIG + +W       L T  +F++L+ +   P +   +YY SA   K+  +   P+S K W
Subjt:  FYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASA---KKCTLISRPTSVKKW

Query:  KNGWFFVSGNWLEKTEEGCF-FGVPMRFGEYV---------------PRNVRRS---PTARRFVKYVLSLEKINRHDPFLVDQSVFEASGLARRRTINSE
        KN WFF SG+W  +  E  F   +P RF   V                RN++ +   P   R  K +L+ E + RH  F +      + G  ++R I  E
Subjt:  KNGWFFVSGNWLEKTEEGCF-FGVPMRFGEYV---------------PRNVRRS---PTARRFVKYVLSLEKINRHDPFLVDQSVFEASGLARRRTINSE

Query:  -EMAFRRMYYSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPAPSTRRTRYQMPSSVTETDLSTGIPVFALPED-----YESGGNEAE-----I
          +       ++R R E+    G     SV ++ + +   +  TS    P       QMP  V     S G       +D      E    EA+      
Subjt:  -EMAFRRMYYSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPAPSTRRTRYQMPSSVTETDLSTGIPVFALPED-----YESGGNEAE-----I

Query:  LTQNFMCWQGLQ----------SRRPEGELGAFTKATASAISLASKINELQ-LNSVPRSELIQAQERLNEANRLLEEVRMKLKSRDAELESTKAQLMEAK
        LT  +  +   Q                 L    KA  SA+    ++  LQ  N +    L Q  ER  +    L+E R    S   ++ + KA ++EAK
Subjt:  LTQNFMCWQGLQ----------SRRPEGELGAFTKATASAISLASKINELQ-LNSVPRSELIQAQERLNEANRLLEEVRMKLKSRDAELESTKAQLMEAK

Query:  AHLASADYLAD
        +  A+ +  AD
Subjt:  AHLASADYLAD

A0A6J1DXS5 uncharacterized protein LOC111025502; e value: 3.1e-25; identity: 33.33%
Query:  DSSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF
        + SD        +  S +  + L  LR  + IP+++ LRLP  GE  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+ PNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF

Query:  TLWAMYGGGS----LMTVDDFLSLHTINRNPAFDNLFYYASAKKC-TLISRPTSVKKWKNGWFFVSGNWLEKTEEG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K    ++  PTS+K W   WF+ SG WL K E G  FF VP RFG  V        T 
Subjt:  TLWAMYGGGS----LMTVDDFLSLHTINRNPAFDNLFYYASAKKC-TLISRPTSVKKWKNGWFFVSGNWLEKTEEG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  RRF--VKYVLSLEKINRHDPFLVDQSVFEASGL-----ARRRTINSEEMAFRRMY--YSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPA
          F  +KY        R    LV   +   SGL     A R   +S   +   M   ++   +R+++ RA   +AA    ++   P VV   S  PA
Subjt:  RRF--VKYVLSLEKINRHDPFLVDQSVFEASGL-----ARRRTINSEEMAFRRMY--YSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPA

A0A7J0ET91 Uncharacterized protein; e value: 7.4e-19; identity: 35.56%
Query:  LTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDF
        +T   L  LR  Y  P  V +RLP  GE   +   GEVAFY A F  G+R P+   ++  L    + PAQL PN W  +     LW  Y     +++ +F
Subjt:  LTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDF

Query:  LSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP
         +L ++N NP  D   L++ A  KK  L   P++VK WK+ +FFVSG+  E  EE    G   VP  +G  V ++    P
Subjt:  LSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP

A0A7J0H5F3 Uncharacterized protein; e value: 3.3e-19; identity: 34.02%
Query:  HSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPL----PLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAM
        H  + S  +T   L  LR  Y  P  V +RLP+ G+   +   GEVAFY A F  G+R PL     L LQ + +C    PAQL PN W  +     LW +
Subjt:  HSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGVRLPL----PLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAM

Query:  YGGGSLMTVDDFLSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP------TA
        Y     +++ +F +L ++N NP  D   L++ A  KK  L   P++VK WK+ +FFVSG+  E  EE    G   VP  +G  +P     +P        
Subjt:  YGGGSLMTVDDFLSLHTINRNPAFDN--LFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFG---VPMRFGEYVPRNVRRSP------TA

Query:  RRFVKYVLSLEKINRHD-PFLVDQS----VFEASGLARRRT
        + F +   S+E+  R   P L+D      VF +SG    RT
Subjt:  RRFVKYVLSLEKINRHD-PFLVDQS----VFEASGLARRRT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G46696.1 Protein of unknown function, DUF601; e value: 3.0e-04; identity: 34.83%
Query:  DGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLTPN
        DG+   + + LS   T++RL  LR  + IP  + L  P      ENPP G    +   F   G+  PLP  L D +   G+A  QL PN
Subjt:  DGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLTPN

AT1G51172.1 unknown protein; e value: 1.2e-08; identity: 26.64%
Query:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF
        S DG+   + + LS   T +RL  LR  + IP  + L  P    + E+PP G    +   F + G+  PLP  L D +   G+A  QL PN    ++   
Subjt:  SSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCF

Query:  TLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASAKKCTLISR-PTSVKKWKNGWFFVSGNWLEKTEEGCFFGVPMRFGEYVPRNVRRS--PTARRF
        TL      G  + + DFL L+ + ++   +N F+ +  K   +    P   + W+  +FF   N L   E+   F       ++  R V+ +  P +R F
Subjt:  TLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASAKKCTLISR-PTSVKKWKNGWFFVSGNWLEKTEEGCFFGVPMRFGEYVPRNVRRS--PTARRF

Query:  VKYVLSLEKINRHD
          +        RHD
Subjt:  VKYVLSLEKINRHD

AT2G15420.1 myosin heavy chain-related; e value: 4.2e-06; identity: 28.46%
Query:  PDDVHLRLPNAGENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDN
        P ++ L  P++ +    PP+G +  Y A F + G+  PLP FL ++     +A +QLT     + IG   L A    G  +  D F    T +R      
Subjt:  PDDVHLRLPNAGENFENPPDGEVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDN

Query:  LFYYASAKKCTLISRPTS-VKKWKNGWFFV
         +Y ++  K  ++S   S +  WK  +FFV
Subjt:  LFYYASAKKCTLISRPTS-VKKWKNGWFFV


Sequences
CDS sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGGCACCAATAATGAATTTGTTAAGATATGGGAAGACGTGACCTGCAAAGAAGGAAATTACCTCCCGCCTGG
GTCATCCCTCCCCTCTTATAGCTCTCCCCTTGCTAGGGTTTTTTCCTCCCGGCGCCGTGCACCGCACGTGACTCCCCTCTCCCGGGCCCAGGGGGACCAATCCAGGTGGC
CTAGGCAGCTGACGGGCCGGTTTGCCTGGTCGGCCTCGGCATTGGGCCGAGGTATAGGTCCACCTTTGGTGGGTTTCTTCTTCCTTGGGCCAATTTCAACTGGGCCTGAA
GTCCTTCCGCTCTGGTGTACCTGGCAAGCCAAAATTACCCATAACATTAAGCCCCCGCTCTCTTATCTGGGTTGCGATTACCTTGCCTCAGCCTGGCCAGTCTCGTTGGA
CAAATATCAAACTGGTGGCATTGCAACTGTGTCAACCTGCTGGGGTGGTAAAGTTAATAGAGAAGAGAAAAAGATAAACCGTACAAAAAAGACAAAAGAACAGAGAGAGA
GAGAGAGACGAACGAGGACCGAGGCTCATCTGCCTCGGTCTGGCGTGCCTGCTGGGCTGCAGGCGCGCCTCGGTCTCGGCAAAAAGTCGAAGCCGACGCTTCTAGCAAGA
GCTTTTACACTTTTACTTTCTTTTGTTACTTGCATGGCTAGTGAAGGCTCCATTACCTCTCCTGATGCGGGGGAGTCTTATTCCGATGACGGTCCTTCAAGCTCGAGCTG
CTTTGTGGACCCGGAGATTTCAGATAGCAGTGACGGGGAGCCTCCTGCACACTCATCGGACTTATCATCCTCGTTGACCGCAAACCGCCTAGAGTTCCTGAGGCACAAGT
ATGATATTCCCGATGATGTGCATCTGCGGCTTCCCAATGCTGGCGAGAACTTTGAGAATCCCCCGGATGGAGAGGTCGCTTTTTATCATGCCATGTTTAAATTTGGGGTT
CGCTTGCCACTGCCCTTGTTTTTGCAAGATTTCCTAGTCTGCACAGGTTTAGCCCCTGCCCAGCTCACTCCAAATGGGTGGTGCCACCTCATCGGCTGTTTCACTCTTTG
GGCGATGTATGGTGGGGGATCTCTTATGACTGTTGATGATTTTTTATCTTTGCACACCATCAATCGCAACCCCGCCTTCGACAACCTTTTTTATTATGCAAGTGCCAAAA
AATGCACCTTAATCAGCAGACCCACTTCCGTAAAAAAGTGGAAAAACGGTTGGTTCTTTGTTAGTGGCAACTGGCTTGAAAAAACAGAAGAAGGTTGCTTTTTCGGGGTT
CCAATGAGGTTTGGAGAATATGTGCCTCGTAACGTTCGACGCTCCCCAACAGCTAGGAGATTTGTCAAATACGTCCTGAGCCTCGAGAAGATTAACCGCCACGACCCTTT
TCTAGTTGACCAAAGCGTCTTTGAAGCATCCGGGTTAGCCAGACGTCGCACCATCAACTCAGAAGAAATGGCCTTCCGTAGAATGTATTACTCCCAGCGGAAGAGACGCG
AAGCACGCAGCAGAGCTGGAACCTCCCAGGCAGCCTCTGTGGACTTAACCGAGGATGAGGCTCCAAGAGTTGTTGCTGAGACCTCTCGCCGACCTGCCCCTTCTACTCGC
AGAACTCGGTACCAGATGCCCTCCTCGGTCACCGAGACGGACCTTAGCACAGGCATCCCGGTTTTTGCCCTCCCTGAGGACTACGAAAGTGGCGGCAATGAGGCGGAGAT
CTTGACCCAGAACTTCATGTGCTGGCAAGGGTTGCAATCCCGGAGGCCAGAAGGTGAGCTTGGGGCTTTCACGAAGGCCACTGCTTCTGCAATTAGCCTGGCGAGTAAAA
TTAACGAGCTTCAGTTGAATAGCGTCCCGCGGAGTGAGCTCATTCAAGCCCAAGAGAGGCTTAATGAGGCCAATCGCCTGCTGGAGGAAGTGCGAATGAAGCTCAAGTCT
AGGGATGCTGAGTTGGAGTCCACTAAAGCTCAACTTATGGAGGCTAAAGCCCATTTAGCCAGTGCTGACTACCTAGCTGATGAGTTCAAGAAAACCAACGAGTTTTATGC
CATGCAGGATGAAATATGGAATGACGGCATCAAGTGGGCACAAAAGAGATACAGTAAGCACCACCCCACCGTGGACGGTTCATTCATCCAAGAAGATCTCGCTACTCTTT
CCAACAACCCTGATGCCTTTGTCTCTTCTGACGATTCCTCTGGCGGTAGAGACCATATGGACCTATGA
mRNA sequence
ATGCCAAAATCTGGCCAAGCGCTCACCTGCCCTTCACGTGGCACCAATAATGAATTTGTTAAGATATGGGAAGACGTGACCTGCAAAGAAGGAAATTACCTCCCGCCTGG
GTCATCCCTCCCCTCTTATAGCTCTCCCCTTGCTAGGGTTTTTTCCTCCCGGCGCCGTGCACCGCACGTGACTCCCCTCTCCCGGGCCCAGGGGGACCAATCCAGGTGGC
CTAGGCAGCTGACGGGCCGGTTTGCCTGGTCGGCCTCGGCATTGGGCCGAGGTATAGGTCCACCTTTGGTGGGTTTCTTCTTCCTTGGGCCAATTTCAACTGGGCCTGAA
GTCCTTCCGCTCTGGTGTACCTGGCAAGCCAAAATTACCCATAACATTAAGCCCCCGCTCTCTTATCTGGGTTGCGATTACCTTGCCTCAGCCTGGCCAGTCTCGTTGGA
CAAATATCAAACTGGTGGCATTGCAACTGTGTCAACCTGCTGGGGTGGTAAAGTTAATAGAGAAGAGAAAAAGATAAACCGTACAAAAAAGACAAAAGAACAGAGAGAGA
GAGAGAGACGAACGAGGACCGAGGCTCATCTGCCTCGGTCTGGCGTGCCTGCTGGGCTGCAGGCGCGCCTCGGTCTCGGCAAAAAGTCGAAGCCGACGCTTCTAGCAAGA
GCTTTTACACTTTTACTTTCTTTTGTTACTTGCATGGCTAGTGAAGGCTCCATTACCTCTCCTGATGCGGGGGAGTCTTATTCCGATGACGGTCCTTCAAGCTCGAGCTG
CTTTGTGGACCCGGAGATTTCAGATAGCAGTGACGGGGAGCCTCCTGCACACTCATCGGACTTATCATCCTCGTTGACCGCAAACCGCCTAGAGTTCCTGAGGCACAAGT
ATGATATTCCCGATGATGTGCATCTGCGGCTTCCCAATGCTGGCGAGAACTTTGAGAATCCCCCGGATGGAGAGGTCGCTTTTTATCATGCCATGTTTAAATTTGGGGTT
CGCTTGCCACTGCCCTTGTTTTTGCAAGATTTCCTAGTCTGCACAGGTTTAGCCCCTGCCCAGCTCACTCCAAATGGGTGGTGCCACCTCATCGGCTGTTTCACTCTTTG
GGCGATGTATGGTGGGGGATCTCTTATGACTGTTGATGATTTTTTATCTTTGCACACCATCAATCGCAACCCCGCCTTCGACAACCTTTTTTATTATGCAAGTGCCAAAA
AATGCACCTTAATCAGCAGACCCACTTCCGTAAAAAAGTGGAAAAACGGTTGGTTCTTTGTTAGTGGCAACTGGCTTGAAAAAACAGAAGAAGGTTGCTTTTTCGGGGTT
CCAATGAGGTTTGGAGAATATGTGCCTCGTAACGTTCGACGCTCCCCAACAGCTAGGAGATTTGTCAAATACGTCCTGAGCCTCGAGAAGATTAACCGCCACGACCCTTT
TCTAGTTGACCAAAGCGTCTTTGAAGCATCCGGGTTAGCCAGACGTCGCACCATCAACTCAGAAGAAATGGCCTTCCGTAGAATGTATTACTCCCAGCGGAAGAGACGCG
AAGCACGCAGCAGAGCTGGAACCTCCCAGGCAGCCTCTGTGGACTTAACCGAGGATGAGGCTCCAAGAGTTGTTGCTGAGACCTCTCGCCGACCTGCCCCTTCTACTCGC
AGAACTCGGTACCAGATGCCCTCCTCGGTCACCGAGACGGACCTTAGCACAGGCATCCCGGTTTTTGCCCTCCCTGAGGACTACGAAAGTGGCGGCAATGAGGCGGAGAT
CTTGACCCAGAACTTCATGTGCTGGCAAGGGTTGCAATCCCGGAGGCCAGAAGGTGAGCTTGGGGCTTTCACGAAGGCCACTGCTTCTGCAATTAGCCTGGCGAGTAAAA
TTAACGAGCTTCAGTTGAATAGCGTCCCGCGGAGTGAGCTCATTCAAGCCCAAGAGAGGCTTAATGAGGCCAATCGCCTGCTGGAGGAAGTGCGAATGAAGCTCAAGTCT
AGGGATGCTGAGTTGGAGTCCACTAAAGCTCAACTTATGGAGGCTAAAGCCCATTTAGCCAGTGCTGACTACCTAGCTGATGAGTTCAAGAAAACCAACGAGTTTTATGC
CATGCAGGATGAAATATGGAATGACGGCATCAAGTGGGCACAAAAGAGATACAGTAAGCACCACCCCACCGTGGACGGTTCATTCATCCAAGAAGATCTCGCTACTCTTT
CCAACAACCCTGATGCCTTTGTCTCTTCTGACGATTCCTCTGGCGGTAGAGACCATATGGACCTATGA
Protein sequence
MPKSGQALTCPSRGTNNEFVKIWEDVTCKEGNYLPPGSSLPSYSSPLARVFSSRRRAPHVTPLSRAQGDQSRWPRQLTGRFAWSASALGRGIGPPLVGFFFLGPISTGPE
VLPLWCTWQAKITHNIKPPLSYLGCDYLASAWPVSLDKYQTGGIATVSTCWGGKVNREEKKINRTKKTKEQRERERRTRTEAHLPRSGVPAGLQARLGLGKKSKPTLLAR
AFTLLLSFVTCMASEGSITSPDAGESYSDDGPSSSSCFVDPEISDSSDGEPPAHSSDLSSSLTANRLEFLRHKYDIPDDVHLRLPNAGENFENPPDGEVAFYHAMFKFGV
RLPLPLFLQDFLVCTGLAPAQLTPNGWCHLIGCFTLWAMYGGGSLMTVDDFLSLHTINRNPAFDNLFYYASAKKCTLISRPTSVKKWKNGWFFVSGNWLEKTEEGCFFGV
PMRFGEYVPRNVRRSPTARRFVKYVLSLEKINRHDPFLVDQSVFEASGLARRRTINSEEMAFRRMYYSQRKRREARSRAGTSQAASVDLTEDEAPRVVAETSRRPAPSTR
RTRYQMPSSVTETDLSTGIPVFALPEDYESGGNEAEILTQNFMCWQGLQSRRPEGELGAFTKATASAISLASKINELQLNSVPRSELIQAQERLNEANRLLEEVRMKLKS
RDAELESTKAQLMEAKAHLASADYLADEFKKTNEFYAMQDEIWNDGIKWAQKRYSKHHPTVDGSFIQEDLATLSNNPDAFVSSDDSSGGRDHMDL
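
The CDS and protein sequences above can be checked against each other by translating the CDS. The snippet below is a minimal sketch under stated assumptions: it uses the standard nuclear genetic code, the `translate` helper is hypothetical (not a CuGenDB tool), and only the first 30 and last 18 nucleotides of the CDS are pasted in as excerpts.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Excerpts copied from the CDS above (first 30 nt and last 18 nt).
cds_start = "ATGCCAAAATCTGGCCAAGCGCTCACCTGC"
cds_end = "GACCATATGGACCTATGA"

print(translate(cds_start))  # MPKSGQALTC, the first ten residues of the protein
print(translate(cds_end))    # DHMDL*, the last five residues plus the stop codon
```

To check the whole record, join the wrapped CDS lines into one string before calling `translate`, then compare the result (minus the trailing `*`) with the joined protein lines.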