; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028430 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028430
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:21606014..21608949
RNA-Seq ExpressionLag0028430
SyntenyLag0028430
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155186.1 uncharacterized protein LOC111022321 [Momordica charantia]1.2e-5456.22Show/hide
Query:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV
        +S RKRKA  REAR   ++  VY   ++    ++EF++ EA+ +  PHNDALV++L +AN +VHRILVDGGSSADI+S   ++AM LG++ LK+S APLV
Subjt:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV

Query:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH
        GFGGERV+ EG IELPVT+G G   VTKMV+FLVVN  S+YN ILGRP +H LKA+ STYHQ  KFPT  G+G ++GEQ+ SRECY T++R   + + + 
Subjt:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH

Query:  G
        G
Subjt:  G

XP_022158687.1 uncharacterized protein LOC111025147 [Momordica charantia]1.6e-5455.22Show/hide
Query:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV
        +S RKRKA  REAR  L +  VY   ++    ++EF++ EA+ +  PHNDALV++L +AN +VHRILVDGGSSADI+S   ++ M LG++ LK+S APL+
Subjt:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV

Query:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH
        GFG ERV+PEG IEL VT+G G   VTKM++FLVV+  S+YN ILGR  +H LKA+ STYHQ MKFPT+ G+G ++GEQ+ SRECYYT++RG  + + + 
Subjt:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH

Query:  G
        G
Subjt:  G

XP_023895777.1 uncharacterized protein LOC112007641 [Quercus suber]1.5e-5535.63Show/hide
Query:  EGKLRAEAEAAAKAKAEAE----QSKVKAKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHG
        EG+   E+E+ ++  A       + +V          R      + +E L+ + D PFV  I    +P KFK+PT   YDG +DP  H+ T+++ M   G
Subjt:  EGKLRAEAEAAAKAKAEAE----QSKVKAKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHG

Query:  ASEFLGARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLN--------SVGRTNHGHERRRGANGR---GKRTRA
                  K+   +LLTI+Q   ESL+ +ITRFN E + V+  D+ + L A   G+  +  ++        S+    H  +    A       KR RA
Subjt:  ASEFLGARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLN--------SVGRTNHGHERRRGANGR---GKRTRA

Query:  GAVSGILWVGASKTRRKAEADQSTKESMRGGDSNRKRKAAAREARYDLEE-RMVYSVRLSKGLP--------SVEFTDLEASGIHQPHNDALVVSLTVAN
          +        + TR   +  +S K   R  D   + K A + A+      ++V SV+LS   P        ++ FTD +A  +H PH+DA+VV+L +A+
Subjt:  GAVSGILWVGASKTRRKAEADQSTKESMRGGDSNRKRKAAAREARYDLEE-RMVYSVRLSKGLP--------SVEFTDLEASGIHQPHNDALVVSLTVAN

Query:  TRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTY
         +  R+L+D GSSADIL    F+ M+LG++ L+ +++PLVGFGG +V P GS+ LPVT G     VTK VNFLVV+C S+YN I+GRP L+  KA+ STY
Subjt:  TRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTY

Query:  HQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQ
        H  +KFPT  G G V G+Q A+RECY   L    Q
Subjt:  HQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQ

XP_023902033.1 uncharacterized protein LOC112013883 [Quercus suber]7.7e-5735.66Show/hide
Query:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHGASEFL-----------GARDRKKPQFNLLTIKQRPGESLNEYITRF
        +E L+ + D PF   I    +P KFK+P+   YDG +DP  H+ T+++ M   G  + +            AR  K+   +LLTI+Q   ESL  +ITRF
Subjt:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHGASEFL-----------GARDRKKPQFNLLTIKQRPGESLNEYITRF

Query:  NNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGRTNHGHERRRGANGRGK---RTRAGAVSGILWVGASKTRRKAEADQSTKESMRG-----GDSNRKRK
        N E + V+  DD + L A   G+  +  ++   + +HGH+     + + +     R G +   +       + KA+ ++ ++  +       G ++  + 
Subjt:  NNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGRTNHGHERRRGANGRGK---RTRAGAVSGILWVGASKTRRKAEADQSTKESMRG-----GDSNRKRK

Query:  AAAREARYDLEERMVYSVRLSK----GLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFG
        A A+++     + +  S R  +      P++ FTD EA  IH PH+DA+V++L +A+ +  R+LVD GSSADIL    F+ M++G+++L+ + +PL+GFG
Subjt:  AAAREARYDLEERMVYSVRLSK----GLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFG

Query:  GERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTAL
        G +V P G+I LPV  G     +TK VNFLVV+C S+YN I+GRP L+  KAV STYH  +KFPT  G+G V+G+Q A+RECY   L
Subjt:  GERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTAL

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]1.6e-5430.46Show/hide
Query:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHGASE----------------------------------------FLG
        L+ +  + +PPF  +IM A+ P +F LP    YDG++DP +HL+ YR+ M   GAS+                                        F  
Subjt:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWMGFHGASE----------------------------------------FLG

Query:  ARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGR-----------------------TNHGHERRRGANGR
        AR R KP   LLT+KQ+ GE+L +YI R+NNE+ QV+GYDD +AL+ ++ GL+  +L  SV +                        N   E+     G+
Subjt:  ARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGR-----------------------TNHGHERRRGANGR

Query:  GKRT-------------------RAGAVSGIL--------------------------------------------------------------------
         K+                    R G  S +                                                                     
Subjt:  GKRT-------------------RAGAVSGIL--------------------------------------------------------------------

Query:  ---------------WVGASKTRRKAEADQSTK-----------ESM----------RGGDSNRKRKAAAREARYDLEERMVYSVRLS-----KGLPSVE
                       +V   + R K+E  +S+K           ESM            GDS + RK  AR+AR++     V     S     +G+P + 
Subjt:  ---------------WVGASKTRRKAEADQSTK-----------ESM----------RGGDSNRKRKAAAREARYDLEERMVYSVRLS-----KGLPSVE

Query:  FTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVV
        F++ +A G+H PH DALVV+L VAN R+HRIL+D GSSADIL    F  M L + +LK    PL GF G  V+PEG IEL V++G+  + VT MVNF+VV
Subjt:  FTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVV

Query:  NCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTR----QHGSAQDEGRGRPV
        +  S+YN +LGRP L+ LKA  S YH  +KFPT +G+GVVRGEQ+ +RECY  A R   +  +       + +D+ R  PV
Subjt:  NCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTR----QHGSAQDEGRGRPV

TrEMBL top hitse value%identityAlignment
A0A2N9FJX9 Uncharacterized protein2.8e-5231.56Show/hide
Query:  KAEAEQSKVKAKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWM----------------GFHGAS
        + E E  +++ + G   ++    A R+ L++L+ +AD PF+  I    +P +FK+P    +DG KDP  +L+ +++ M                G  G++
Subjt:  KAEAEQSKVKAKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMRAEVPHKFKLPTFLQYDGKKDPVQHLDTYRSWM----------------GFHGAS

Query:  ------------------------EFLGARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQ------------DERLLNS
                                 F+G++ R +P  +LL++KQ  GESL  ++ RFN E ++++   + V +TA +AGL+             E +   
Subjt:  ------------------------EFLGARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQ------------DERLLNS

Query:  VGRTNHGHERRRGANGRGK---RTRAGAVSGILWVGASKTRRKAEADQSTKESMR----------------GGDSNRKRKAAAREA-RYDLEERMVYSVR
        +   +HGH        + +     R G +   +    +    K    +   E  R                GG S   RKA AR+     + +R   ++R
Subjt:  VGRTNHGHERRRGANGRGK---RTRAGAVSGILWVGASKTRRKAEADQSTKESMR----------------GGDSNRKRKAAAREA-RYDLEERMVYSVR

Query:  LSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVV
        L   +  + F++ +A G HQPH+DALV+++ +A     R++VD GSSADIL    ++ M+L K++L+   APLVGF G+++ P G + LP+  G     V
Subjt:  LSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVV

Query:  TKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQ
        +K V+FLVVNC SAYN I+GRP L+ L+AV STYH ++KFPT  GIG VRG+Q A+RECY  +L    Q
Subjt:  TKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQ

A0A6J1CZ14 uncharacterized protein LOC1110159011.2e-5254.69Show/hide
Query:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV
        +S RKRKA  REAR  L +  VY   ++     +EF++ EA+ +  PHNDALV++L +AN +VHRILVDGGSS DI S   ++AM LG+E LK+S  PL+
Subjt:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV

Query:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRG
        GFGGER++P+G IELP+T+      +T+MV+FLVV+  S+YN IL RP +H L+A+ STYHQ MKFPT  G+G ++GEQ+ SRECYYT++RG
Subjt:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRG

A0A6J1DQY2 uncharacterized protein LOC1110223216.0e-5556.22Show/hide
Query:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV
        +S RKRKA  REAR   ++  VY   ++    ++EF++ EA+ +  PHNDALV++L +AN +VHRILVDGGSSADI+S   ++AM LG++ LK+S APLV
Subjt:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV

Query:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH
        GFGGERV+ EG IELPVT+G G   VTKMV+FLVVN  S+YN ILGRP +H LKA+ STYHQ  KFPT  G+G ++GEQ+ SRECY T++R   + + + 
Subjt:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH

Query:  G
        G
Subjt:  G

A0A6J1DXV4 uncharacterized protein LOC1110251477.8e-5555.22Show/hide
Query:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV
        +S RKRKA  REAR  L +  VY   ++    ++EF++ EA+ +  PHNDALV++L +AN +VHRILVDGGSSADI+S   ++ M LG++ LK+S APL+
Subjt:  DSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLV

Query:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH
        GFG ERV+PEG IEL VT+G G   VTKM++FLVV+  S+YN ILGR  +H LKA+ STYHQ MKFPT+ G+G ++GEQ+ SRECYYT++RG  + + + 
Subjt:  GFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALRGASQRTRQH

Query:  G
        G
Subjt:  G

A0A7J0DNL3 Retrotrans_gag domain-containing protein2.1e-5232.08Show/hide
Query:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQ-YDGKKDPVQHLDTYRSWMGFHGASE----------------------------------------FL
        +++L+ Q DPPF + ++R  +  KFKLPT L  Y+GK DP+ HLD+Y+S M   G S+                                        F+
Subjt:  LESLIGQADPPFVDEIMRAEVPHKFKLPTFLQ-YDGKKDPVQHLDTYRSWMGFHGASE----------------------------------------FL

Query:  GARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGRT---------------------NHGHERRRG-----
          R+R+K   +L TI Q+  ESL +++ RFN  V+ VE   D V + A++ GL+   L +S+ +                           RRRG     
Subjt:  GARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGRT---------------------NHGHERRRG-----

Query:  --------------------------ANGRGKRT----------RAGAVSGILWVGASKTRRKAEADQSTKESMR--------GGDSNRKRKAAAREARY
                                  +N R  RT                  +    S   RK   ++ T  +++        GG S   RK  AR A  
Subjt:  --------------------------ANGRGKRT----------RAGAVSGILWVGASKTRRKAEADQSTKESMR--------GGDSNRKRKAAAREARY

Query:  DLEERMV-YSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIE
         +EE +   S   +   P + F+  +  G+H PH+DALVVS  +AN  V RIL+D GSSADIL    FE MK+G ++L     PL+GFGG    P G I 
Subjt:  DLEERMV-YSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSSADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIE

Query:  LPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALR
        LP+T G      T   +F+VV+C S YN ILGRP L  +KA+ STYH  MKFPT +GIG V+G+Q+ +R+C+ +A++
Subjt:  LPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASRECYYTALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAATTTTGGCTTAGCAGGTATGCAGGAGAAGAATGGTCACAAGCTCAGTCGAGATCGGCCCAAGGGAGAAGAAGCCCACAAGAGGGGGAGTTTCACCCCATCCTC
GGCCCAAAGCCGAGGCCGAGGGTCTGGTCGGCCTCGGCCCAATGCCGAGGCCGACCAGGCTAGTCCGCCCATCTGCGGTCAGGGCCACCCGGATCCGTCCCCTGGGCCCG
GTAGCAGGGAGTTGCGTGAGAGGACAGCGCGAGAAGGTTTCTGGACTGGGAGACCGGCCGGAGATTTTCCGATCACTGACTTAGGCATCGGAGCACGTGTGGCGAGCACC
ACACCGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGTGTGTCGGATTGAGAGCATTGCAGGCCGGTGCAACAGGATAATGGAACAGGAGAATCCAGCGGGTGA
GAAAGGTCAACAGTCCAAGGTGAGTGCCCCGGGCACTGAGATGGAAGACCTTAGAGGAAGGGTGAACGAGATGGGCCAGAGTTTGGCTGAGATTTTGGGTATATTGAAGC
AACCGAATCCTGGTACGACGTACAAGAGAAGTTCCATGTGGGACCAAGAGAGGAAGAGAGGGGCCTCCGGGCCGAGCAAAGGGGCAGAGCGCAAGGGTGCACTAGATGTC
CCAGATGAGGTTAGCACAGTAGGCTCGCACAGGAGGATCGAGGCCGAGGCCGAGGCCGAAGCCATAGCCAAAGCCAAGATGCAGGAAAAGGTCGAGCTCGAGGCAAAGAT
CAGGGTCGAGCTAGAAGGTAAGCTGAGAGCCGAGGCCGAAGCTGCGGCCAAGGCAAAGGCCGAGGCCGAGCAGTCTAAGGTTAAGGCCAAGGAAGGTACCCAGGGAAGCG
CACGCCCTAGGGATGCGGATAGGGATTATTTGGAAAGTTTGATAGGGCAGGCTGACCCACCCTTCGTCGATGAGATTATGCGAGCCGAGGTTCCACACAAGTTCAAGTTA
CCAACTTTTCTGCAGTACGACGGGAAAAAAGACCCAGTGCAACATTTGGACACCTACCGATCCTGGATGGGATTTCATGGGGCTTCCGAGTTTTTAGGAGCAAGGGATCG
GAAGAAGCCACAGTTCAATTTGTTGACTATAAAGCAAAGGCCAGGGGAGAGCTTGAATGAGTATATCACAAGGTTCAATAACGAGGTCGTGCAAGTAGAGGGGTATGACG
ATGCAGTGGCGCTGACCGCGGTTATCGCCGGGTTACAGGATGAAAGATTGTTGAATTCCGTGGGGAGGACCAACCACGGACACGAAAGGAGGAGAGGGGCAAACGGCCGC
GGGAAGAGGACGAGGGCCGGAGCCGTCTCAGGTATTCTGTGGGTCGGGGCCAGTAAGACCAGAAGGAAGGCCGAGGCCGACCAGAGTACAAAGGAAAGTATGAGAGGAGG
AGACTCAAACCGGAAAAGGAAAGCAGCGGCTAGAGAGGCAAGGTATGATCTTGAGGAAAGAATGGTGTATTCGGTACGGCTCTCAAAGGGGTTGCCGTCAGTAGAGTTCA
CCGATCTGGAAGCTTCCGGCATACACCAGCCTCACAATGATGCACTGGTGGTGAGCCTGACAGTTGCCAACACGCGGGTTCACCGGATTTTAGTTGATGGTGGAAGTTCA
GCAGACATCCTGTCCACGAAAGTGTTTGAGGCCATGAAATTGGGGAAGGAACGGTTGAAGACGAGCGCTGCTCCGCTGGTTGGTTTTGGGGGAGAAAGAGTGCTACCAGA
GGGAAGCATTGAGTTGCCGGTGACCTATGGGGAAGGGCAGAGTGTGGTGACGAAAATGGTTAATTTCTTGGTAGTAAATTGTGTATCCGCCTATAACGTCATACTGGGAA
GGCCAGCTTTGCATGAATTGAAAGCGGTGGCATCGACATACCACCAAATGATGAAGTTCCCAACCAATAGCGGAATAGGAGTTGTGAGGGGAGAACAACAAGCTTCACGT
GAATGCTATTACACCGCGCTCAGAGGAGCAAGCCAGCGAACAAGGCAGCACGGCTCGGCCCAGGACGAGGGCCGAGGCCGACCAGTCCGAGGCCGAGCAGAGGGATGCCC
TTTGCCAATGGAATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAATTTTGGCTTAGCAGGTATGCAGGAGAAGAATGGTCACAAGCTCAGTCGAGATCGGCCCAAGGGAGAAGAAGCCCACAAGAGGGGGAGTTTCACCCCATCCTC
GGCCCAAAGCCGAGGCCGAGGGTCTGGTCGGCCTCGGCCCAATGCCGAGGCCGACCAGGCTAGTCCGCCCATCTGCGGTCAGGGCCACCCGGATCCGTCCCCTGGGCCCG
GTAGCAGGGAGTTGCGTGAGAGGACAGCGCGAGAAGGTTTCTGGACTGGGAGACCGGCCGGAGATTTTCCGATCACTGACTTAGGCATCGGAGCACGTGTGGCGAGCACC
ACACCGATTTTGGCATCAACAGTTGGCGCCGTCTGTGGGGAAGTGTGTCGGATTGAGAGCATTGCAGGCCGGTGCAACAGGATAATGGAACAGGAGAATCCAGCGGGTGA
GAAAGGTCAACAGTCCAAGGTGAGTGCCCCGGGCACTGAGATGGAAGACCTTAGAGGAAGGGTGAACGAGATGGGCCAGAGTTTGGCTGAGATTTTGGGTATATTGAAGC
AACCGAATCCTGGTACGACGTACAAGAGAAGTTCCATGTGGGACCAAGAGAGGAAGAGAGGGGCCTCCGGGCCGAGCAAAGGGGCAGAGCGCAAGGGTGCACTAGATGTC
CCAGATGAGGTTAGCACAGTAGGCTCGCACAGGAGGATCGAGGCCGAGGCCGAGGCCGAAGCCATAGCCAAAGCCAAGATGCAGGAAAAGGTCGAGCTCGAGGCAAAGAT
CAGGGTCGAGCTAGAAGGTAAGCTGAGAGCCGAGGCCGAAGCTGCGGCCAAGGCAAAGGCCGAGGCCGAGCAGTCTAAGGTTAAGGCCAAGGAAGGTACCCAGGGAAGCG
CACGCCCTAGGGATGCGGATAGGGATTATTTGGAAAGTTTGATAGGGCAGGCTGACCCACCCTTCGTCGATGAGATTATGCGAGCCGAGGTTCCACACAAGTTCAAGTTA
CCAACTTTTCTGCAGTACGACGGGAAAAAAGACCCAGTGCAACATTTGGACACCTACCGATCCTGGATGGGATTTCATGGGGCTTCCGAGTTTTTAGGAGCAAGGGATCG
GAAGAAGCCACAGTTCAATTTGTTGACTATAAAGCAAAGGCCAGGGGAGAGCTTGAATGAGTATATCACAAGGTTCAATAACGAGGTCGTGCAAGTAGAGGGGTATGACG
ATGCAGTGGCGCTGACCGCGGTTATCGCCGGGTTACAGGATGAAAGATTGTTGAATTCCGTGGGGAGGACCAACCACGGACACGAAAGGAGGAGAGGGGCAAACGGCCGC
GGGAAGAGGACGAGGGCCGGAGCCGTCTCAGGTATTCTGTGGGTCGGGGCCAGTAAGACCAGAAGGAAGGCCGAGGCCGACCAGAGTACAAAGGAAAGTATGAGAGGAGG
AGACTCAAACCGGAAAAGGAAAGCAGCGGCTAGAGAGGCAAGGTATGATCTTGAGGAAAGAATGGTGTATTCGGTACGGCTCTCAAAGGGGTTGCCGTCAGTAGAGTTCA
CCGATCTGGAAGCTTCCGGCATACACCAGCCTCACAATGATGCACTGGTGGTGAGCCTGACAGTTGCCAACACGCGGGTTCACCGGATTTTAGTTGATGGTGGAAGTTCA
GCAGACATCCTGTCCACGAAAGTGTTTGAGGCCATGAAATTGGGGAAGGAACGGTTGAAGACGAGCGCTGCTCCGCTGGTTGGTTTTGGGGGAGAAAGAGTGCTACCAGA
GGGAAGCATTGAGTTGCCGGTGACCTATGGGGAAGGGCAGAGTGTGGTGACGAAAATGGTTAATTTCTTGGTAGTAAATTGTGTATCCGCCTATAACGTCATACTGGGAA
GGCCAGCTTTGCATGAATTGAAAGCGGTGGCATCGACATACCACCAAATGATGAAGTTCCCAACCAATAGCGGAATAGGAGTTGTGAGGGGAGAACAACAAGCTTCACGT
GAATGCTATTACACCGCGCTCAGAGGAGCAAGCCAGCGAACAAGGCAGCACGGCTCGGCCCAGGACGAGGGCCGAGGCCGACCAGTCCGAGGCCGAGCAGAGGGATGCCC
TTTGCCAATGGAATTCTGA
Protein sequenceShow/hide protein sequence
MGNFGLAGMQEKNGHKLSRDRPKGEEAHKRGSFTPSSAQSRGRGSGRPRPNAEADQASPPICGQGHPDPSPGPGSRELRERTAREGFWTGRPAGDFPITDLGIGARVAST
TPILASTVGAVCGEVCRIESIAGRCNRIMEQENPAGEKGQQSKVSAPGTEMEDLRGRVNEMGQSLAEILGILKQPNPGTTYKRSSMWDQERKRGASGPSKGAERKGALDV
PDEVSTVGSHRRIEAEAEAEAIAKAKMQEKVELEAKIRVELEGKLRAEAEAAAKAKAEAEQSKVKAKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMRAEVPHKFKL
PTFLQYDGKKDPVQHLDTYRSWMGFHGASEFLGARDRKKPQFNLLTIKQRPGESLNEYITRFNNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGRTNHGHERRRGANGR
GKRTRAGAVSGILWVGASKTRRKAEADQSTKESMRGGDSNRKRKAAAREARYDLEERMVYSVRLSKGLPSVEFTDLEASGIHQPHNDALVVSLTVANTRVHRILVDGGSS
ADILSTKVFEAMKLGKERLKTSAAPLVGFGGERVLPEGSIELPVTYGEGQSVVTKMVNFLVVNCVSAYNVILGRPALHELKAVASTYHQMMKFPTNSGIGVVRGEQQASR
ECYYTALRGASQRTRQHGSAQDEGRGRPVRGRAEGCPLPMEF