; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041728 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041728
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag-pro-like protein
Genome locationchr13:25438790..25441903
RNA-Seq ExpressionLag0041728
SyntenyLag0041728
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037620.1 Gag-pro-like protein [Cucumis melo var. makuwa]1.2e-5350.21Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP
        MEE+  +M+K R++I  L E++  I  L+++  GK+  D AQS+N     PI  + +   P+   LY++ V Q+        Q   H  P  + P  V P
Subjt:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP

Query:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL
         +   V +L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVP +I+P KFKVPEF+KYDG++CP++HLIMYCRKMAAY+ NDKL
Subjt:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL

Query:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        L+H FQDSLTGPASRWY+QLD+ HI  WK+LAD+FLK
Subjt:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

KAA0044913.1 uncharacterized protein E6C27_scaffold74G002080 [Cucumis melo var. makuwa]9.3e-5450.21Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP
        MEE+  +M+K R++I  L E++  I  L+++  GK+  D AQS+N + D        G TP YH   N+P  Q          VP +       P  V P
Subjt:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP

Query:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL
         +   + +L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVPD+I+P KFKVPEF+KYDG++CP++HLIMYCRKMAA++ NDKL
Subjt:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL

Query:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        L+H FQDSLTGPASRWY+QLD+ HI  WK+LAD+FLK
Subjt:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]9.0e-6555.98Show/hide
Query:  EEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPK---------------------YHPLYNIPVEQNPFPFFKNE
        +++ +E EKTRKDIEELREK+DAIL+ALE GK    IA+++N +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPK---------------------YHPLYNIPVEQNPFPFFKNE

Query:  QVPVHNQPG--FSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS
          P     G  F  P +V  PP    TV NL  P+  K+    ++  SSEKLEVLEERLRAVEGTDVFGNIDA++LCL   +++PPKFK+PEFEKY+G+S
Subjt:  QVPVHNQPG--FSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS

Query:  CPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        CPKNHLIMYCRKMAAY+QNDKLLIH FQDSL+GP S WYM LDS H+ SWKNLADSFLK
Subjt:  CPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]2.6e-6456.92Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMND------------PPILQSTEGTTPK---YHPLYNIPVEQNPFPFFKN-EQVPV
        ME+Q  E EKTRKDIEELREK+D I + LE GK   D A S+N +++            PP+    EG  P+   Y+PLY++P+ Q P  F K  +Q+P 
Subjt:  MEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMND------------PPILQSTEGTTPK---YHPLYNIPVEQNPFPFFKN-EQVPV

Query:  HNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHL
        +    F  P ++  PP    TV NL     + +    +   S EK EVLEERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHL
Subjt:  HNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHL

Query:  IMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        IMYCRKM AYVQN KLLIH FQDSL G ASRWYMQLDS+H+ SWKNLADSFLK
Subjt:  IMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]9.3e-6234.1Show/hide
Query:  IEKQVKQGSLRNVELEKKLNRLKESVSKEEQLEKEISALDTDARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTVI
        ++K+  Q       L+++L + K  +  +++LEK +  LD + R +N+    L+ +  + QAT++S+++ +   ++      EL+ +L+  I  R   ++
Subjt:  IEKQVKQGSLRNVELEKKLNRLKESVSKEEQLEKEISALDTDARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTVI

Query:  EVEEKNGTQCRTIDDLQLTLKIREDQLGELINDNKGLGESVQSLNVRLSKYQDATDILMKDYTYLKEQYDRLNDDFGFVRQNHSTLRSKAEHMLTQIRRV
        ++E  N +  +T+D L +       ++ E   D K L     SL+ +L+ +Q++++ ++++Y  LK  Y ++  D+   R++  TL  + +  +  +R V
Subjt:  EVEEKNGTQCRTIDDLQLTLKIREDQLGELINDNKGLGESVQSLNVRLSKYQDATDILMKDYTYLKEQYDRLNDDFGFVRQNHSTLRSKAEHMLTQIRRV

Query:  TRRADELAEDARTLSKVIAPTQSNSKNMLKFLGKLRISLETRIMEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGT
        +RRA+  AE A  L       + +S ++ +FL  +                          RE     L     GK++ + AQS+N + D        G 
Subjt:  TRRADELAEDARTLSKVIAPTQSNSKNMLKFLGKLRISLETRIMEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGT

Query:  TPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD
        TP++    N    Q         Q  V   P F +P  VP      +  L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVP 
Subjt:  TPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPD

Query:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        +I+P KFKVP F+KYDG+SCP++HLIMYCRKMAA++ NDKLLIH FQDSLTGPA+RWY+QLD+ HI  WK+LAD+FLK
Subjt:  VILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

TrEMBL top hitse value%identityAlignment
A0A5A7T2M2 Gag-pro-like protein5.9e-5450.21Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP
        MEE+  +M+K R++I  L E++  I  L+++  GK+  D AQS+N     PI  + +   P+   LY++ V Q+        Q   H  P  + P  V P
Subjt:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP

Query:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL
         +   V +L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVP +I+P KFKVPEF+KYDG++CP++HLIMYCRKMAAY+ NDKL
Subjt:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL

Query:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        L+H FQDSLTGPASRWY+QLD+ HI  WK+LAD+FLK
Subjt:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

A0A5A7UL51 Girdin-like7.7e-5430.79Show/hide
Query:  EDQATVRQWSENVQQIHGDSLVE----NVVSQFKDVSFPESQLETTKRAWESLTVDRKAKFTSKYGHLAQLMYVQGNYSVLKALVRHWDPTYRCFTFGSI
        ++ + V +W+E +QQ  GD +      +V+S+ + +S  ++ L   K  WE+LT  R+  F+ KYGH+A+LMY+  NY  L+A++  WDP Y CFTFGS 
Subjt:  EDQATVRQWSENVQQIHGDSLVE----NVVSQFKDVSFPESQLETTKRAWESLTVDRKAKFTSKYGHLAQLMYVQGNYSVLKALVRHWDPTYRCFTFGSI

Query:  DMTPTIEEYQSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVK-QGSLRNVELEKKLNRLKESVSKEEQL-------------EKEISAL
        D+ PTIEEYQ++L MP +     Y ++ + T K           T++I+K +K +G   NV  +  +   +  + +++ L              K    +
Subjt:  DMTPTIEEYQSLLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVK-QGSLRNVELEKKLNRLKESVSKEEQL-------------EKEISAL

Query:  DTDARDLNRRMHR-LRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTVIEVEEKNGTQCRTIDDLQLTLKIREDQLGELI----NDN
        D     L   M R +     +   T +S N    K + +      L+      I +++      E     +C  + D      +  + + E      +  
Subjt:  DTDARDLNRRMHR-LRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLRNQTVIEVEEKNGTQCRTIDDLQLTLKIREDQLGELI----NDN

Query:  KGLGESVQSLNVRLSKYQ---DATDILMKDYTYLKEQYDR--LNDDFGFVRQNHSTLRSKAEHMLTQIRRVTRRADEL-----AEDARTLSKVIAPTQSN
            E+  S   +L+       A  + +K   Y  E +    L   +G V  N++ L    +  L Q    T+   E       ED +   +        
Subjt:  KGLGESVQSLNVRLSKYQ---DATDILMKDYTYLKEQYDR--LNDDFGFVRQNHSTLRSKAEHMLTQIRRVTRRADEL-----AEDARTLSKVIAPTQSN

Query:  SKNMLKFLGKLRISLETRIMEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQ
         K++ K   K      T   E       K   DI   RE        +E GK     AQ N  +     L+                 E+N     +NE+
Subjt:  SKNMLKFLGKLRISLETRIMEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQ

Query:  VPVHNQPGFSLPTEVPPKV--TITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPK
        +           T +  ++  T ++ +L+     +++   E   + +KL+VLEERLRA+E TDV+GN DAT+LCLVP +I+P KFKVPEF+KYDG++CP+
Subjt:  VPVHNQPGFSLPTEVPPKV--TITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPK

Query:  NHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
         HLIMYCRKMAA++ NDKLL+H FQDSLTGPASRWY+QLD+ HI  WK+LAD+FLK
Subjt:  NHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

A0A5D3CXD6 Retrotrans_gag domain-containing protein4.5e-5450.21Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP
        MEE+  +M+K R++I  L E++  I  L+++  GK+  D AQS+N + D        G TP YH   N+P  Q          VP +       P  V P
Subjt:  MEEQSTEMEKTRKDIEELREKMDAI--LVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPP

Query:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL
         +   + +L+     +++   E   + +KL+VLEERLRA+EGTDV+GNIDAT+LCLVPD+I+P KFKVPEF+KYDG++CP++HLIMYCRKMAA++ NDKL
Subjt:  KVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKL

Query:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        L+H FQDSLTGPASRWY+QLD+ HI  WK+LAD+FLK
Subjt:  LIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222312.5e-6556.37Show/hide
Query:  EEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPK---------------------YHPLYNIPVEQNPFPFFKNE
        +++ +E EKTRKDIEELREK+DAIL+ALE GK    IA+++N +++PP  Q   G  P                      Y+PLY+IP  Q P P  +  
Subjt:  EEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPILQSTEGTTPK---------------------YHPLYNIPVEQNPFPFFKNE

Query:  QVPVHNQPG--FSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS
          P     G  F  P +V  PP    TV NL  P+  K+    ++  SSEKLEVLEERLRAVEGTDVFGNIDA++LCL   +++PPKFK+PEFEKYDG+S
Subjt:  QVPVHNQPG--FSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGAS

Query:  CPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        CPKNHLIMYCRKMAAY+QNDKLLIH FQDSL+GP S WYM LDS H+ SWKNLADSFLK
Subjt:  CPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

A0A6J1DZ90 Ribonuclease H1.3e-6456.92Show/hide
Query:  MEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMND------------PPILQSTEGTTPK---YHPLYNIPVEQNPFPFFKN-EQVPV
        ME+Q  E EKTRKDIEELREK+D I + LE GK   D A S+N +++            PP+    EG  P+   Y+PLY++P+ Q P  F K  +Q+P 
Subjt:  MEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMND------------PPILQSTEGTTPK---YHPLYNIPVEQNPFPFFKN-EQVPV

Query:  HNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHL
        +    F  P ++  PP    TV NL     + +    +   S EK EVLEERLRA+EGTDVFGNIDA++LCLV  +++PPKFKVPEFEKYDG+SCPKNHL
Subjt:  HNQPGFSLPTEV--PPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILPPKFKVPEFEKYDGASCPKNHL

Query:  IMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK
        IMYCRKM AYVQN KLLIH FQDSL G ASRWYMQLDS+H+ SWKNLADSFLK
Subjt:  IMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK

SwissProt top hitse value%identityAlignment
Q54G05 Putative leucine-rich repeat-containing protein DDB_G02905035.4e-0419.27Show/hide
Query:  RTSDIEKQVKQGSLRNVELEKKLNRLKESV-SKEEQLEKEISALDTDARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLR
        + + + + ++     + EL+ KLN+L + +  K+E+L    S ++    +LN   +++   NE+ +    S ++  LK    +  L + ++E ++ +   
Subjt:  RTSDIEKQVKQGSLRNVELEKKLNRLKESV-SKEEQLEKEISALDTDARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQQSEIASLHELMKELEDCISLR

Query:  NQTVIEVEEKNGTQCRTIDDLQLTLKIREDQLGELINDNKGLGESVQSLNVRLS-KYQDATDILMKDYTYLKEQYDRLNDDFGFVRQNHSTLRSKAEHML
          ++IE +EK       ID LQ  L  ++D++ EL+ +N+   + +QS  ++LS + Q+  + L+ + + + E    LN++   + +     +S ++ + 
Subjt:  NQTVIEVEEKNGTQCRTIDDLQLTLKIREDQLGELINDNKGLGESVQSLNVRLS-KYQDATDILMKDYTYLKEQYDRLNDDFGFVRQNHSTLRSKAEHML

Query:  TQIRRVTRRADELAEDARTLSKVIAPTQSNSKNMLKFLGKLRISLETRIMEEQ---STEMEKTRKDIEELREKMD
        +++ +++    +  E+ R+L   I   Q     +++        L++++ E++   +  +E  +  ++EL+ K++
Subjt:  TQIRRVTRRADELAEDARTLSKVIAPTQSNSKNMLKFLGKLRISLETRIMEEQ---STEMEKTRKDIEELREKMD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAACAACAAAACGGGCTTGGGAAAGTTTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTAC
AAGGCAATTATTCTGTATTAAAAGCTTTGGTTCGACATTGGGATCCAACCTACAGATGTTTCACATTTGGCTCAATCGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAGCACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
AGAGAAACAAGTAAAGCAAGGTTCGTTGCGCAACGTTGAACTAGAAAAAAAGTTGAACCGATTAAAGGAAAGTGTCAGCAAAGAAGAACAGTTAGAAAAGGAAATTTCAG
CATTAGACACGGATGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAA
CAATCTGAGATTGCCTCACTCCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAATCAAACGGTTATTGAGGTAGAAGAAAAGAATGGAACGCAATGTCG
AACAATTGACGACCTGCAATTAACGCTCAAGATTAGAGAGGATCAACTAGGGGAGCTCATCAACGACAACAAGGGTCTAGGAGAGTCCGTTCAGTCACTCAATGTTCGCC
TCAGTAAGTATCAGGATGCCACTGACATATTAATGAAAGACTATACCTATCTAAAGGAGCAGTACGACAGATTGAACGATGACTTTGGGTTTGTGAGACAGAACCACTCG
ACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTAC
ACAGTCGAATAGCAAGAATATGCTTAAGTTTCTGGGAAAGCTTCGTATAAGTTTAGAGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAGGAAAGATA
TTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCTCTGGAAGGAGGCAAAATAATACCTGATATTGCTCAGTCCAACAATACAATGAATGACCCTCCAATCTTG
CAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGAACCCATTTCCATTCTTCAAGAATGAGCAAGTGCCTGTACACAATCAACC
TGGATTTTCACTACCCACAGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATAAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTA
GTGAAAAGCTTGAAGTCCTGGAGGAAAGGTTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCT
CCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAGCT
GTTAATTCACTACTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGATAGCACTCATATATGTTCATGGAAGAATCTAGCGGATTCATTTTTAA
AGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATAGAAGATCAAGCAACAGTACGTCAATGGTCAGAAAATGTACAACAAATCCACGGAGATTCTTTAGTAGAAAATGTTGTTTCTCAATTTAAGGATGTCAGTTT
TCCAGAAAGTCAATTAGAAACAACAAAACGGGCTTGGGAAAGTTTAACTGTAGATAGAAAGGCTAAATTTACAAGCAAATATGGCCATCTAGCTCAGCTCATGTATGTAC
AAGGCAATTATTCTGTATTAAAAGCTTTGGTTCGACATTGGGATCCAACCTACAGATGTTTCACATTTGGCTCAATCGACATGACTCCTACAATAGAGGAATATCAATCC
CTTCTGCATATGCCAGCACGAACAGAGGTTGAAGCTTATTCTTACGATCAAGAGCTTACAATGAAAAGAGCATTATCTACTCTTTTGGGCAAGATTCGTACAAGCGACAT
AGAGAAACAAGTAAAGCAAGGTTCGTTGCGCAACGTTGAACTAGAAAAAAAGTTGAACCGATTAAAGGAAAGTGTCAGCAAAGAAGAACAGTTAGAAAAGGAAATTTCAG
CATTAGACACGGATGCCCGCGACCTGAACAGAAGAATGCATCGATTAAGAAGGGATAATGAAGTCTCCCAAGCAACTCTCAAGTCAAGGAATGACCAAGTTTTGAAGCAA
CAATCTGAGATTGCCTCACTCCATGAGTTGATGAAAGAGCTCGAAGATTGCATTAGTTTGAGGAATCAAACGGTTATTGAGGTAGAAGAAAAGAATGGAACGCAATGTCG
AACAATTGACGACCTGCAATTAACGCTCAAGATTAGAGAGGATCAACTAGGGGAGCTCATCAACGACAACAAGGGTCTAGGAGAGTCCGTTCAGTCACTCAATGTTCGCC
TCAGTAAGTATCAGGATGCCACTGACATATTAATGAAAGACTATACCTATCTAAAGGAGCAGTACGACAGATTGAACGATGACTTTGGGTTTGTGAGACAGAACCACTCG
ACACTACGAAGTAAAGCGGAACATATGCTCACTCAGATTAGGAGAGTCACTCGAAGGGCAGATGAACTAGCAGAAGATGCACGTACTCTCTCTAAAGTCATAGCACCTAC
ACAGTCGAATAGCAAGAATATGCTTAAGTTTCTGGGAAAGCTTCGTATAAGTTTAGAGACAAGGATCATGGAAGAGCAAAGTACTGAGATGGAGAAAACAAGGAAAGATA
TTGAGGAGTTACGAGAAAAAATGGATGCCATTCTTGTCGCTCTGGAAGGAGGCAAAATAATACCTGATATTGCTCAGTCCAACAATACAATGAATGACCCTCCAATCTTG
CAATCAACAGAGGGTACTACTCCAAAATATCATCCATTGTACAATATTCCAGTAGAGCAGAACCCATTTCCATTCTTCAAGAATGAGCAAGTGCCTGTACACAATCAACC
TGGATTTTCACTACCCACAGAGGTACCTCCCAAGGTGACCATTACAGTTCCCAATTTAGATGATCCTGAAATAAGAAAAGAGCTAACGGGAGGTGAGAAAGTCTCTTCTA
GTGAAAAGCTTGAAGTCCTGGAGGAAAGGTTAAGGGCAGTAGAAGGAACAGACGTCTTCGGAAATATAGATGCGACCAAGCTATGCTTGGTACCAGATGTAATCCTCCCT
CCAAAATTCAAGGTGCCCGAGTTTGAAAAGTATGATGGAGCATCCTGTCCTAAAAACCATCTCATCATGTATTGCAGGAAGATGGCAGCATACGTCCAAAATGACAAGCT
GTTAATTCACTACTTCCAGGACAGTCTTACTGGTCCAGCATCTCGATGGTATATGCAGTTAGATAGCACTCATATATGTTCATGGAAGAATCTAGCGGATTCATTTTTAA
AGTAA
Protein sequenceShow/hide protein sequence
MGIEDQATVRQWSENVQQIHGDSLVENVVSQFKDVSFPESQLETTKRAWESLTVDRKAKFTSKYGHLAQLMYVQGNYSVLKALVRHWDPTYRCFTFGSIDMTPTIEEYQS
LLHMPARTEVEAYSYDQELTMKRALSTLLGKIRTSDIEKQVKQGSLRNVELEKKLNRLKESVSKEEQLEKEISALDTDARDLNRRMHRLRRDNEVSQATLKSRNDQVLKQ
QSEIASLHELMKELEDCISLRNQTVIEVEEKNGTQCRTIDDLQLTLKIREDQLGELINDNKGLGESVQSLNVRLSKYQDATDILMKDYTYLKEQYDRLNDDFGFVRQNHS
TLRSKAEHMLTQIRRVTRRADELAEDARTLSKVIAPTQSNSKNMLKFLGKLRISLETRIMEEQSTEMEKTRKDIEELREKMDAILVALEGGKIIPDIAQSNNTMNDPPIL
QSTEGTTPKYHPLYNIPVEQNPFPFFKNEQVPVHNQPGFSLPTEVPPKVTITVPNLDDPEIRKELTGGEKVSSSEKLEVLEERLRAVEGTDVFGNIDATKLCLVPDVILP
PKFKVPEFEKYDGASCPKNHLIMYCRKMAAYVQNDKLLIHYFQDSLTGPASRWYMQLDSTHICSWKNLADSFLK