; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G021010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G021010
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr04:13173060..13175246
RNA-Seq ExpressionCmoCh04G021010
SyntenyCmoCh04G021010
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8655918.1 NAC domain-containing protein 48 [Hibiscus syriacus]3.3e-5844.06Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIED++YQKN+++PLL  + + +  E W L D+Q LG+I +TLSRNVAFNI KEKTT G++ ALS+ YEKP T NKV+LM+RLFNL+M+E  SVA H+N
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKRE-------------IEDSSGRDPSVDQRGKKFGK
        E N I  + SSVEI F+DE++ALIL+SSLP+SW+  + A S+  G+ KLKF+++ D+V+SE I +RE             +E+        +     F  
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKRE-------------IEDSSGRDPSVDQRGKKFGK

Query:  SSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENFMYKN
          WKI K  +V+ARG K  T Y T    N+ A A +D  S+LWH RL H S KGMK L      +G  SD              L+  DV   E++++  
Subjt:  SSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENFMYKN

Query:  KKKKGEKKGSKTMKQLGVEV
        +KK    K  KT K   +E+
Subjt:  KKKKGEKKGSKTMKQLGVEV

KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum]3.0e-5940.34Show/hide
Query:  QKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHINEFNMIVRK
        +K +HEPL+ VKL+ M  + WKLKD+Q LGLI +TLSRNVAFNI+KEKTT  +LKALSN YEKPS  NKVYLM+RLFNLQM E GSVA+HINEFNMIV +
Subjt:  QKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHINEFNMIVRK

Query:  PSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK-------------------------
          SV+INFEDEIKALILMSSLPE   T++ A SS  GSEKLKF++I+D+V S+SI KREI +SSG   SVD+RG+                         
Subjt:  PSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------KFGKSSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLV
                                             +FGK SWKI+KGAMVVARGTK  T +TT  C+NMA VA+  S+  LWHNRL H S K MKML 
Subjt:  -------------------------------------KFGKSSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLV

Query:  VEVVLEGYR
         +  LEG +
Subjt:  VEVVLEGYR

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-9547.71Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLY+K++HEPL  VKLDTMTTEQWKLKD+Q L LI +TLSRN AFNIIKEKTT  +LKALSN YEK S +NKVYLM+RLFNLQMSEGGS+A++IN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVD---------------------
        EFNMIV + S VEINF+DEIKALILMSSLPESWDTV+AA +S +GS+KLKF+EI+D+V+ ESI  R+  DSSG+  S D                     
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVD---------------------

Query:  ----------------------QRG--KKFGKSSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLE---
                              Q+G   +FGKSSWKIVKGAMVVARGTK  T YTT  C+NM A   S SNSSLWHNRL H SVKGMKML+ +  LE   
Subjt:  ----------------------QRG--KKFGKSSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLE---

Query:  -------------------------------------------------------------------------------------------------GYR
                                                                                                         GY 
Subjt:  -------------------------------------------------------------------------------------------------GYR

Query:  SDMSEYLFWDDKNIKVLRHCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE
        S++  Y FWDDKN K+LRH D+TFDEN +YKNK    EK  S+T KQ+ VE++  ++SPSDV    QET   +AEE +VE
Subjt:  SDMSEYLFWDDKNIKVLRHCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE

VFQ59121.1 unnamed protein product [Cuscuta campestris]4.1e-6470.49Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLYQK++ +PL EVK D+MT EQWK+KD+Q LG+I +TL++NVAFNI+KE TT G++KALSN YEKPS +NKVYLM+RLFNLQM E GSVANHIN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK
        +FNMIV +   VEINFEDEIK LIL+SS+PESWD V+AA SS +GSEKL+F+EI+D+V+SESI KRE+ DSSG   SVD+RG+
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK

VFR00719.1 unnamed protein product [Cuscuta campestris]1.5e-7141.13Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLYQK++HEPL  VK D+MT EQWKLKD+Q LG+I +TL++NVAFNI+KE TT G++KALSN YEKPS +NKVYLM+RLFNLQM E GSVANHIN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLP----------------------------------------------------------------------
        +FNMIV +  SVEINFEDEIKALIL+SS+P                                                                      
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLP----------------------------------------------------------------------

Query:  --------------------ESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGKKFGKSSWKIVKGAMVVARGTKFRTPYT
                            ESWDTV+AA SS +GSEKL+F+EI+D+V+SESI KRE+ DSSG   SVDQ+G+   K   +  +        +  R+  T
Subjt:  --------------------ESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGKKFGKSSWKIVKGAMVVARGTKFRTPYT

Query:  TLGC----------------MNMAAVAKSDS------------------NSSLW---HNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLR
           C                 N  +    DS                  N   W   H  + H +    K+  +E++            FWDDKN K+LR
Subjt:  TLGC----------------MNMAAVAKSDS------------------NSSLW---HNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLR

Query:  HCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE
        HCDVTFDE+ +YK++    E+K  ++ KQ+GVEV+L K +P +V A TQ T +TI EEPEVE
Subjt:  HCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE

TrEMBL top hitse value%identityAlignment
A0A2G2YBC9 Intron_maturas2 domain-containing protein1.7e-5573.33Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLYQK +HEPL+ VKL  M  + WKLKD+Q LGLI +TLSRNVAFNI+KEKTT  +LKALSN YEKPS  NKVYLM+RLFNLQM E GSVA+HIN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPK
        EFN+IV +  SV+INFEDEIKALILMSSLPE   T++AA SSF GSEKLKF++I+D+V SESI K
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPK

A0A484K039 Uncharacterized protein2.0e-6470.49Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLYQK++ +PL EVK D+MT EQWK+KD+Q LG+I +TL++NVAFNI+KE TT G++KALSN YEKPS +NKVYLM+RLFNLQM E GSVANHIN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK
        +FNMIV +   VEINFEDEIK LIL+SS+PESWD V+AA SS +GSEKL+F+EI+D+V+SESI KRE+ DSSG   SVD+RG+
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGK

A0A484NK44 CCHC-type domain-containing protein7.5e-7241.13Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIEDYLYQK++HEPL  VK D+MT EQWKLKD+Q LG+I +TL++NVAFNI+KE TT G++KALSN YEKPS +NKVYLM+RLFNLQM E GSVANHIN
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLP----------------------------------------------------------------------
        +FNMIV +  SVEINFEDEIKALIL+SS+P                                                                      
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLP----------------------------------------------------------------------

Query:  --------------------ESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGKKFGKSSWKIVKGAMVVARGTKFRTPYT
                            ESWDTV+AA SS +GSEKL+F+EI+D+V+SESI KRE+ DSSG   SVDQ+G+   K   +  +        +  R+  T
Subjt:  --------------------ESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGKKFGKSSWKIVKGAMVVARGTKFRTPYT

Query:  TLGC----------------MNMAAVAKSDS------------------NSSLW---HNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLR
           C                 N  +    DS                  N   W   H  + H +    K+  +E++            FWDDKN K+LR
Subjt:  TLGC----------------MNMAAVAKSDS------------------NSSLW---HNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLR

Query:  HCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE
        HCDVTFDE+ +YK++    E+K  ++ KQ+GVEV+L K +P +V A TQ T +TI EEPEVE
Subjt:  HCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAEEPEVE

A0A6A2WDE1 NAC domain-containing protein 481.6e-5844.06Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIED++YQKN+++PLL  + + +  E W L D+Q LG+I +TLSRNVAFNI KEKTT G++ ALS+ YEKP T NKV+LM+RLFNL+M+E  SVA H+N
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKRE-------------IEDSSGRDPSVDQRGKKFGK
        E N I  + SSVEI F+DE++ALIL+SSLP+SW+  + A S+  G+ KLKF+++ D+V+SE I +RE             +E+        +     F  
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKRE-------------IEDSSGRDPSVDQRGKKFGK

Query:  SSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENFMYKN
          WKI K  +V+ARG K  T Y T    N+ A A +D  S+LWH RL H S KGMK L      +G  SD              L+  DV   E++++  
Subjt:  SSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENFMYKN

Query:  KKKKGEKKGSKTMKQLGVEV
        +KK    K  KT K   +E+
Subjt:  KKKKGEKKGSKTMKQLGVEV

A0A6A3ATL2 E3 ubiquitin-protein ligase SINAT33.7e-5543.21Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN
        MQIED+LYQKN+++ LL  + + M  E W L D+Q LG+I +TL RNVAFNI KEKTT+G++  LS+ YEK S  NKV+LM+RLFNL+M+E  SVA H+N
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHIN

Query:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDI----VISESIPKREIEDSSGRDPSVDQRGKKFGKS--------
        E N I  + SSV+I F+DE++ALIL+SSLP+SW+  + A SS  G+ KLKF+++QD+    VI E+     +  +S     +   G  F  +        
Subjt:  EFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDI----VISESIPKREIEDSSGRDPSVDQRGKKFGKS--------

Query:  -----SWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENF
              WKI KGA+V+ARG K  T Y T    N+ AVA +D  S++WH RL H S KGMK L    + +G  SD              L++ DV   E+ 
Subjt:  -----SWKIVKGAMVVARGTKFRTPYTTLGCMNMAAVAKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENF

Query:  MYKNKKKKGEKKGSKTMKQLGVEV
        ++  +KK    K  KT K   +E+
Subjt:  MYKNKKKKGEKKGSKTMKQLGVEV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-1730.73Show/hide
Query:  QIEDYLYQKNIHEPLLEV---KLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANH
        ++ D L Q+ +H+ +L+V   K DTM  E W   D++    I + LS +V  NII E T  G+   L + Y   +  NK+YL ++L+ L MSEG +  +H
Subjt:  QIEDYLYQKNIHEPLLEV---KLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANH

Query:  INEFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQD-IVISESIPKREIEDSSGRDPSVDQRGKKFGKSS
        +N FN ++ + +++ +  E+E KA++L++SLP S+D +  AT+   G   ++  ++   ++++E + K+   ++ G+    + RG+ + +SS
Subjt:  INEFNMIVRKPSSVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQD-IVISESIPKREIEDSSGRDPSVDQRGKKFGKSS

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.4e-1443.18Show/hide
Query:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQ
        M+IEDYLY K +H+PL + K++TM+ + W +  +QVL +I +T+S+N+A N+ KEK+  G++K LS+ Y+KPST N V   +   +++
Subjt:  MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGATTGAAGATTATCTGTACCAGAAAAATATTCACGAACCCCTATTGGAGGTTAAGCTGGATACTATGACCACGGAACAATGGAAGCTCAAGGATCAACAGGTCTT
AGGGTTGATCCTGGTGACGCTGTCCAGAAACGTGGCGTTCAACATTATCAAAGAGAAGACAACGTTAGGTATGTTGAAGGCGTTGTCGAATTTCTATGAAAAACCATCGA
CTATAAACAAGGTGTATTTGATGCAGAGATTGTTCAATCTACAAATGTCTGAAGGTGGATCTGTTGCTAATCATATAAATGAATTCAATATGATTGTAAGAAAACCGAGT
TCGGTGGAAATTAATTTTGAAGATGAAATTAAAGCATTGATTTTGATGTCATCTTTACCCGAGTCATGGGATACTGTTATTGCCGCGACTAGTAGTTTCCAAGGATCTGA
GAAACTCAAGTTCAATGAAATCCAAGATATAGTTATTAGTGAGAGTATTCCCAAACGAGAAATTGAGGATTCATCTGGTAGAGATCCTAGTGTTGACCAAAGGGGAAAAA
AATTTGGGAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAAATTTAGAACCCCATACACCACTTTAGGGTGTATGAACATGGCTGCTGTT
GCTAAGAGTGATTCAAATTCAAGTTTATGGCACAATAGACTTGAACATACGAGCGTCAAAGGAATGAAGATGCTGGTTGTGGAAGTAGTTTTGGAAGGCTATAGGTCTGA
CATGTCCGAGTACTTGTTTTGGGATGATAAGAACATAAAAGTCCTAAGACATTGTGACGTGACCTTTGATGAAAATTTCATGTACAAGAACAAAAAGAAGAAAGGAGAGA
AGAAAGGTTCCAAGACAATGAAGCAATTGGGAGTTGAGGTTAAGTTGTTGAAAGATTCACCTAGTGATGTTGTAGCAAATACTCAAGAAACTCTTGAGACTATTGCTGAG
GAACCAGAGGTGGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGATTGAAGATTATCTGTACCAGAAAAATATTCACGAACCCCTATTGGAGGTTAAGCTGGATACTATGACCACGGAACAATGGAAGCTCAAGGATCAACAGGTCTT
AGGGTTGATCCTGGTGACGCTGTCCAGAAACGTGGCGTTCAACATTATCAAAGAGAAGACAACGTTAGGTATGTTGAAGGCGTTGTCGAATTTCTATGAAAAACCATCGA
CTATAAACAAGGTGTATTTGATGCAGAGATTGTTCAATCTACAAATGTCTGAAGGTGGATCTGTTGCTAATCATATAAATGAATTCAATATGATTGTAAGAAAACCGAGT
TCGGTGGAAATTAATTTTGAAGATGAAATTAAAGCATTGATTTTGATGTCATCTTTACCCGAGTCATGGGATACTGTTATTGCCGCGACTAGTAGTTTCCAAGGATCTGA
GAAACTCAAGTTCAATGAAATCCAAGATATAGTTATTAGTGAGAGTATTCCCAAACGAGAAATTGAGGATTCATCTGGTAGAGATCCTAGTGTTGACCAAAGGGGAAAAA
AATTTGGGAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAAATTTAGAACCCCATACACCACTTTAGGGTGTATGAACATGGCTGCTGTT
GCTAAGAGTGATTCAAATTCAAGTTTATGGCACAATAGACTTGAACATACGAGCGTCAAAGGAATGAAGATGCTGGTTGTGGAAGTAGTTTTGGAAGGCTATAGGTCTGA
CATGTCCGAGTACTTGTTTTGGGATGATAAGAACATAAAAGTCCTAAGACATTGTGACGTGACCTTTGATGAAAATTTCATGTACAAGAACAAAAAGAAGAAAGGAGAGA
AGAAAGGTTCCAAGACAATGAAGCAATTGGGAGTTGAGGTTAAGTTGTTGAAAGATTCACCTAGTGATGTTGTAGCAAATACTCAAGAAACTCTTGAGACTATTGCTGAG
GAACCAGAGGTGGAGTAA
Protein sequenceShow/hide protein sequence
MQIEDYLYQKNIHEPLLEVKLDTMTTEQWKLKDQQVLGLILVTLSRNVAFNIIKEKTTLGMLKALSNFYEKPSTINKVYLMQRLFNLQMSEGGSVANHINEFNMIVRKPS
SVEINFEDEIKALILMSSLPESWDTVIAATSSFQGSEKLKFNEIQDIVISESIPKREIEDSSGRDPSVDQRGKKFGKSSWKIVKGAMVVARGTKFRTPYTTLGCMNMAAV
AKSDSNSSLWHNRLEHTSVKGMKMLVVEVVLEGYRSDMSEYLFWDDKNIKVLRHCDVTFDENFMYKNKKKKGEKKGSKTMKQLGVEVKLLKDSPSDVVANTQETLETIAE
EPEVE