; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G24390 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G24390
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr2:20919617..20920651
RNA-Seq ExpressionCSPI02G24390
SyntenyCSPI02G24390
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051973.1 E3 ubiquitin-protein ligase SHPRH isoform X4 [Cucumis melo var. makuwa]9.0e-9157.6Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQND-HRFGEARGQRARGYFRNMNN
        MAGRRG  PAAG+N  QE  +EIT L PRT+ VRLLA+E SLGDLH+KFDR+++++E    +EE          +++ D   F     ++ R     +N+
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQND-HRFGEARGQRARGYFRNMNN

Query:  PRGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNM-WNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENF
                                ++   ++      SG       W   D L  G+  +  EV    +HDYKMKIDLP Y+GKR+ ++FLDW+KSTENF
Subjt:  PRGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNM-WNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENF

Query:  FNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQ
        FNYM+TP+RKKVHLV LKLRVGASAWWDQLEIN+QR GK PIR WEKMKKLLKARFLP NYEQTLYNQYQ CRQG RSVA+YIEEFHRLS RTNLSENE 
Subjt:  FNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQ

Query:  HQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAI
        HQ+AR++GGLRFDIKEKVKLQP  FLSEAIS AETV+EMIAI
Subjt:  HQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAI

TYK11204.1 reverse transcriptase [Cucumis melo var. makuwa]2.8e-9262.05Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MAGRR   PA G+NR QE  +E  ALSPRT+TV  LAVE+SLGDLH KFDR+M+++E L+RR +      R E N  ND   G   G+RAR  FRN+ N 
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
        R  QR+RP     +  D D  E+ EAWQ  QE DSSS DEQGN+WN N D++  +  R +E +R  +HDYKMKIDLP Y GK ++E+FLDWI++TENFFN
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ
        YM+T +RKKVHLVALKLR GASAWWDQ+EIN+QR GK PI SWEKMKKL+KARFLP NYEQTLYNQYQ CRQG +SVA+YI+EFH+LSARTNL   E+  
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ

Query:  VAR
         AR
Subjt:  VAR

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]3.6e-17289.21Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MAGRR NNPA GENR QEAA+EIT LSP+T+TVRLLAVEESLGDLHNKFDRLMESVELL+RREEFPQPPPRNEIN QND RFGE RG+RARGY RNMNNP
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
        RG QRRRPGYAI QQ DEDFQEDQEAWQE QEDDSSSGDEQGNMWN ND+ RAGRNN+R E RRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ
        YM+TPERKKVHLVALKLR GASAWWDQLEIN+QRCGKQP+RSWEKMKKLLKARFLP NYEQTLYNQYQ CRQGVRSVA+YIEEFHRLSARTNLSENEQHQ
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ

Query:  VARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR
        VAR++GGLRFDIKEKV+LQP  FLSEAISFAETVEEMIAIR +
Subjt:  VARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]5.3e-16887.46Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MAGRR NNPA GENR QEA +EIT LSP+T+TVRLLAVEESLGDLHNKFD+LMESVELL+RR EFPQPPPRNEIN QND RFGEARG+RARGY RNMNNP
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
        RG QRRRPGYAI QQ DED QEDQE WQE QEDDSSSGDEQGNMWN ND+ RAGRNNRR E RRGEYHDYKMKIDLPMY GKRNIEAFLDWIKSTENFF 
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ
        YM+TPERKKVHLVALKLR GASAWWDQLEIN+QRCGKQP+RSWEKMKKLLKARFLP NYEQTLYNQYQ CRQGVR+VA+YIEEFHRLSARTNLSENEQHQ
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ

Query:  VARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR
        VAR++GGLRFDIKEKV+LQP  FLSEAISFAETVEEMIAIR +
Subjt:  VARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]1.3e-8682.5Show/hide
Query:  MWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSW
        MWNLNDDLRAGRNNRR EVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFNYM+TPERKKVHLVALKLR GASAWWDQLEIN+QRCGKQPIRSW
Subjt:  MWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSW

Query:  EKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR
        EKMKKLLKARFLP NYEQTLYNQYQ CRQGVRSVADYIEEFHRLSARTNLSENEQHQVAR++G                         ETVEEMIAIR +
Subjt:  EKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPR

TrEMBL top hitse value%identityAlignment
A0A5A7SX86 Reverse transcriptase3.2e-6253.36Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MA +RG   AAG+NR QE A+EIT LSPRTT VRLLAVE+SLGDLH+KFDR+M+                                              
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
          + RRR            F  D               DEQGN+WN NDD + G+  +R E RR  +HDY+MKIDL +Y+GKR+IE+FLDW+KSTENFFN
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQ
        YM+ P+RKKVHLVALKLR GAS WWDQLEIN+QRCGK  IRSWEKMKKLLKARFLP N+EQTLYNQYQ
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQ

A0A5A7U7S6 E3 ubiquitin-protein ligase SHPRH isoform X44.3e-9157.6Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQND-HRFGEARGQRARGYFRNMNN
        MAGRRG  PAAG+N  QE  +EIT L PRT+ VRLLA+E SLGDLH+KFDR+++++E    +EE          +++ D   F     ++ R     +N+
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQND-HRFGEARGQRARGYFRNMNN

Query:  PRGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNM-WNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENF
                                ++   ++      SG       W   D L  G+  +  EV    +HDYKMKIDLP Y+GKR+ ++FLDW+KSTENF
Subjt:  PRGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNM-WNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENF

Query:  FNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQ
        FNYM+TP+RKKVHLV LKLRVGASAWWDQLEIN+QR GK PIR WEKMKKLLKARFLP NYEQTLYNQYQ CRQG RSVA+YIEEFHRLS RTNLSENE 
Subjt:  FNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQ

Query:  HQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAI
        HQ+AR++GGLRFDIKEKVKLQP  FLSEAIS AETV+EMIAI
Subjt:  HQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAI

A0A5D3CJ99 Reverse transcriptase1.3e-9262.05Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MAGRR   PA G+NR QE  +E  ALSPRT+TV  LAVE+SLGDLH KFDR+M+++E L+RR +      R E N  ND   G   G+RAR  FRN+ N 
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
        R  QR+RP     +  D D  E+ EAWQ  QE DSSS DEQGN+WN N D++  +  R +E +R  +HDYKMKIDLP Y GK ++E+FLDWI++TENFFN
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ
        YM+T +RKKVHLVALKLR GASAWWDQ+EIN+QR GK PI SWEKMKKL+KARFLP NYEQTLYNQYQ CRQG +SVA+YI+EFH+LSARTNL   E+  
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQ

Query:  VAR
         AR
Subjt:  VAR

A0A5D3DGR0 Reverse transcriptase8.2e-7450.85Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGD----LHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRN
        M  +RG  PAA E   ++ A E   LSPRT++  L +VE S+ +    L+    RL E    L+ R+   +PP      +QN  R    RG+R   YFR 
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGD----LHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRN

Query:  MNNPRGFQRRRPGYAIPQQ---FDEDFQEDQEAWQEIQED-DSSSGDEQGNMWNLNDDLRAGRNNR--RNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLD
          N   FQ R     IP+      +  ++ +  WQ  +E+ ++SS  E+ +    NDD+   R +R  +NE ++ E  +YKMKIDLP YDGKRNIE FLD
Subjt:  MNNPRGFQRRRPGYAIPQQ---FDEDFQEDQEAWQEIQED-DSSSGDEQGNMWNLNDDLRAGRNNR--RNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLD

Query:  WIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSAR
        W+K+TENFF YM T + KKVHLVALKL+ GASAWWDQ+ +N+Q+ GK PIRSWEKMKKL+K RF+P NYEQTLY QYQ CRQG+R  A+YIEEFHRL  R
Subjt:  WIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSAR

Query:  TNLSENEQHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPRT
        TNL E E+H ++ ++GGLRFD+KEKVKLQP   LSEAI++AETVEEMI  R ++
Subjt:  TNLSENEQHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIRPRT

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X14.1e-6547.97Show/hide
Query:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP
        MAGRRGNNP AG      A+K+                         ++ +L  S++   R       P R E   +ND   G   G+RAR  ++N  N 
Subjt:  MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNP

Query:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN
           QRRRP     Q  D++ QE+ E WQ  Q+ DSS GDEQGN+WN + + R  +  R  E RR  YHDYKMKIDLP Y+GKR+IE+FLDWIK+TENFF 
Subjt:  RGFQRRRPGYAIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFN

Query:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENE
        YM  P+RKKVHLVALKL+ GASAW                               P++Y Q +   Y+QYQ CRQG + VA+YIEEFHRL AR NLSENE
Subjt:  YMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTL---YNQYQKCRQGVRSVADYIEEFHRLSARTNLSENE

Query:  QHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIR
        QHQ+AR++GGLRFDIKEKVKL     LSEAIS AETVEEM+ +R
Subjt:  QHQVARYMGGLRFDIKEKVKLQPLCFLSEAISFAETVEEMIAIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein9.5e-0630.43Show/hide
Query:  FLDWIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLP
        +L W  +   +F +  T +  K+ +   +L+  A  WWDQ E N+    + PIR+WE++K  + A++ P
Subjt:  FLDWIKSTENFFNYMETPERKKVHLVALKLRVGASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTCGGAGAGGTAACAACCCTGCCGCGGGAGAAAATCGTGCGCAAGAAGCAGCGAAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGC
TGTTGAGGAATCCTTGGGAGATCTCCATAATAAATTTGATAGATTGATGGAAAGCGTCGAATTGTTAAGCCGAAGGGAAGAATTCCCACAACCACCACCACGGAACGAAA
TTAACATCCAAAACGACCATCGTTTTGGTGAAGCAAGAGGTCAGCGAGCAAGGGGATACTTCAGAAACATGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGTTAC
GCCATACCACAACAGTTTGACGAAGACTTTCAAGAAGATCAAGAAGCATGGCAAGAAATCCAAGAAGATGATTCTTCAAGCGGGGATGAACAAGGAAACATGTGGAACCT
CAATGACGACTTACGAGCAGGAAGAAATAACCGAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCATGTATGATGGCAAACGAA
ATATAGAAGCATTCTTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGAGACACCAGAACGCAAGAAAGTCCACCTAGTAGCCTTAAAATTAAGAGTCGGT
GCGTCAGCTTGGTGGGATCAGTTGGAAATTAACAAACAAAGATGTGGGAAACAGCCGATTCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCT
AAACTATGAACAAACACTGTACAATCAGTACCAAAAATGCCGCCAAGGTGTCCGATCAGTAGCTGACTACATTGAAGAATTCCACCGCTTGAGTGCAAGAACAAACCTAA
GCGAAAATGAACAACATCAGGTTGCAAGATATATGGGAGGTCTCCGATTCGACATCAAGGAAAAGGTCAAACTACAACCATTATGTTTCCTGTCTGAAGCAATATCCTTT
GCAGAAACAGTGGAAGAGATGATTGCGATTCGTCCAAGAACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTCGGAGAGGTAACAACCCTGCCGCGGGAGAAAATCGTGCGCAAGAAGCAGCGAAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGC
TGTTGAGGAATCCTTGGGAGATCTCCATAATAAATTTGATAGATTGATGGAAAGCGTCGAATTGTTAAGCCGAAGGGAAGAATTCCCACAACCACCACCACGGAACGAAA
TTAACATCCAAAACGACCATCGTTTTGGTGAAGCAAGAGGTCAGCGAGCAAGGGGATACTTCAGAAACATGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGTTAC
GCCATACCACAACAGTTTGACGAAGACTTTCAAGAAGATCAAGAAGCATGGCAAGAAATCCAAGAAGATGATTCTTCAAGCGGGGATGAACAAGGAAACATGTGGAACCT
CAATGACGACTTACGAGCAGGAAGAAATAACCGAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCATGTATGATGGCAAACGAA
ATATAGAAGCATTCTTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGAGACACCAGAACGCAAGAAAGTCCACCTAGTAGCCTTAAAATTAAGAGTCGGT
GCGTCAGCTTGGTGGGATCAGTTGGAAATTAACAAACAAAGATGTGGGAAACAGCCGATTCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCT
AAACTATGAACAAACACTGTACAATCAGTACCAAAAATGCCGCCAAGGTGTCCGATCAGTAGCTGACTACATTGAAGAATTCCACCGCTTGAGTGCAAGAACAAACCTAA
GCGAAAATGAACAACATCAGGTTGCAAGATATATGGGAGGTCTCCGATTCGACATCAAGGAAAAGGTCAAACTACAACCATTATGTTTCCTGTCTGAAGCAATATCCTTT
GCAGAAACAGTGGAAGAGATGATTGCGATTCGTCCAAGAACCTAA
Protein sequenceShow/hide protein sequence
MAGRRGNNPAAGENRAQEAAKEITALSPRTTTVRLLAVEESLGDLHNKFDRLMESVELLSRREEFPQPPPRNEINIQNDHRFGEARGQRARGYFRNMNNPRGFQRRRPGY
AIPQQFDEDFQEDQEAWQEIQEDDSSSGDEQGNMWNLNDDLRAGRNNRRNEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFNYMETPERKKVHLVALKLRVG
ASAWWDQLEINKQRCGKQPIRSWEKMKKLLKARFLPLNYEQTLYNQYQKCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARYMGGLRFDIKEKVKLQPLCFLSEAISF
AETVEEMIAIRPRT