; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014181 (gene) of Snake gourd v1 genome

Gene IDTan0014181
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H
Genome locationLG06:25082986..25085395
RNA-Seq ExpressionTan0014181
SyntenyTan0014181
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144467.1 uncharacterized protein LOC111014147 [Momordica charantia]4.8e-5244.15Show/hide
Query:  MEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPPEIRTIFGGPA
        MEI    L+K+PERMKAP  +R  ++YC+FHR HGHAT +C+ L++E+E LIR GYLKEY+++     P     G  + S  R       EIRTI GGP 
Subjt:  MEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPPEIRTIFGGPA

Query:  GGESSRKRKAIAREAR--------------------------------------------QDPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLV
          ES RKRKA  REAR                                                  GFGGERV PEG IE PVTFG G  SVT++++ LV
Subjt:  GGESSRKRKAIAREAR--------------------------------------------QDPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLV

Query:  VECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFK
        V  TS+YNAILGRPT+H L+A+ STYHQ +KFPT   +G +KG+Q+ SR+CY+T++++  + S K
Subjt:  VECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFK

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.9e-4632.66Show/hide
Query:  VAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLL-----QSKEVPDQEKND-KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPP
        + GL D+ L   + ++ PA++ E + +A+K +  ++LL     Q +   D E  D KS D+    N       R     +  G    + +ER+T TT+P 
Subjt:  VAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLL-----QSKEVPDQEKND-KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPP

Query:  EKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYI-QDETPPTPSRNDRGRPESSRGRAEDEQPPEI
         +IL  I       L+K PE+++  P RR   KYC FHR+HGH TS+ ++L+ +IE LI++GY K+++ +  T     + +R R  +   R   ++P  I
Subjt:  EKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYI-QDETPPTPSRNDRGRPESSRGRAEDEQPPEI

Query:  RTIFGGPAGGESSRKRKAIAREARQD-------------------------PY-----------------------------------------------
         TIFGGP+GG+S  KRK +AR AR++                         P+                                               
Subjt:  RTIFGGPAGGESSRKRKAIAREARQD-------------------------PY-----------------------------------------------

Query:  ---FQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKET
             GF GE V PEG I+LPVT G+  + VTQ+  F+VV+  SAYNAI GRP +H  +A+ ST HQ+LK+ T N +G V+G+Q ASR+CY + LK T
Subjt:  ---FQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKET

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.9e-4934.66Show/hide
Query:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKE-VPDQEKNDKSMDQ-----------AGTQNHPPTISKRRRV
        E+ KV    D + +   +  L D+ L   +G++ P ++VE + +A+K +  ++LL++K   P+++ + K + Q            G+ +       RR  
Subjt:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKE-VPDQEKNDKSMDQ-----------AGTQNHPPTISKRRRV

Query:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPS
           SR RP    +ERYTS+T+P  +IL  I       L+K PE+++    +R+  KYC FHRDHGH T++C++L+ +IE LI++GY K+++      +  
Subjt:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPS

Query:  RNDRGRPESSRGRAEDEQPPEIRTIFGGPAGGESSRKRKAIAREARQ-----------------DPYFQGFG-----------------GERVTPEGS-I
        + +  +   +  R ED +P  I TIFGGP GG+S  KRK +AREAR+                 D   +G                     RV  +G  I
Subjt:  RNDRGRPESSRGRAEDEQPPEIRTIFGGPAGGESSRKRKAIAREARQ-----------------DPYFQGFG-----------------GERVTPEGS-I

Query:  ELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALK
        +LPVT G+  + VTQ+  F+V++  SAYNAI GRP +H  +AV ST HQ+LK+ T N++G+V+G+QK SR+CY +ALK
Subjt:  ELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALK

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]4.1e-5131.49Show/hide
Query:  KVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSK--EVPDQEKNDKSMDQAGTQN--HPPTISKRRRVEESSRGRPED-
        +V+G+ D   L  I+ GL+   L  S+ K+ P SY E +ARA+KY +A++  +++  E  +  K  K  D    +    P    +R +     R    D 
Subjt:  KVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSK--EVPDQEKNDKSMDQAGTQN--HPPTISKRRRVEESSRGRPED-

Query:  -----QSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPE
               F  +T    P E+ILM++ +  L + P  MK  P RR+P KYC FH+DHGH TS C++L+++IE+L+R+G L+EY+++      S     +PE
Subjt:  -----QSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPE

Query:  SSRGRA------EDEQPPEIRTIFGGPAGGESSRKRKAIAREARQDPY----------------------------------------------------
        SS+ R        DE   ++  I+GGPA G+S + RK +AR+AR +P                                                     
Subjt:  SSRGRA------EDEQPPEIRTIFGGPAGGESSRKRKAIAREARQDPY----------------------------------------------------

Query:  --------------------------------FQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKF
                                          GF G  V PEG IEL V+FG+  + VT ++NF+VV+  S+YNA+LGRPTL+ LKA  S YH  LKF
Subjt:  --------------------------------FQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKF

Query:  PTTNDIGVVKGQQKASRKCYFTALKETRKRSFKRDLDPNMVEADHDMAEAD
        PT   +GVV+G+QK +R+CY  A +  ++      LDP  V+ D   +  D
Subjt:  PTTNDIGVVKGQQKASRKCYFTALKETRKRSFKRDLDPNMVEADHDMAEAD

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]2.6e-4531.33Show/hide
Query:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSK---EVPDQEKNDKSMDQAG--TQNHPPTISKRRRV-------
        E  +V+G+ D   L  ++ GLQ   L  S+ K  P++Y E ++RA+KY +A++  QSK   E  +  KN K+ +         P   S++ +        
Subjt:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSK---EVPDQEKNDKSMDQAG--TQNHPPTISKRRRV-------

Query:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRND
         +  R RP  Q F  YT    P E ILM++ +  L K P  +K+   RR+  KYC F++D GH TS C+ L+++IE+L+R+  L+ Y++ +     SR +
Subjt:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRND

Query:  RGRPESSR---GRAE----DEQPPEIRTIFGGPAGGESSRKRKAIAREARQDP-----------------------------------------------
        R  PESSR   G+ +    DE    +  I+GGP  G S + RK +AR+AR +P                                               
Subjt:  RGRPESSR---GRAE----DEQPPEIRTIFGGPAGGESSRKRKAIAREARQDP-----------------------------------------------

Query:  -------------------------------------YFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTY
                                                GF G  V PEG IEL V+FG+  + VT ++ F+VV+  SAYN++LGRPTL+ +KA  S Y
Subjt:  -------------------------------------YFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTY

Query:  HQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFKRDLDPNMVEAD
        H  LKFPT   IGVV+G QK +R+CY  + K+  +      LDP  V  D
Subjt:  HQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFKRDLDPNMVEAD

TrEMBL top hitse value%identityAlignment
A0A2N9ECS2 RNase H domain-containing protein1.5e-4635.75Show/hide
Query:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKNDKSMDQAGTQNHPPTISK---RRRVEESSRGRPE
        E   V+G  D   L A ++GLQ    L S+ K  P S  E +  AQ+YM+ +D LQ+++   + +ND +     T    P + +   ++R + S RG   
Subjt:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKNDKSMDQAGTQNHPPTISK---RRRVEESSRGRPE

Query:  DQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRG
        ++ F  +T    P ++I M+I     +++P ++   P RR   KYC FHRDHGH T +CY L+ +IE LI++G L+ ++  E      R  + RP+  R 
Subjt:  DQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRG

Query:  RAEDE-QPP--EIRTIFGG-PAGGESSRKRKAIAREARQ------------------------DPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINF
          ED  +PP  EI  I GG  AGG S   RKA AR+                           +    GF G  V P G I L +  G      T+ + F
Subjt:  RAEDE-QPP--EIRTIFGG-PAGGESSRKRKAIAREARQ------------------------DPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINF

Query:  LVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTAL
        LVV+C SAYN I+GRPTL++L+AV STYH L++FPT + IG +KG Q  +R+CY T++
Subjt:  LVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTAL

A0A2N9F5L1 RNase H domain-containing protein8.6e-4732.83Show/hide
Query:  LLKVREVERWDLEALIEQVDPLFTDKIMNERIPEKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKND
        LL V+++E   L   + +          NE   E  K++   +  T+ A +AGL+    L  + K  P +  E +  A K+M+A+D L++ + P  ++  
Subjt:  LLKVREVERWDLEALIEQVDPLFTDKIMNERIPEKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKND

Query:  KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALI
        ++ D+   +     + K     E  R       F  +T    P +K+L++I     +++P ++++ P  R    YC FHRDHGH T +C  L +++E LI
Subjt:  KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALI

Query:  REGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPP---EIRTIFGGPAGGESSR-KRKAIAREA-------------RQDPYFQGF-----------
        R+G L++Y+       P+     +P + R +AE  +P    EIRTI GGPA G +SR  R+A AR+A             R D     F           
Subjt:  REGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPP---EIRTIFGGPAGGESSR-KRKAIAREA-------------RQDPYFQGF-----------

Query:  -----------GGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTAL
                    G++V P G + LP+T G    +V++ ++FLVV C SAYNAI+GRPTL++L+AV STYH LLKFPT + IG V+G Q A+R+CY  +L
Subjt:  -----------GGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTAL

A0A2N9H7V3 Ribonuclease H3.8e-4732.37Show/hide
Query:  LLKVREVERWDLEALIEQVDPLFTDKIMNERIPEKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKND
        LL V+++E   L   + +          NE   E  K++   +  T+ A +AGL+    L  + K  P +  E +  A K+M+A+D L++ E P  ++  
Subjt:  LLKVREVERWDLEALIEQVDPLFTDKIMNERIPEKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKND

Query:  KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALI
        ++ D+   +     + K     E  R       F  +T    P +K+L++I     +++P ++++ P  R    YC FHRDHGH T  C  L+++IE LI
Subjt:  KSMDQAGTQNHPPTISKRRRVEESSRGRPEDQSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALI

Query:  REGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQP---PEIRTIFGGPAGGESSR-KRKAIAREARQ-------------------------------
        R+G L++Y+   T   PS     +P + R +AE  +P    EIRTI GGPA G +SR  RKA AR+                                  
Subjt:  REGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQP---PEIRTIFGGPAGGESSR-KRKAIAREARQ-------------------------------

Query:  ----------------------DPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIG
                              D    GF G++V P G + LP+T G    +V++ ++FLVV C SAYNAI+GRPTL++L+AV STYH LLKFPT + IG
Subjt:  ----------------------DPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIG

Query:  VVKGQQKASRKCYFTAL
         V+G Q A+R+CY  +L
Subjt:  VVKGQQKASRKCYFTAL

A0A6J1CTS4 uncharacterized protein LOC1110141472.3e-5244.15Show/hide
Query:  MEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPPEIRTIFGGPA
        MEI    L+K+PERMKAP  +R  ++YC+FHR HGHAT +C+ L++E+E LIR GYLKEY+++     P     G  + S  R       EIRTI GGP 
Subjt:  MEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPPEIRTIFGGPA

Query:  GGESSRKRKAIAREAR--------------------------------------------QDPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLV
          ES RKRKA  REAR                                                  GFGGERV PEG IE PVTFG G  SVT++++ LV
Subjt:  GGESSRKRKAIAREAR--------------------------------------------QDPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLV

Query:  VECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFK
        V  TS+YNAILGRPT+H L+A+ STYHQ +KFPT   +G +KG+Q+ SR+CY+T++++  + S K
Subjt:  VECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALKETRKRSFK

A0A6J1DZB9 uncharacterized protein LOC1110249041.4e-4934.66Show/hide
Query:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKE-VPDQEKNDKSMDQ-----------AGTQNHPPTISKRRRV
        E+ KV    D + +   +  L D+ L   +G++ P ++VE + +A+K +  ++LL++K   P+++ + K + Q            G+ +       RR  
Subjt:  EKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKE-VPDQEKNDKSMDQ-----------AGTQNHPPTISKRRRV

Query:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPS
           SR RP    +ERYTS+T+P  +IL  I       L+K PE+++    +R+  KYC FHRDHGH T++C++L+ +IE LI++GY K+++      +  
Subjt:  EESSRGRPEDQSFERYTSTTLPPEKILMEIHHL---NLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPS

Query:  RNDRGRPESSRGRAEDEQPPEIRTIFGGPAGGESSRKRKAIAREARQ-----------------DPYFQGFG-----------------GERVTPEGS-I
        + +  +   +  R ED +P  I TIFGGP GG+S  KRK +AREAR+                 D   +G                     RV  +G  I
Subjt:  RNDRGRPESSRGRAEDEQPPEIRTIFGGPAGGESSRKRKAIAREARQ-----------------DPYFQGFG-----------------GERVTPEGS-I

Query:  ELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALK
        +LPVT G+  + VTQ+  F+V++  SAYNAI GRP +H  +AV ST HQ+LK+ T N++G+V+G+QK SR+CY +ALK
Subjt:  ELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKASRKCYFTALK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCGATTCATAGGGAACCAAAGCCGAAGCGAAGGAATAGAAGAATCTGATCTTGACCGTCACCCCCGGAGAATTAGAACAATCTCCTCTCCTCGGCCTCCTACTGA
CACAGGCCGACCACACGAGAGGATTGTCGAATTAAGAGAGAAGGTCGACAAGATGGACGAGTGCCTGTGGAAGATTTTGGAGAAGCTCGACCAGCAAGATCAACACAAGG
GAAAAGCGGAGGTTGCAGAAGGGATGGGGGAGTCAAAATCCCCAACCCAAAAGCACATCGATAAGTTGATCCTTGGAGGAAACAACATACCGCGTCGATCTTATGCCAAG
ATTGACTCACACACTTACCGCAGGAATCAACCAGGCAACTCGAAAGTAGTGGCCGAAAAAAGCTCCGGGGCGACGTTAGACTTGCGAGAAATAATTAACCAGAAGCGGAA
AGCACAGGCCGAGGCCGACCAGTTCGCAGCCGAGGCCGATCAGATGACGGACGAGGCCGACCCGAGGAGAACTGAGGACGACGTAGTTGGAGCGACTCCACCCTGGAAGG
CTGACATCCGTAAAATGCAGAAAGAGTTTGGTTATCTTCAAGGGGACCTCTTGAAGGTTCGAGAGGTCGAGAGGTGGGATCTTGAGGCTCTTATCGAGCAAGTTGATCCC
CTCTTCACCGACAAGATAATGAACGAGAGAATTCCAGAGAAATTCAAAGTCGAAGGTTTCGTAGATACTACGACATTAATTGCCATAGTTGCAGGTCTGCAAGATAAGGG
TTTATTGAGATCAATTGGCAAGAAAGAACCAGCATCTTACGTTGAATTCATAGCAAGAGCACAAAAATACATGAGCGCAAAAGACCTATTGCAATCCAAAGAAGTACCCG
ACCAAGAGAAGAACGACAAAAGCATGGATCAAGCTGGAACACAAAACCATCCTCCAACAATCTCCAAAAGAAGAAGAGTCGAAGAAAGTAGTCGAGGCCGACCAGAGGAT
CAGTCATTCGAAAGATACACCTCCACGACTCTTCCTCCTGAAAAGATCTTAATGGAGATCCACCATTTAAATCTTGTGAAGTTTCCCGAGAGAATGAAAGCTCCACCTGG
AAGAAGGGACCCAACCAAGTATTGCATGTTTCACAGGGACCATGGGCACGCGACCTCCAATTGTTACCAACTCCGAGACGAGATCGAAGCTCTAATCAGAGAAGGATACT
TGAAAGAATACATTCAAGACGAGACTCCACCCACGCCATCAAGAAACGACCGAGGCCGACCAGAATCAAGCCGAGGCCGAGCCGAGGACGAACAACCTCCAGAGATCCGA
ACAATTTTTGGAGGACCGGCGGGGGGCGAATCCAGCAGAAAGAGGAAGGCAATTGCTCGTGAAGCTCGACAAGATCCTTATTTCCAAGGTTTTGGAGGAGAAAGAGTTAC
CCCAGAAGGCAGCATAGAGCTCCCAGTCACCTTTGGCGAAGGGACGAGCTCGGTAACACAAATAATCAACTTCCTAGTAGTGGAATGCACTTCTGCCTACAACGCCATCC
TTGGGAGGCCAACACTTCATCAGTTAAAGGCAGTTGCTTCCACGTACCATCAGTTGTTGAAGTTTCCAACCACCAACGACATTGGGGTCGTTAAAGGACAACAAAAAGCT
TCGAGGAAATGCTACTTCACTGCATTAAAAGAGACAAGGAAACGAAGCTTTAAAAGAGACCTCGACCCCAACATGGTCGAGGCCGACCACGATATGGCCGAGGCCGACGT
GGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCGATTCATAGGGAACCAAAGCCGAAGCGAAGGAATAGAAGAATCTGATCTTGACCGTCACCCCCGGAGAATTAGAACAATCTCCTCTCCTCGGCCTCCTACTGA
CACAGGCCGACCACACGAGAGGATTGTCGAATTAAGAGAGAAGGTCGACAAGATGGACGAGTGCCTGTGGAAGATTTTGGAGAAGCTCGACCAGCAAGATCAACACAAGG
GAAAAGCGGAGGTTGCAGAAGGGATGGGGGAGTCAAAATCCCCAACCCAAAAGCACATCGATAAGTTGATCCTTGGAGGAAACAACATACCGCGTCGATCTTATGCCAAG
ATTGACTCACACACTTACCGCAGGAATCAACCAGGCAACTCGAAAGTAGTGGCCGAAAAAAGCTCCGGGGCGACGTTAGACTTGCGAGAAATAATTAACCAGAAGCGGAA
AGCACAGGCCGAGGCCGACCAGTTCGCAGCCGAGGCCGATCAGATGACGGACGAGGCCGACCCGAGGAGAACTGAGGACGACGTAGTTGGAGCGACTCCACCCTGGAAGG
CTGACATCCGTAAAATGCAGAAAGAGTTTGGTTATCTTCAAGGGGACCTCTTGAAGGTTCGAGAGGTCGAGAGGTGGGATCTTGAGGCTCTTATCGAGCAAGTTGATCCC
CTCTTCACCGACAAGATAATGAACGAGAGAATTCCAGAGAAATTCAAAGTCGAAGGTTTCGTAGATACTACGACATTAATTGCCATAGTTGCAGGTCTGCAAGATAAGGG
TTTATTGAGATCAATTGGCAAGAAAGAACCAGCATCTTACGTTGAATTCATAGCAAGAGCACAAAAATACATGAGCGCAAAAGACCTATTGCAATCCAAAGAAGTACCCG
ACCAAGAGAAGAACGACAAAAGCATGGATCAAGCTGGAACACAAAACCATCCTCCAACAATCTCCAAAAGAAGAAGAGTCGAAGAAAGTAGTCGAGGCCGACCAGAGGAT
CAGTCATTCGAAAGATACACCTCCACGACTCTTCCTCCTGAAAAGATCTTAATGGAGATCCACCATTTAAATCTTGTGAAGTTTCCCGAGAGAATGAAAGCTCCACCTGG
AAGAAGGGACCCAACCAAGTATTGCATGTTTCACAGGGACCATGGGCACGCGACCTCCAATTGTTACCAACTCCGAGACGAGATCGAAGCTCTAATCAGAGAAGGATACT
TGAAAGAATACATTCAAGACGAGACTCCACCCACGCCATCAAGAAACGACCGAGGCCGACCAGAATCAAGCCGAGGCCGAGCCGAGGACGAACAACCTCCAGAGATCCGA
ACAATTTTTGGAGGACCGGCGGGGGGCGAATCCAGCAGAAAGAGGAAGGCAATTGCTCGTGAAGCTCGACAAGATCCTTATTTCCAAGGTTTTGGAGGAGAAAGAGTTAC
CCCAGAAGGCAGCATAGAGCTCCCAGTCACCTTTGGCGAAGGGACGAGCTCGGTAACACAAATAATCAACTTCCTAGTAGTGGAATGCACTTCTGCCTACAACGCCATCC
TTGGGAGGCCAACACTTCATCAGTTAAAGGCAGTTGCTTCCACGTACCATCAGTTGTTGAAGTTTCCAACCACCAACGACATTGGGGTCGTTAAAGGACAACAAAAAGCT
TCGAGGAAATGCTACTTCACTGCATTAAAAGAGACAAGGAAACGAAGCTTTAAAAGAGACCTCGACCCCAACATGGTCGAGGCCGACCACGATATGGCCGAGGCCGACGT
GGTCTAA
Protein sequenceShow/hide protein sequence
MSRFIGNQSRSEGIEESDLDRHPRRIRTISSPRPPTDTGRPHERIVELREKVDKMDECLWKILEKLDQQDQHKGKAEVAEGMGESKSPTQKHIDKLILGGNNIPRRSYAK
IDSHTYRRNQPGNSKVVAEKSSGATLDLREIINQKRKAQAEADQFAAEADQMTDEADPRRTEDDVVGATPPWKADIRKMQKEFGYLQGDLLKVREVERWDLEALIEQVDP
LFTDKIMNERIPEKFKVEGFVDTTTLIAIVAGLQDKGLLRSIGKKEPASYVEFIARAQKYMSAKDLLQSKEVPDQEKNDKSMDQAGTQNHPPTISKRRRVEESSRGRPED
QSFERYTSTTLPPEKILMEIHHLNLVKFPERMKAPPGRRDPTKYCMFHRDHGHATSNCYQLRDEIEALIREGYLKEYIQDETPPTPSRNDRGRPESSRGRAEDEQPPEIR
TIFGGPAGGESSRKRKAIAREARQDPYFQGFGGERVTPEGSIELPVTFGEGTSSVTQIINFLVVECTSAYNAILGRPTLHQLKAVASTYHQLLKFPTTNDIGVVKGQQKA
SRKCYFTALKETRKRSFKRDLDPNMVEADHDMAEADVV