; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018170 (gene) of Snake gourd v1 genome

Gene IDTan0018170
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H
Genome locationLG07:16656334..16664214
RNA-Seq ExpressionTan0018170
SyntenyTan0018170
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055957.1 uncharacterized protein E6C27_scaffold319G00830 [Cucumis melo var. makuwa]7.0e-4146.09Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLT-------------------------MEFKSLKIFDERPELSLTQKKLLKEGYT
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L                           EFKSLKI  E+P+LS TQKKLL+EG+ 
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLT-------------------------MEFKSLKIFDERPELSLTQKKLLKEGYT

Query:  IPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKC
        IP SRKGLGYKSP  +RI RKGK KV D NHITV+EVD  +EK+   QRTS F RI P VAR    +RLS T+ +       S                 
Subjt:  IPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKC

Query:  MELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
          L R SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  MELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-3928.74Show/hide
Query:  KLIEGPSKDRVVVKDNPL-FDQFTPAVGQSKEASN-QDVMS-------VMMADVESDERMAEMER----KISLLMKRSLVQFGSFEPIVVWMNDEPSSKN
        K+++   K+ +VV   PL F +      + K+  N  D++        + + + +  E+   ++     K   ++   + ++   + +++ +  E   K 
Subjt:  KLIEGPSKDRVVVKDNPL-FDQFTPAVGQSKEASN-QDVMS-------VMMADVESDERMAEMER----KISLLMKRSLVQFGSFEPIVVWMNDEPSSKN

Query:  SQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDENLFRPRPLVTLEEFFPKNFLSKSQD
          E  +  +  QEK +  E+++EGWTVVTRRKKR+ +  QKESRL+ +++R +K+QK K+K+ T+K      +D++  R + +VTL +FFP  FL   QD
Subjt:  SQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDENLFRPRPLVTLEEFFPKNFLSKSQD

Query:  EAFEVVACHVTGTIEDPSCSYETT----------------------------------------------------------------------------
        E   VVA     +   P  +YE+T                                                                            
Subjt:  EAFEVVACHVTGTIEDPSCSYETT----------------------------------------------------------------------------

Query:  -------------TDRPKEVPQIDV---------------KEETTKCTNGPALRNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKS
                      D   EV  ++V                 E+ K T       +E ST+ AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + 
Subjt:  -------------TDRPKEVPQIDV---------------KEETTKCTNGPALRNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKS

Query:  LTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYK
        L +                                                           EFKSLKI  E+P+LS TQKKLL+EG+ IP SRKGLGYK
Subjt:  LTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYK

Query:  SPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTK
         P  +RI RKGK KV D+NHITV+EVD  +EK+  +QRTS F R+ P VAR    +RLS  + E       S   R ST  R+ M  K
Subjt:  SPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTK

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-0546.38Show/hide
Query:  IANKIVKLIEGPSKDRVVVKDNPLFDQFTPAVGQSKEASNQDVMSVMMADVESDERMAEMERKISLLMK
        +A  I+K +    K  +V+K+NPL+D    +  +SK+ ++ DVMSVMMAD+  +  MAEMERKI+ LMK
Subjt:  IANKIVKLIEGPSKDRVVVKDNPLFDQFTPAVGQSKEASNQDVMSVMMADVESDERMAEMERKISLLMK

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-3740Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM----------------------------------------------------
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L +                                                    
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM----------------------------------------------------

Query:  -------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSAL
               EFKSLKI+ E+P+LS TQKKLL+EG+ IP SRKGLGYKSP  +RI RKGK KV D+NHIT++E D  +EK+  +QRTS F RI P VAR    
Subjt:  -------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSAL

Query:  QRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
        ++LS T+ E       S                   L R SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  QRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

TYK28162.1 uncharacterized protein E5676_scaffold289G00760 [Cucumis melo var. makuwa]5.9e-4043.98Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------EFKSLKIFDERPELSLT
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L +                                   +FKSLKI  E+P+LS T
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------EFKSLKIFDERPELSLT

Query:  QKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTST
        QKKLL+EG+ IP SRKGLGYKSP  +RI RKGK KV D+NHITV+EVD  +EK+   QRTS F RI P VAR    +RLS T+ E       S       
Subjt:  QKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTST

Query:  LMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
                    L + SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  LMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]3.6e-3727.83Show/hide
Query:  KRSLVQFGSFEPIVVWMNDEPSSKNSQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDE
        ++SLVQFG+FEPIVV    E S ++           Q + +  E+++EGW VVT RKKRQ   TQ+ESR +++++R +K+QK K+K+ T K      ED 
Subjt:  KRSLVQFGSFEPIVVWMNDEPSSKNSQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDE

Query:  NLFRPRPLVTLEEFFPKNFLSKSQDEAFEVVACHVTGTIED--------------------------------------------------PSCSYETTT
        N  RP+ LVTL +F PK+FL   QDE  EVVACH   T E+                                                  P+ +YE+ +
Subjt:  NLFRPRPLVTLEEFFPKNFLSKSQDEAFEVVACHVTGTIED--------------------------------------------------PSCSYETTT

Query:  -------------------DRPKEVPQI--DVKEETTKCTNGPAL-------------------------------------------------------
                           +RP  V     + + +     NG A+                                                       
Subjt:  -------------------DRPKEVPQI--DVKEETTKCTNGPAL-------------------------------------------------------

Query:  --------------------------------------------------------------RNN-----------------------------------
                                                                      +NN                                   
Subjt:  --------------------------------------------------------------RNN-----------------------------------

Query:  ------EVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------------------
              E  T+  K  I KDE +A +PVLRY+PLSRRKKGESPF E  K L +                                               
Subjt:  ------EVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------------------

Query:  ------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVA
                    EFKSL+I D RPELS TQKKLL+EG++IP SRKGLGYKSP  +RI +KGK KV D NHIT+EE D++D K+   QR SVF RI P VA
Subjt:  ------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVA

Query:  RPSALQRLSTTQVEEDQSPPISGSTRTSTLMR-----IRMPTKCMEL--MRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRL-----------SVTTSK
        RP   +RLS T+ E ++   +    R S   R     I+  + C  L   RPSA +RLG    +KKNV     A R   F  L           ++ T K
Subjt:  RPSALQRLSTTQVEEDQSPPISGSTRTSTLMR-----IRMPTKCMEL--MRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRL-----------SVTTSK

Query:  KDGPS-ASVFDRIYH
        K+  S   V+ RI H
Subjt:  KDGPS-ASVFDRIYH

TrEMBL top hitse value%identityAlignment
A0A5A7UMY2 Reverse transcriptase domain-containing protein3.4e-4146.09Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLT-------------------------MEFKSLKIFDERPELSLTQKKLLKEGYT
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L                           EFKSLKI  E+P+LS TQKKLL+EG+ 
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLT-------------------------MEFKSLKIFDERPELSLTQKKLLKEGYT

Query:  IPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKC
        IP SRKGLGYKSP  +RI RKGK KV D NHITV+EVD  +EK+   QRTS F RI P VAR    +RLS T+ +       S                 
Subjt:  IPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKC

Query:  MELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
          L R SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  MELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

A0A5D3BY54 Ty3-gypsy retrotransposon protein6.3e-4028.74Show/hide
Query:  KLIEGPSKDRVVVKDNPL-FDQFTPAVGQSKEASN-QDVMS-------VMMADVESDERMAEMER----KISLLMKRSLVQFGSFEPIVVWMNDEPSSKN
        K+++   K+ +VV   PL F +      + K+  N  D++        + + + +  E+   ++     K   ++   + ++   + +++ +  E   K 
Subjt:  KLIEGPSKDRVVVKDNPL-FDQFTPAVGQSKEASN-QDVMS-------VMMADVESDERMAEMER----KISLLMKRSLVQFGSFEPIVVWMNDEPSSKN

Query:  SQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDENLFRPRPLVTLEEFFPKNFLSKSQD
          E  +  +  QEK +  E+++EGWTVVTRRKKR+ +  QKESRL+ +++R +K+QK K+K+ T+K      +D++  R + +VTL +FFP  FL   QD
Subjt:  SQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDENLFRPRPLVTLEEFFPKNFLSKSQD

Query:  EAFEVVACHVTGTIEDPSCSYETT----------------------------------------------------------------------------
        E   VVA     +   P  +YE+T                                                                            
Subjt:  EAFEVVACHVTGTIEDPSCSYETT----------------------------------------------------------------------------

Query:  -------------TDRPKEVPQIDV---------------KEETTKCTNGPALRNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKS
                      D   EV  ++V                 E+ K T       +E ST+ AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + 
Subjt:  -------------TDRPKEVPQIDV---------------KEETTKCTNGPALRNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKS

Query:  LTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYK
        L +                                                           EFKSLKI  E+P+LS TQKKLL+EG+ IP SRKGLGYK
Subjt:  LTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYK

Query:  SPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTK
         P  +RI RKGK KV D+NHITV+EVD  +EK+  +QRTS F R+ P VAR    +RLS  + E       S   R ST  R+ M  K
Subjt:  SPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTK

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.3e-0546.38Show/hide
Query:  IANKIVKLIEGPSKDRVVVKDNPLFDQFTPAVGQSKEASNQDVMSVMMADVESDERMAEMERKISLLMK
        +A  I+K +    K  +V+K+NPL+D    +  +SK+ ++ DVMSVMMAD+  +  MAEMERKI+ LMK
Subjt:  IANKIVKLIEGPSKDRVVVKDNPLFDQFTPAVGQSKEASNQDVMSVMMADVESDERMAEMERKISLLMK

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.3e-3740Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM----------------------------------------------------
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L +                                                    
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM----------------------------------------------------

Query:  -------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSAL
               EFKSLKI+ E+P+LS TQKKLL+EG+ IP SRKGLGYKSP  +RI RKGK KV D+NHIT++E D  +EK+  +QRTS F RI P VAR    
Subjt:  -------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSAL

Query:  QRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
        ++LS T+ E       S                   L R SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  QRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

A0A5D3C0W6 Ty3-gypsy retrotransposon protein6.6e-3728.29Show/hide
Query:  ERKISLLMKRSLVQFGSFEPIVVWMNDEPSSKNSQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKP
        E+KI L     L +FG+FEP+VV  + E + ++S          QEK +  E+++E WT+VTRRKKR+ +  QKE R +R+++R +K+QK K+K+ T+K 
Subjt:  ERKISLLMKRSLVQFGSFEPIVVWMNDEPSSKNSQEGGIQKQYVQEKNKRTEDENEGWTVVTRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKP

Query:  VYAVREDENLFRPRPLVTLEEFFPKNFLSKSQDEAFEVVACHVTGTIEDPSCSYETTTDR-----------------PKEVPQIDVK-------------
            +ED++  R + L+TL +FFP  FL   QDE   VVACH     E+ S    +  +                  P+E+  I +              
Subjt:  VYAVREDENLFRPRPLVTLEEFFPKNFLSKSQDEAFEVVACHVTGTIEDPSCSYETTTDR-----------------PKEVPQIDVK-------------

Query:  ----EETTKC-------------------------------------TNGPAL-----------------------------------------------
            E T  C                                      NG A+                                               
Subjt:  ----EETTKC-------------------------------------TNGPAL-----------------------------------------------

Query:  ---------------------------------------------------------RNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAE
                                                                   +E ST+ AKS I  DE ++   +LRY+PLSRRKKGESPF E
Subjt:  ---------------------------------------------------------RNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAE

Query:  CTKSLTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKG
          + L +                                                           EFKSLKI  E+P+LS TQKKLL+EG+ IP SRKG
Subjt:  CTKSLTM-----------------------------------------------------------EFKSLKIFDERPELSLTQKKLLKEGYTIPASRKG

Query:  LGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPS
        LGYK P  +RI RKGK K+ D+NHITV+EVD   EK+  +QRTS F RI P VAR    +RLS T+ E       S   R S   R+ + TK      P 
Subjt:  LGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPS

Query:  A--LQRLG
        A  + RLG
Subjt:  A--LQRLG

A0A5D3DXC7 Reverse transcriptase domain-containing protein2.8e-4043.98Show/hide
Query:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------EFKSLKIFDERPELSLT
        +E STN AKS I  DE ++  P+LRY+PLSRRKKGESPF E  + L +                                   +FKSLKI  E+P+LS T
Subjt:  NEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTM-----------------------------------EFKSLKIFDERPELSLT

Query:  QKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTST
        QKKLL+EG+ IP SRKGLGYKSP  +RI RKGK KV D+NHITV+EVD  +EK+   QRTS F RI P VAR    +RLS T+ E       S       
Subjt:  QKKLLKEGYTIPASRKGLGYKSPRGVRIIRKGKAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTST

Query:  LMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI
                    L + SA QRL    KE+K +  TS  T+ SAF+RLS+T  K    P A + +R+
Subjt:  LMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLSAFQRLSVTTSKK-DGPSASVFDRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAAGGATAAGGAGACCTTCTTCGCTACTTTGATTCTTAAATTGAAATGGCTTACTGATGAAATCAAAGGGAAAAAAGAGTTCAAGAGTTTACCATGGAAAACAAA
GGAACAGAGGAGGAATCAAAGGCCATTAGAGACTACTCCCAGCGGGTGTTCAATGTTGATCAATATAGTGCATTCAGAGGACCCTACTTCATATTTACAAAATATTTTGG
AAATTTGTGGCATTGTCAAATTAAATGGAATTTCTGCCGAGATGCTTTTCGCCTACGAGGACCGAACCCCTGATGCGATTGCAAACAAGATCGTAAAGTTGATCGAAGGA
CCCTCCAAGGATAGAGTGGTCGTCAAAGATAACCCGCTGTTTGACCAGTTTACCCCTGCTGTCGGTCAATCAAAGGAGGCATCGAATCAAGATGTGATGTCTGTGATGAT
GGCCGATGTGGAATCCGACGAAAGGATGGCAGAGATGGAGAGAAAGATTAGTCTCCTGATGAAGCGAAGCCTAGTTCAGTTTGGATCCTTTGAACCTATCGTTGTGTGGA
TGAATGATGAACCCTCAAGTAAGAATTCTCAAGAGGGAGGCATCCAAAAGCAGTACGTTCAAGAAAAGAATAAGCGGACCGAAGATGAAAACGAAGGTTGGACTGTCGTG
ACTCGTCGCAAGAAGCGACAACAAAGTTACACGCAGAAGGAATCGCGACTATTCCGACACCATAAGAGAAAAAGCAAGTCGCAAAAGAAGAAAAGAAAACAGGTCACAAA
GAAGCCTGTTTACGCCGTGAGGGAAGACGAAAACCTCTTCCGCCCACGACCACTGGTAACTTTGGAGGAATTCTTCCCAAAGAATTTCCTAAGTAAAAGCCAGGATGAGG
CATTTGAGGTAGTTGCGTGTCACGTTACCGGTACGATTGAAGATCCTTCATGCTCGTATGAGACGACAACAGATCGGCCTAAGGAGGTACCACAGATTGATGTGAAAGAA
GAAACTACCAAATGTACAAATGGGCCTGCTCTGAGAAACAACGAAGTCTCTACGAACTTTGCAAAATCTGAAATTTCGAAAGATGAAGGAAGCGCAACCTCCCCTGTTTT
GCGTTACATTCCTTTGTCTCGACGAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACAAAAAGCCTGACTATGGAGTTCAAAAGTCTGAAGATCTTCGATGAGAGACCTG
AGCTCTCATTAACACAAAAGAAGCTTTTAAAGGAAGGTTATACTATCCCTGCATCAAGGAAAGGACTGGGGTATAAGTCTCCGAGAGGAGTCCGCATAATAAGAAAAGGG
AAGGCAAAGGTGGCAGACGCAAACCACATAACAGTGGAAGAGGTAGACGATTCAGATGAAAAGAAAAACGTTACCCAAAGGACTTCTGTTTTTAGCCGCATCGGGCCGTT
GGTGGCGCGACCTTCAGCCCTCCAACGATTGAGCACCACTCAAGTAGAAGAAGATCAGTCACCTCCCATTTCCGGTTCCACTCGAACCTCAACCCTCATGAGGATAAGGA
TGCCCACCAAATGTATGGAGCTAATGCGACCATCAGCATTACAAAGGTTAGGTGCGCCTGCGAAGGAAAAGAAAAATGTACCTTCGACCTCAGATGCGACGCGCCTTTCA
GCTTTTCAAAGGCTAAGTGTAACCACTTCAAAAAAAGATGGGCCCTCCGCATCAGTTTTTGATAGGATTTACCATGATTGCCCATCTTCTATGTCGAAATTGTGGAGCCT
TGGATCATGTACTTTGATGGCGCGGCACGAAGGAGCGGTGCGGGGGCAGGCATTATCTTCATCTCACCTGAGAAACACATGCTCCCTTACAGCTTTACACTTAGCAAGTT
GTGCTCAAACAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAAAGGATAAGGAGACCTTCTTCGCTACTTTGATTCTTAAATTGAAATGGCTTACTGATGAAATCAAAGGGAAAAAAGAGTTCAAGAGTTTACCATGGAAAACAAA
GGAACAGAGGAGGAATCAAAGGCCATTAGAGACTACTCCCAGCGGGTGTTCAATGTTGATCAATATAGTGCATTCAGAGGACCCTACTTCATATTTACAAAATATTTTGG
AAATTTGTGGCATTGTCAAATTAAATGGAATTTCTGCCGAGATGCTTTTCGCCTACGAGGACCGAACCCCTGATGCGATTGCAAACAAGATCGTAAAGTTGATCGAAGGA
CCCTCCAAGGATAGAGTGGTCGTCAAAGATAACCCGCTGTTTGACCAGTTTACCCCTGCTGTCGGTCAATCAAAGGAGGCATCGAATCAAGATGTGATGTCTGTGATGAT
GGCCGATGTGGAATCCGACGAAAGGATGGCAGAGATGGAGAGAAAGATTAGTCTCCTGATGAAGCGAAGCCTAGTTCAGTTTGGATCCTTTGAACCTATCGTTGTGTGGA
TGAATGATGAACCCTCAAGTAAGAATTCTCAAGAGGGAGGCATCCAAAAGCAGTACGTTCAAGAAAAGAATAAGCGGACCGAAGATGAAAACGAAGGTTGGACTGTCGTG
ACTCGTCGCAAGAAGCGACAACAAAGTTACACGCAGAAGGAATCGCGACTATTCCGACACCATAAGAGAAAAAGCAAGTCGCAAAAGAAGAAAAGAAAACAGGTCACAAA
GAAGCCTGTTTACGCCGTGAGGGAAGACGAAAACCTCTTCCGCCCACGACCACTGGTAACTTTGGAGGAATTCTTCCCAAAGAATTTCCTAAGTAAAAGCCAGGATGAGG
CATTTGAGGTAGTTGCGTGTCACGTTACCGGTACGATTGAAGATCCTTCATGCTCGTATGAGACGACAACAGATCGGCCTAAGGAGGTACCACAGATTGATGTGAAAGAA
GAAACTACCAAATGTACAAATGGGCCTGCTCTGAGAAACAACGAAGTCTCTACGAACTTTGCAAAATCTGAAATTTCGAAAGATGAAGGAAGCGCAACCTCCCCTGTTTT
GCGTTACATTCCTTTGTCTCGACGAAAGAAGGGTGAGTCACCATTTGCAGAGTGCACAAAAAGCCTGACTATGGAGTTCAAAAGTCTGAAGATCTTCGATGAGAGACCTG
AGCTCTCATTAACACAAAAGAAGCTTTTAAAGGAAGGTTATACTATCCCTGCATCAAGGAAAGGACTGGGGTATAAGTCTCCGAGAGGAGTCCGCATAATAAGAAAAGGG
AAGGCAAAGGTGGCAGACGCAAACCACATAACAGTGGAAGAGGTAGACGATTCAGATGAAAAGAAAAACGTTACCCAAAGGACTTCTGTTTTTAGCCGCATCGGGCCGTT
GGTGGCGCGACCTTCAGCCCTCCAACGATTGAGCACCACTCAAGTAGAAGAAGATCAGTCACCTCCCATTTCCGGTTCCACTCGAACCTCAACCCTCATGAGGATAAGGA
TGCCCACCAAATGTATGGAGCTAATGCGACCATCAGCATTACAAAGGTTAGGTGCGCCTGCGAAGGAAAAGAAAAATGTACCTTCGACCTCAGATGCGACGCGCCTTTCA
GCTTTTCAAAGGCTAAGTGTAACCACTTCAAAAAAAGATGGGCCCTCCGCATCAGTTTTTGATAGGATTTACCATGATTGCCCATCTTCTATGTCGAAATTGTGGAGCCT
TGGATCATGTACTTTGATGGCGCGGCACGAAGGAGCGGTGCGGGGGCAGGCATTATCTTCATCTCACCTGAGAAACACATGCTCCCTTACAGCTTTACACTTAGCAAGTT
GTGCTCAAACAATGTAG
Protein sequenceShow/hide protein sequence
MQKDKETFFATLILKLKWLTDEIKGKKEFKSLPWKTKEQRRNQRPLETTPSGCSMLINIVHSEDPTSYLQNILEICGIVKLNGISAEMLFAYEDRTPDAIANKIVKLIEG
PSKDRVVVKDNPLFDQFTPAVGQSKEASNQDVMSVMMADVESDERMAEMERKISLLMKRSLVQFGSFEPIVVWMNDEPSSKNSQEGGIQKQYVQEKNKRTEDENEGWTVV
TRRKKRQQSYTQKESRLFRHHKRKSKSQKKKRKQVTKKPVYAVREDENLFRPRPLVTLEEFFPKNFLSKSQDEAFEVVACHVTGTIEDPSCSYETTTDRPKEVPQIDVKE
ETTKCTNGPALRNNEVSTNFAKSEISKDEGSATSPVLRYIPLSRRKKGESPFAECTKSLTMEFKSLKIFDERPELSLTQKKLLKEGYTIPASRKGLGYKSPRGVRIIRKG
KAKVADANHITVEEVDDSDEKKNVTQRTSVFSRIGPLVARPSALQRLSTTQVEEDQSPPISGSTRTSTLMRIRMPTKCMELMRPSALQRLGAPAKEKKNVPSTSDATRLS
AFQRLSVTTSKKDGPSASVFDRIYHDCPSSMSKLWSLGSCTLMARHEGAVRGQALSSSHLRNTCSLTALHLASCAQTM