; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g22410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g22410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDimer_Tnp_hAT domain-containing protein
Genome locationchr4:16240599..16243420
RNA-Seq ExpressionMoc04g22410
SyntenyMoc04g22410
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKU60096.1 hypothetical protein MA16_Dca020494 [Dendrobium catenatum]1.5e-5748.47Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS--------
        +S QW    WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID         
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS--------

Query:  ----------------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                            FR +EGFFG QQA  +  KRSPV+WW QFGDGTP L +FA+KVL LTC
Subjt:  ----------------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        S+S CERNWST+NQ+HTKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

PKU85227.1 hypothetical protein MA16_Dca025470 [Dendrobium catenatum]7.5e-5749.8Show/hide
Query:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------
        WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID                  
Subjt:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------

Query:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW
                                                   FR +EGFFG QQA  +  KRSPV+WW QFGDGTP L +FA+KVL LTCS+SGCERNW
Subjt:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW

Query:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        ST+NQVHTKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

XP_019160773.1 PREDICTED: uncharacterized protein LOC109157329 [Ipomoea nil]2.6e-5746.95Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEEID------------------
        +S++W +S +A K +GKE++RI+  D +FWP+++YAIKTTKPLV VLR+VD EK  AMGF+Y  MD AKE+IAKNLGGEE D                  
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEEID------------------

Query:  ---------------------------------------------------SFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                           +F  ++GFFG  QA  +  KRSPV+WWTQ+GDGTP L KFAIKVL LTC
Subjt:  ---------------------------------------------------SFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        S+SGCERNWS FNQV TKRRNR +T ++N LVYI+YNK+ KD HLK K+L  +ED L++D+L
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia]2.1e-9166.67Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------
        +SKQ QDSVWAKK EGKEVKRI+ SDKHFWPAVVYAIKT KPLVGVLRMVDSEK+ AMGFIYGAMDSAKEEIAKN GGEE                    
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------

Query:  -------------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                          DSFRRREGFFGFQQAIAS KKRS VDWWTQFGDGTP LAKFAIKVLS TC
Subjt:  -------------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLNIFELGEGSSTQQRDVSDKTKQPKSRSKD
        SASGC RNWSTFNQVHTKRRNR STTKLNNLVYIMYNKR KD HLKRKALKEEEDPL         LGEG+STQQRDVSDKTKQPKSRSKD
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLNIFELGEGSSTQQRDVSDKTKQPKSRSKD

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia]1.9e-0733.33Show/hide
Query:  MDRFMATNVGDVDANGGNKVQNQVTPTNAKEARNSVCMDIGSSKQWQDSVWAKKLEGKEVKRIVSSDKHFW------------------PAVVYAIKTTK
        MDRFM T+VGDVDA+GGNKVQNQVTPTNAKEAR      +      + S W  K E K  + I++  +  W                    + + + +  
Subjt:  MDRFMATNVGDVDANGGNKVQNQVTPTNAKEARNSVCMDIGSSKQWQDSVWAKKLEGKEVKRIVSSDKHFW------------------PAVVYAIKTTK

Query:  PLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNL
          V +  +  S+ +     I+  +D   +EI +NL
Subjt:  PLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNL

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia]3.6e-5947.71Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEEID------------------
        +S++W +S +A K +GKEV++I+  D +FWP+ +YAIKTTKPLV VLR+VD +K  AMGF+Y AMD AKE+IAKNLGGEE D                  
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEEID------------------

Query:  ---------------------------------------------------SFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                           +F  REGFFG  QA ++  KR+P++WWTQ+GDGTP L KFAIKVL LTC
Subjt:  ---------------------------------------------------SFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        S+SGCERNWS FNQVHTKRRNR +TT++N LVYI+YNK+ KD HLK K+L   ED L++D+L
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

TrEMBL top hitse value%identityAlignment
A0A2I0W1H0 Dimer_Tnp_hAT domain-containing protein8.1e-5749.41Show/hide
Query:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------
        WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID                  
Subjt:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------

Query:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW
                                                   FR +EGFFG QQ   +  KRSPV+WW QFGDGTP L +FA+KVL LTCS+SGCERNW
Subjt:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW

Query:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        ST+NQVHTKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

A0A2I0WAP2 Uncharacterized protein1.8e-5649.41Show/hide
Query:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------
        WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID                  
Subjt:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------

Query:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW
                                                   FR +EGFFG QQA  +  KRSPV+WW QFGDGTP L +FA+KVL LTCS+SGCERNW
Subjt:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW

Query:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        ST+NQV+TKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

A0A2I0XBC2 Dimer_Tnp_hAT domain-containing protein3.6e-5749.8Show/hide
Query:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------
        WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID                  
Subjt:  WAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS-----------------

Query:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW
                                                   FR +EGFFG QQA  +  KRSPV+WW QFGDGTP L +FA+KVL LTCS+SGCERNW
Subjt:  -------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNW

Query:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        ST+NQVHTKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  STFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

A0A6J1E3R9 uncharacterized protein LOC1110258021.0e-9166.67Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------
        +SKQ QDSVWAKK EGKEVKRI+ SDKHFWPAVVYAIKT KPLVGVLRMVDSEK+ AMGFIYGAMDSAKEEIAKN GGEE                    
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------

Query:  -------------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                          DSFRRREGFFGFQQAIAS KKRS VDWWTQFGDGTP LAKFAIKVLS TC
Subjt:  -------------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLNIFELGEGSSTQQRDVSDKTKQPKSRSKD
        SASGC RNWSTFNQVHTKRRNR STTKLNNLVYIMYNKR KD HLKRKALKEEEDPL         LGEG+STQQRDVSDKTKQPKSRSKD
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLNIFELGEGSSTQQRDVSDKTKQPKSRSKD

A0A6J1E3R9 uncharacterized protein LOC1110258029.1e-0833.33Show/hide
Query:  MDRFMATNVGDVDANGGNKVQNQVTPTNAKEARNSVCMDIGSSKQWQDSVWAKKLEGKEVKRIVSSDKHFW------------------PAVVYAIKTTK
        MDRFM T+VGDVDA+GGNKVQNQVTPTNAKEAR      +      + S W  K E K  + I++  +  W                    + + + +  
Subjt:  MDRFMATNVGDVDANGGNKVQNQVTPTNAKEARNSVCMDIGSSKQWQDSVWAKKLEGKEVKRIVSSDKHFW------------------PAVVYAIKTTK

Query:  PLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNL
          V +  +  S+ +     I+  +D   +EI +NL
Subjt:  PLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNL

A0A6J1E3R9 uncharacterized protein LOC1110258027.3e-5848.47Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS--------
        +S QW    WAKK EGKE+++IV +D+ FW +V+YAI TT+PLV VLRMVD+EK  AMGFIY AMD AKE IA NLGG E         ID         
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE---------IDS--------

Query:  ----------------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                                            FR +EGFFG QQA  +  KRSPV+WW QFGDGTP L +FA+KVL LTC
Subjt:  ----------------------------------------------------FRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL
        S+S CERNWST+NQ+HTKRRNR ST ++N+LVYIMYN+R +D +LK+K L ++EDPL+ DD+
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily8.1e-1737.93Show/hide
Query:  EIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRK
        +I +F R +G FG   A+ +    SP  WW QFGD  P L + AI++LS  CS    ER WSTF Q+H +RRN+     LN L Y+  N +        +
Subjt:  EIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRK

Query:  ALKEEEDPLVLDDLNI
         +  E DP+ L+D+++
Subjt:  ALKEEEDPLVLDDLNI

AT3G22220.1 hAT transposon superfamily7.8e-2027.31Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------
        +S +W D  ++K+  G  +   + +D+ FW A+  A   T P++ VLR+V SE+  AMG++Y AM  AKE I  NL   E                    
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------

Query:  ---------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSAS-
                                                     I+S++   G FG   AI +     P +WW+ +G+   NL++FAI++LS TCS+S 
Subjt:  ---------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSAS-

Query:  GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK
        G  RN ++ +Q++ + +N     +LN+LV++ YN R +
Subjt:  GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK

AT3G22220.2 hAT transposon superfamily7.8e-2027.31Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------
        +S +W D  ++K+  G  +   + +D+ FW A+  A   T P++ VLR+V SE+  AMG++Y AM  AKE I  NL   E                    
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGGEE--------------------

Query:  ---------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSAS-
                                                     I+S++   G FG   AI +     P +WW+ +G+   NL++FAI++LS TCS+S 
Subjt:  ---------------------------------------------IDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSAS-

Query:  GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK
        G  RN ++ +Q++ + +N     +LN+LV++ YN R +
Subjt:  GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK

AT4G15020.1 hAT transposon superfamily5.8e-1524.07Show/hide
Query:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGG----------------------
        +S +W +  ++++  G  +  +  +D+ FW AV      T PL+  LR+V SEK  AMG++Y A+  AK+ I  +L                        
Subjt:  SSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKNLGG----------------------

Query:  ----------------------------------------------EEIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCS
                                                      +E+ S++   G FG   AI +     P +WW+ +G+   NL++FAI++LS TCS
Subjt:  ----------------------------------------------EEIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCS

Query:  AS-GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK
        +S  C RN      ++ + +N     +L++LV++ YN R +
Subjt:  AS-GCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFK

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related4.1e-3733.84Show/hide
Query:  SKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKN--------------------------
        S +W  S W K+  G ++K     +  FW  V++A+K   PL+ VLRMVD E+   MG+IYGAMD AKE I K+                          
Subjt:  SKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGAMDSAKEEIAKN--------------------------

Query:  -------------------------LGG-------------------EEIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC
                                 LGG                    E+D+F++  G FG   AI    K SP +WW+ +G  TPNL  FAIKVLSLTC
Subjt:  -------------------------LGG-------------------EEIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTC

Query:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLN
        SA+GCERNW  F  +HTKRRNR +  +LN+++++ YN+  +    +R    +  DP++L++++
Subjt:  SASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNKRFKDNHLKRKALKEEEDPLVLDDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGATTCATGGCGACTAATGTTGGTGATGTTGATGCCAATGGTGGAAATAAGGTTCAAAACCAAGTCACTCCTACAAATGCGAAGGAGGCTCGTAATTCT
GTGTGTATGGACATTGGGAGTTCTAAGCAATGGCAGGATAGTGTTTGGGCAAAAAAGCTAGAAGGGAAGGAAGTTAAGAGAATAGTTTCAAGTGACAAACATTTT
TGGCCGGCTGTGGTATATGCAATTAAGACAACTAAACCTTTAGTGGGGGTTTTGAGAATGGTTGATTCTGAAAAGATGTCTGCAATGGGATTTATATATGGTGCT
ATGGATTCAGCAAAGGAGGAGATTGCCAAAAATCTTGGAGGGGAGGAAATTGATTCATTTCGAAGGAGGGAAGGATTTTTTGGCTTCCAACAGGCAATAGCATCT
TCCAAAAAGCGGTCTCCAGTTGATTGGTGGACTCAATTTGGTGATGGCACACCAAACCTAGCTAAATTTGCCATCAAAGTTTTGAGTCTTACTTGTTCAGCATCT
GGTTGCGAGCGTAATTGGAGTACATTTAATCAGGTGCATACAAAGAGAAGGAATCGTTCGAGTACAACTAAGTTAAATAACTTAGTTTATATCATGTACAATAAA
AGATTCAAGGATAACCACTTGAAGCGAAAGGCTCTCAAAGAGGAAGAAGATCCATTGGTACTAGATGATTTGAATATTTTTGAGCTTGGAGAAGGAAGTAGTACT
CAACAGAGAGACGTTAGTGATAAGACAAAACAACCAAAATCTAGAAGTAAAGACACTGCCGCTGCTACTATCCGAAAACCATTGGAGGGGCCACACACTGATCTG
AAATTGGAAAGCCGCGTTGCTACTGGAGCGACGTTGTTGTTACCAACTGCCAAGATGTCGCCGTCGTGGGGAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCGATTCATGGCGACTAATGTTGGTGATGTTGATGCCAATGGTGGAAATAAGGTTCAAAACCAAGTCACTCCTACAAATGCGAAGGAGGCTCGTAATTCT
GTGTGTATGGACATTGGGAGTTCTAAGCAATGGCAGGATAGTGTTTGGGCAAAAAAGCTAGAAGGGAAGGAAGTTAAGAGAATAGTTTCAAGTGACAAACATTTT
TGGCCGGCTGTGGTATATGCAATTAAGACAACTAAACCTTTAGTGGGGGTTTTGAGAATGGTTGATTCTGAAAAGATGTCTGCAATGGGATTTATATATGGTGCT
ATGGATTCAGCAAAGGAGGAGATTGCCAAAAATCTTGGAGGGGAGGAAATTGATTCATTTCGAAGGAGGGAAGGATTTTTTGGCTTCCAACAGGCAATAGCATCT
TCCAAAAAGCGGTCTCCAGTTGATTGGTGGACTCAATTTGGTGATGGCACACCAAACCTAGCTAAATTTGCCATCAAAGTTTTGAGTCTTACTTGTTCAGCATCT
GGTTGCGAGCGTAATTGGAGTACATTTAATCAGGTGCATACAAAGAGAAGGAATCGTTCGAGTACAACTAAGTTAAATAACTTAGTTTATATCATGTACAATAAA
AGATTCAAGGATAACCACTTGAAGCGAAAGGCTCTCAAAGAGGAAGAAGATCCATTGGTACTAGATGATTTGAATATTTTTGAGCTTGGAGAAGGAAGTAGTACT
CAACAGAGAGACGTTAGTGATAAGACAAAACAACCAAAATCTAGAAGTAAAGACACTGCCGCTGCTACTATCCGAAAACCATTGGAGGGGCCACACACTGATCTG
AAATTGGAAAGCCGCGTTGCTACTGGAGCGACGTTGTTGTTACCAACTGCCAAGATGTCGCCGTCGTGGGGAGCGTAG
Protein sequenceShow/hide protein sequence
MDRFMATNVGDVDANGGNKVQNQVTPTNAKEARNSVCMDIGSSKQWQDSVWAKKLEGKEVKRIVSSDKHFWPAVVYAIKTTKPLVGVLRMVDSEKMSAMGFIYGA
MDSAKEEIAKNLGGEEIDSFRRREGFFGFQQAIASSKKRSPVDWWTQFGDGTPNLAKFAIKVLSLTCSASGCERNWSTFNQVHTKRRNRSSTTKLNNLVYIMYNK
RFKDNHLKRKALKEEEDPLVLDDLNIFELGEGSSTQQRDVSDKTKQPKSRSKDTAAATIRKPLEGPHTDLKLESRVATGATLLLPTAKMSPSWGA