; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBED-type domain-containing protein
Genome locationchr4:15555999..15558230
RNA-Seq ExpressionMoc04g21330
SyntenyMoc04g21330
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63729.1 hypothetical protein VITISV_009577 [Vitis vinifera]8.1e-7258.44Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++PSFWN++++ LK SGPLVRVLRLVDG+K A MGYIYEAM+RAK+ I   FNGNE KY+ I+ I+DKR + QLHRPLHA GY+LN   FYD K  I  D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLE
         E+                  AM+VL+LTC ASGCERNWS+FE+IHSK+RN L+ +RLNDLVYIKYN+ALK RY  ++ +DPI+L  ID+SNEWLIG +E
Subjt:  PEL------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLE

Query:  EED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS
        +ED  G  ++  VFDDD+LTWGDVA A+G  E    TR  AR+
Subjt:  EED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS

RVW73543.1 hypothetical protein CK203_062078 [Vitis vinifera]2.5e-7356.47Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKE-----
        ++PSFWN++++ LK SGPLVRVLRLVDG+K A MGYIYEAM+RAK+AI   FNGNE KY+ I+ I+DKRW+ QLHRPLHAAGY+LN   FYD  E     
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKE-----

Query:  -----------RIMQDP--------------ELAMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYI
                   R+ +DP              + AM+VL+LTCSASGC+RNWS+FE+IHSK+RN L+ +RLNDLVY KYN+ALK RY  ++ +DPI+L  I
Subjt:  -----------RIMQDP--------------ELAMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYI

Query:  DESNEWLIGTLEEED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS
        D+SNEWLIG +E+ED  G  +++ VFDDD+LTWGDVA A+G  E    TR  AR+
Subjt:  DESNEWLIGTLEEED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS

RWR75319.1 hypothetical protein CKAN_00369500 [Cinnamomum micranthum f. kanehirae]1.1e-7151.16Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDN-------
        +IPSFWN+V+Y LK SGPL+ VLRLVDG+ KP MGYIYEAMDRAKEAI + F G E +Y  I+EI+D+RWD QLHRPLHAAGY+LN   FY N       
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDN-------

Query:  -------------------KERIMQD---------------------------------------PELAMRVLSLTCSASGCERNWSVFEHIHSKKRNML
                           +++I Q+                                        + A++VLSLTCS+SGCERNWSVFEHIHSKKRN L
Subjt:  -------------------KERIMQD---------------------------------------PELAMRVLSLTCSASGCERNWSVFEHIHSKKRNML

Query:  EQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARSKGKSPATPAPTT
        ++KRLNDLV++KYN+ALK RY ++D+LDPI+L  IDESNEWL+G ++ E   E++E VFDDD LTWGDVA ASGV E  ++TR  A S  ++ ATP  T 
Subjt:  EQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARSKGKSPATPAPTT

Query:  S
        S
Subjt:  S

XP_030941202.1 uncharacterized protein LOC115966010 [Quercus lobata]2.1e-7251.43Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++ +FWN+V+Y+LK SGP+VRVLRLVDG+K PAMGYIYEAMDRAKEAI+  FNG E +Y+ I+EI+D+RWDCQLHRPLHAAGY+LN   FYDN+  I +D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM
         E+                                                                  A++VLSLTCS+SGCERNWS+FEH+HSKKRN 
Subjt:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM

Query:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE
        L Q R+NDLVYIKYN+ALK RY L+D +DPI+L  +D+SNEWLIG +EE++ +E  E++LVF+DD LTWG+V  A GV E
Subjt:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE

XP_030955048.1 uncharacterized protein LOC115977364 [Quercus lobata]7.3e-7351.79Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++ +FWN+V+Y+LK SGP+VRVLRLVDG+K PAMGYIYEAMDRAKEAI+  FNG E +Y+ I+EI+D+RWDCQLHRPLHAAGY+LN   FYDN+  I +D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM
         E+                                                                  A++VL LTCS+SGCERNWS+FEH+HSKKRN 
Subjt:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM

Query:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE
        L Q R+NDLVYIKYN+ALK RY L+D +DPI+L  +D+SNEWLIG +EE++ +E  E++LVF+DD LTWG+VA ASGV E
Subjt:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE

TrEMBL top hitse value%identityAlignment
A0A3S3MFA1 Dimer_Tnp_hAT domain-containing protein5.1e-7251.16Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDN-------
        +IPSFWN+V+Y LK SGPL+ VLRLVDG+ KP MGYIYEAMDRAKEAI + F G E +Y  I+EI+D+RWD QLHRPLHAAGY+LN   FY N       
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDN-------

Query:  -------------------KERIMQD---------------------------------------PELAMRVLSLTCSASGCERNWSVFEHIHSKKRNML
                           +++I Q+                                        + A++VLSLTCS+SGCERNWSVFEHIHSKKRN L
Subjt:  -------------------KERIMQD---------------------------------------PELAMRVLSLTCSASGCERNWSVFEHIHSKKRNML

Query:  EQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARSKGKSPATPAPTT
        ++KRLNDLV++KYN+ALK RY ++D+LDPI+L  IDESNEWL+G ++ E   E++E VFDDD LTWGDVA ASGV E  ++TR  A S  ++ ATP  T 
Subjt:  EQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARSKGKSPATPAPTT

Query:  S
        S
Subjt:  S

A0A438GMU6 Uncharacterized protein1.2e-7356.47Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKE-----
        ++PSFWN++++ LK SGPLVRVLRLVDG+K A MGYIYEAM+RAK+AI   FNGNE KY+ I+ I+DKRW+ QLHRPLHAAGY+LN   FYD  E     
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKE-----

Query:  -----------RIMQDP--------------ELAMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYI
                   R+ +DP              + AM+VL+LTCSASGC+RNWS+FE+IHSK+RN L+ +RLNDLVY KYN+ALK RY  ++ +DPI+L  I
Subjt:  -----------RIMQDP--------------ELAMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYI

Query:  DESNEWLIGTLEEED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS
        D+SNEWLIG +E+ED  G  +++ VFDDD+LTWGDVA A+G  E    TR  AR+
Subjt:  DESNEWLIGTLEEED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS

A0A7N2L2M4 BED-type domain-containing protein3.6e-7351.79Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++ +FWN+V+Y+LK SGP+VRVLRLVDG+K PAMGYIYEAMDRAKEAI+  FNG E +Y+ I+EI+D+RWDCQLHRPLHAAGY+LN   FYDN+  I +D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM
         E+                                                                  A++VL LTCS+SGCERNWS+FEH+HSKKRN 
Subjt:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM

Query:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE
        L Q R+NDLVYIKYN+ALK RY L+D +DPI+L  +D+SNEWLIG +EE++ +E  E++LVF+DD LTWG+VA ASGV E
Subjt:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE

A0A7N2MQZ7 Uncharacterized protein1.0e-7251.43Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++ +FWN+V+Y+LK SGP+VRVLRLVDG+K PAMGYIYEAMDRAKEAI+  FNG E +Y+ I+EI+D+RWDCQLHRPLHAAGY+LN   FYDN+  I +D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM
         E+                                                                  A++VLSLTCS+SGCERNWS+FEH+HSKKRN 
Subjt:  PEL------------------------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNM

Query:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE
        L Q R+NDLVYIKYN+ALK RY L+D +DPI+L  +D+SNEWLIG +EE++ +E  E++LVF+DD LTWG+V  A GV E
Subjt:  LEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQE--ENELVFDDDDLTWGDVAEASGVRE

A5BQ63 Uncharacterized protein3.9e-7258.44Show/hide
Query:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD
        ++PSFWN++++ LK SGPLVRVLRLVDG+K A MGYIYEAM+RAK+ I   FNGNE KY+ I+ I+DKR + QLHRPLHA GY+LN   FYD K  I  D
Subjt:  MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPA-MGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQD

Query:  PEL------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLE
         E+                  AM+VL+LTC ASGCERNWS+FE+IHSK+RN L+ +RLNDLVYIKYN+ALK RY  ++ +DPI+L  ID+SNEWLIG +E
Subjt:  PEL------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLE

Query:  EED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS
        +ED  G  ++  VFDDD+LTWGDVA A+G  E    TR  AR+
Subjt:  EED--GQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYARS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily3.0e-2429.66Show/hide
Query:  FWNSVLYTLKASGPLVRVLRLVDGDKPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNK-----------
        FW +V  ++  S P+++VLR V   KPA+G IYE M +AKE+I++ +  +E K++   +IVD  W   LH PLHAA  +LN S+ Y+ +           
Subjt:  FWNSVLYTLKASGPLVRVLRLVDGDKPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNK-----------

Query:  ----ERIMQDPEL-------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLN
            E+++   +L                                                 A+R+LS  CS    ER WS F+ +H ++RN ++++ LN
Subjt:  ----ERIMQDPEL-------------------------------------------------AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQKRLN

Query:  DLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWL
         L Y+  N  L    TL  + DPI L+ ID  +EW+
Subjt:  DLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWL

AT3G22220.1 hAT transposon superfamily4.1e-2129.88Show/hide
Query:  FWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFY-------------
        FW ++      + P++RVLR+V  + KPAMGY+Y AM RAKEAIK+     E +Y   W+I+D+ W   L +PL+AAG+YLN   FY             
Subjt:  FWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFY-------------

Query:  --DNKERIMQDPEL-------------------------------------------------AMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQKR
          D  E+++ D  +                                                 A+R+LS TCS+S G  RN +    I+  K N +E++R
Subjt:  --DNKERIMQDPEL-------------------------------------------------AMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQKR

Query:  LNDLVYIKYNQALK---ERYTLQDQLDPITLDYIDESNEWL
        LNDLV+++YN  L+      +  D +DP++   ++   +W+
Subjt:  LNDLVYIKYNQALK---ERYTLQDQLDPITLDYIDESNEWL

AT4G15020.1 hAT transposon superfamily5.5e-2630.49Show/hide
Query:  SFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERI------
        +FW +V      + PL+R LR+V  +K PAMGY+Y A+ RAK+AIK+    N   Y   W+I+D+ W+ Q H PL AAG++LN  LFY+  E I      
Subjt:  SFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERI------

Query:  --------------MQD--------------------------------------------PELAMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQK
                      +QD                                               A+R+LS TCS+S  C RN    EHI+  K N +EQK
Subjt:  --------------MQD--------------------------------------------PELAMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQK

Query:  RLNDLVYIKYNQALKE--RYTLQDQLDPITLDYIDESNEWLIG--------------TLEEEDGQEENELVFDDDDLTWG-DVAEASGVREPIRQTRTYA
        RL+DLV+++YN  L++    +  D LDP++ + ID   EW+ G              +LE     +   ++ D +DL  G D  E   V + +R    Y 
Subjt:  RLNDLVYIKYNQALKE--RYTLQDQLDPITLDYIDESNEWLIG--------------TLEEEDGQEENELVFDDDDLTWG-DVAEASGVREPIRQTRTYA

Query:  RSKGK
         +  K
Subjt:  RSKGK

AT4G15020.2 hAT transposon superfamily5.5e-2630.49Show/hide
Query:  SFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERI------
        +FW +V      + PL+R LR+V  +K PAMGY+Y A+ RAK+AIK+    N   Y   W+I+D+ W+ Q H PL AAG++LN  LFY+  E I      
Subjt:  SFWNSVLYTLKASGPLVRVLRLVDGDK-PAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERI------

Query:  --------------MQD--------------------------------------------PELAMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQK
                      +QD                                               A+R+LS TCS+S  C RN    EHI+  K N +EQK
Subjt:  --------------MQD--------------------------------------------PELAMRVLSLTCSAS-GCERNWSVFEHIHSKKRNMLEQK

Query:  RLNDLVYIKYNQALKE--RYTLQDQLDPITLDYIDESNEWLIG--------------TLEEEDGQEENELVFDDDDLTWG-DVAEASGVREPIRQTRTYA
        RL+DLV+++YN  L++    +  D LDP++ + ID   EW+ G              +LE     +   ++ D +DL  G D  E   V + +R    Y 
Subjt:  RLNDLVYIKYNQALKE--RYTLQDQLDPITLDYIDESNEWLIG--------------TLEEEDGQEENELVFDDDDLTWG-DVAEASGVREPIRQTRTYA

Query:  RSKGK
         +  K
Subjt:  RSKGK

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.2e-6043.1Show/hide
Query:  SFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFY------------
        SFW +VL+ LK  GPL++VLR+VDG+ KP MGYIY AMD+AKE I   F   E  Y+  +EI+D+RWD QLHRPLHAAGYYLN    Y            
Subjt:  SFWNSVLYTLKASGPLVRVLRLVDGD-KPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFY------------

Query:  --------------DNKERIMQD------------------------------------PEL---AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQK
                      + +++I+ +                                    P L   A++VLSLTCSA+GCERNW VF+ +H+K+RN L Q 
Subjt:  --------------DNKERIMQD------------------------------------PEL---AMRVLSLTCSASGCERNWSVFEHIHSKKRNMLEQK

Query:  RLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEEN-ELVFDDDDLTWGDVAEASGVREPIRQTRTYARS----KGKSPAT
        RLND++++KYN+AL+ RY   D  DPI L+ ID+ NEWL G +EE     EN +LVF++DDLTW +V EA+G  +P   TR+ A S    KGK  A+
Subjt:  RLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEEN-ELVFDDDDLTWGDVAEASGVREPIRQTRTYARS----KGKSPAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCCATCTTTTTGGAATTCAGTTTTGTATACTTTGAAAGCATCGGGCCCTTTAGTTCGTGTGTTGAGGCTAGTAGATGGTGATAAACCGGCTATGGGTTATATTTA
CGAAGCTATGGATAGGGCAAAGGAGGCTATTAAGAGTGGTTTTAATGGAAACGAGGCAAAGTATAGACCGATTTGGGAGATTGTTGATAAAAGGTGGGATTGTCAACTTC
ATCGGCCTTTACATGCAGCCGGATATTATTTGAATGCATCTCTTTTTTATGACAACAAAGAGAGAATAATGCAAGATCCTGAGTTAGCCATGAGAGTCCTTAGTCTAACG
TGCAGCGCTTCTGGCTGTGAGCGTAATTGGAGTGTTTTTGAACATATTCATTCAAAGAAGAGGAACATGTTAGAGCAAAAGCGTCTTAATGACTTGGTTTATATAAAATA
TAACCAAGCACTTAAAGAGCGCTACACTCTACAAGACCAACTCGATCCAATTACTTTGGATTATATTGATGAAAGTAATGAATGGTTGATCGGAACACTCGAAGAAGAAG
ATGGTCAAGAAGAAAATGAGTTGGTCTTTGATGATGACGACCTCACATGGGGAGATGTAGCCGAGGCTAGTGGTGTTAGAGAACCCATAAGGCAAACTAGAACATATGCA
AGATCTAAAGGAAAGTCGCCTGCTACTCCCGCTCCTACAACTTCAAGAGTTCGAAAATCAGTGTGCGCCTCACTTCAATCGAGCATCACCTTTGTATCGCCTCTCGCCTC
AAAGCGATCAAAGGACTCGTCGCCTTGCGGAGTGGCTAACTATCGAAACTTGTCAAAACGAAGAAAACGTCACATAACTGGGCTGTTCGCGTCGAGATCACAAAGTGGGC
CACCGGAGCCGCTGGTTCGTGTGCTAACCGAAGGTTTGTACCAAAGAACGGGTCGCGTCGCTGTGGAATCCACCGGAACACGCCACACTACCGCACGAAAGAGAGAGACC
CACGATTTGATTCGCGAACTGTCGCCGTCTGAGCTTCTGTTCGTCGGAAGAGAACACACGTTGTCATGGGTGTGGTTGCGTCGCCATGAAGAACGCTGCACAGAGGCTGC
CGTTTGCGTTTCTAGGTGCAAGGAAGAGCTGCTGCCGCCGCTGCTCGTGATGGCCTACCACCGCGCCGAGAAGTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCCCATCTTTTTGGAATTCAGTTTTGTATACTTTGAAAGCATCGGGCCCTTTAGTTCGTGTGTTGAGGCTAGTAGATGGTGATAAACCGGCTATGGGTTATATTTA
CGAAGCTATGGATAGGGCAAAGGAGGCTATTAAGAGTGGTTTTAATGGAAACGAGGCAAAGTATAGACCGATTTGGGAGATTGTTGATAAAAGGTGGGATTGTCAACTTC
ATCGGCCTTTACATGCAGCCGGATATTATTTGAATGCATCTCTTTTTTATGACAACAAAGAGAGAATAATGCAAGATCCTGAGTTAGCCATGAGAGTCCTTAGTCTAACG
TGCAGCGCTTCTGGCTGTGAGCGTAATTGGAGTGTTTTTGAACATATTCATTCAAAGAAGAGGAACATGTTAGAGCAAAAGCGTCTTAATGACTTGGTTTATATAAAATA
TAACCAAGCACTTAAAGAGCGCTACACTCTACAAGACCAACTCGATCCAATTACTTTGGATTATATTGATGAAAGTAATGAATGGTTGATCGGAACACTCGAAGAAGAAG
ATGGTCAAGAAGAAAATGAGTTGGTCTTTGATGATGACGACCTCACATGGGGAGATGTAGCCGAGGCTAGTGGTGTTAGAGAACCCATAAGGCAAACTAGAACATATGCA
AGATCTAAAGGAAAGTCGCCTGCTACTCCCGCTCCTACAACTTCAAGAGTTCGAAAATCAGTGTGCGCCTCACTTCAATCGAGCATCACCTTTGTATCGCCTCTCGCCTC
AAAGCGATCAAAGGACTCGTCGCCTTGCGGAGTGGCTAACTATCGAAACTTGTCAAAACGAAGAAAACGTCACATAACTGGGCTGTTCGCGTCGAGATCACAAAGTGGGC
CACCGGAGCCGCTGGTTCGTGTGCTAACCGAAGGTTTGTACCAAAGAACGGGTCGCGTCGCTGTGGAATCCACCGGAACACGCCACACTACCGCACGAAAGAGAGAGACC
CACGATTTGATTCGCGAACTGTCGCCGTCTGAGCTTCTGTTCGTCGGAAGAGAACACACGTTGTCATGGGTGTGGTTGCGTCGCCATGAAGAACGCTGCACAGAGGCTGC
CGTTTGCGTTTCTAGGTGCAAGGAAGAGCTGCTGCCGCCGCTGCTCGTGATGGCCTACCACCGCGCCGAGAAGTCGTAG
Protein sequenceShow/hide protein sequence
MIPSFWNSVLYTLKASGPLVRVLRLVDGDKPAMGYIYEAMDRAKEAIKSGFNGNEAKYRPIWEIVDKRWDCQLHRPLHAAGYYLNASLFYDNKERIMQDPELAMRVLSLT
CSASGCERNWSVFEHIHSKKRNMLEQKRLNDLVYIKYNQALKERYTLQDQLDPITLDYIDESNEWLIGTLEEEDGQEENELVFDDDDLTWGDVAEASGVREPIRQTRTYA
RSKGKSPATPAPTTSRVRKSVCASLQSSITFVSPLASKRSKDSSPCGVANYRNLSKRRKRHITGLFASRSQSGPPEPLVRVLTEGLYQRTGRVAVESTGTRHTTARKRET
HDLIRELSPSELLFVGREHTLSWVWLRRHEERCTEAAVCVSRCKEELLPPLLVMAYHRAEKS