CuGenDBv2

Moc08g00200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g00200
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: type 2 DNA topoisomerase 6 subunit B-like isoform X1
Genome location: chr8:114650..123635
RNA-Seq Expression: Moc08g00200
Synteny: Moc08g00200
Gene Ontology terms:
  GO:0042138 - meiotic DNA double-strand break formation (biological process)
  GO:0016853 - isomerase activity (molecular function)
InterPro domains:
  IPR003863 - Protein of unknown function DUF220
  IPR023393 - START-like domain superfamily
  IPR034566 - Type 2 DNA topoisomerase 6 subunit B-like, plants
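
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits that string and computes the length of the locus. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr8:114650..123635")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth confirming before relying on the number.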


Homology
GenBank top hits (e value, % identity, alignment)
KAG7029197.1 Type 2 DNA topoisomerase 6 subunit B-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.2e-191 | % identity: 72.31
Query:  TSSFFNTEMHLLRTTLQTLSSSATLCRFRSSPISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTT
        T+ F NTEMHLL TTLQTLSSSATL RFRSS ISDTGLGSC+EEFQ+ KCPVEGILAE W                               +GIV +RTT
Subjt:  TSSFFNTEMHLLRTTLQTLSSSATLCRFRSSPISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTT

Query:  ---NFCDNEISCYQLNLKENVTIREPIWLPSNVKHGVKFRHGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTA
           + CD EISCY+LNLKENVT R+PI LPSN+KH         +VDVLL E NCFFQKI +LK+PN+AMEV+ + QD+PGSRNDAVFL+  S SSSF A
Subjt:  ---NFCDNEISCYQLNLKENVTIREPIWLPSNVKHGVKFRHGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTA

Query:  STLDHLKLGLEDYVSRHGSSLTCDSCFPNRDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGI
        STLDHLKLGLE YV RHGSSL CDSCFPNRDNL+SGGGM+CE+K KTT L VEAA+V SE SNP+ NC GA  SDT+V CFKDFAPCSISEA LKAL GI
Subjt:  STLDHLKLGLEDYVSRHGSSLTCDSCFPNRDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGI

Query:  DWKRYGLSLESATNQRSHALLKWEHVPLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAG
        DWKRYGL+LE A +QRSHALLKWEH+PLSFHIHIVVHCY KLV E MPL+QKT+FDKKLI KAVKLALDD KNK+AG LLSA+TLKIS+FAPDLAK+IAG
Subjt:  DWKRYGLSLESATNQRSHALLKWEHVPLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAG

Query:  LVLYSNDLDFQEECLAILGLQPQQSEGEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        LVLYS+D+DF+EECLAILGLQP QSEGEIV ENIK++IIS IEVND R +GTKEVAPLLF +GRH+LQFVDDECDEDGFDPM+L
Subjt:  LVLYSNDLDFQEECLAILGLQPQQSEGEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

XP_022131879.1 type 2 DNA topoisomerase 6 subunit B-like isoform X1 [Momordica charantia] | e value: 1.5e-229 | % identity: 90.99
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        ISDTGLGSCLEEFQDFKCPVEGILAEQW                               DGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF     R  ++ AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
Subjt:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

XP_022131880.1 type 2 DNA topoisomerase 6 subunit B-like isoform X2 [Momordica charantia] | e value: 5.8e-229 | % identity: 90.77
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        IS+TGLGSCLEEFQDFKCPVEGILAEQW                               DGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF     R  ++ AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
Subjt:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

XP_022132255.1 uncharacterized protein LOC111005155 [Momordica charantia] | e value: 9.4e-187 | % identity: 99.7
Query:  MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV
        MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV
Subjt:  MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV

Query:  PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF
        PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF
Subjt:  PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF

Query:  EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ
        EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ
Subjt:  EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ

Query:  GTCDSNLLDSILDIKERWAMHRRNAKPCRAR
        GTCDSNLLDSILDIKERWAMHRRNAKPCR R
Subjt:  GTCDSNLLDSILDIKERWAMHRRNAKPCRAR

XP_023545219.1 type 2 DNA topoisomerase 6 subunit B-like [Cucurbita pepo subsp. pepo] | e value: 1.2e-184 | % identity: 72.75
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        ISDTGLGSC+EEFQ+ KCPVEGILAE W                               +GIV +RTTN CD EISCY+LNLKENVT R+PI LPSN+KH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF           +VDVLL E NCFFQKI +LK+PN+AMEV+ + QD+PGSRNDAVFLE +S+SSSF ASTLDHLKLGLE YV RHGSSL CDSCFPN
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNL+SGGGM+CE+K KTT L VEAA+V SE+SNPT NC GA  SDT+V CFKDFAPCSISEA LKAL GIDWKRYGL+LE A +QRSHALLKWEH+PLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCY KLV E MPL+QKT+FDKKLI KAVKLALDD KNK+AG LLSA+TLKIS+FAPDLAK+IAGLVLYS+D+DF+EECLAILGLQP QSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        V ENIK++IIS IEVND R +GTKEVAPLLF +GRH+LQFVDDECDEDGFDPM+L
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BQX4 type 2 DNA topoisomerase 6 subunit B-like isoform X2 | e value: 2.8e-229 | % identity: 90.77
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        IS+TGLGSCLEEFQDFKCPVEGILAEQW                               DGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF     R  ++ AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
Subjt:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

A0A6J1BRH4 type 2 DNA topoisomerase 6 subunit B-like isoform X1 | e value: 7.4e-230 | % identity: 90.99
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        ISDTGLGSCLEEFQDFKCPVEGILAEQW                               DGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF     R  ++ AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
Subjt:  GVKF-----RHGIT-AVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

A0A6J1BRY3 uncharacterized protein LOC111005155 | e value: 4.6e-187 | % identity: 99.7
Query:  MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV
        MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV
Subjt:  MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTV

Query:  PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF
        PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF
Subjt:  PKDTLSRLNAKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKF

Query:  EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ
        EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ
Subjt:  EGCWRVEPIFVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQ

Query:  GTCDSNLLDSILDIKERWAMHRRNAKPCRAR
        GTCDSNLLDSILDIKERWAMHRRNAKPCR R
Subjt:  GTCDSNLLDSILDIKERWAMHRRNAKPCRAR

A0A6J1HCF2 type 2 DNA topoisomerase 6 subunit B-like isoform X6 | e value: 3.1e-183 | % identity: 72.09
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        ISDTGLGSC+EEFQ+ KCPVEGILAE W                               +GIV +RTTN CD EISCY+LNLKENVT R+PI LPSN+KH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF           +VDVLL E NCFFQKI +LK+PN+AMEV+ + QD+PGSRNDAVFLE  S SSSF ASTLDHLKLGLE YV RHGSSL CDSCFPN
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNL+SGGGM+CE+K KTT L VEAA+V SE+ NPT NC GA  SDT+V CFKDFAPCSISEA LKAL GIDWKRYGL+LE A +QRSHALLKWEH+PLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCY KLV E MPL+QKT+FDKKLI KAVKLALDD KNK+AG LLSA+TLKIS+FAPDLAK+IAGLVLYS+D+DF+EECLAILGLQP QSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        V ENIK++IIS IEVND R +GT+EVAPLLF +GRH+LQ+VDDECDEDGFDPM+L
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

A0A6J1K810 type 2 DNA topoisomerase 6 subunit B-like isoform X5 | e value: 8.3e-181 | % identity: 71.43
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        ISDTGLGSC+EEFQ+ KCPVEGILAE W                               +GIV +RTTN CD EISCYQLNLKENVT R+PI LPSN+KH
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN
        GVKF           ++DVLL E NCFFQKI +LK+PN+AMEV+ + QD+PGSRNDAVFLE  S SSSF ASTLDHLK GLE YV RHGSSL CDSCFPN
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSLTCDSCFPN

Query:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS
        RDNL+SGGGM+CE+K KTT L VEAA+V +E+SNPT NC  A  S+T+V CFKDFAPCSISEA LKAL GIDWKRYGL+LE A +QRSHALLKWEH+PLS
Subjt:  RDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHVPLS

Query:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI
        FHIHIVVHCY K V E MPL+QKT+FDKKLI KAVKLALDD KNK+AG LLSA+TLKIS+FAPDLAK+IAGLVLYS+D+DF+EECLAILGLQP QSEGEI
Subjt:  FHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSEGEI

Query:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
        V ENIK +IIS IEVND R +GTKEVAPLLF +GRH+LQFVDDECDEDG DPM+L
Subjt:  VEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL

SwissProt top hits (e value, % identity, alignment)
Q5Q0E6 Type 2 DNA topoisomerase 6 subunit B-like | e value: 6.8e-79 | % identity: 38.93
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        I+DTG+G  LEEFQ+ +CP E   A+ W                               DG++S++TT F D+E+  Y +NL E +  +     PS  K+
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF
        G KF           ++DVL+A I  FFQKI +L+I NV ++++V+    PG++   VF  N   +  FTAS L+ LK GLEDYV RH + L   CD CF
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF

Query:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV
         +R++LK G G +C E+K K     +E  IVIS+L   T +C  +    TEV  F +F P  +    L AL+ IDWK+YGL L +  +Q  H  L+W++ 
Subjt:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV

Query:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE
        P    I I +H Y        P  QK      L+ K +K ALD+LK K+ GFLLS+++ KI  + PDLA++IAGL+  S DLDFQ +CL++LG Q Q+ E
Subjt:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE

Query:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDE
         + VE  I+ +I++VI +N+ +P+  +E AP LF DG  +  F +DE
Subjt:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDE

Arabidopsis top hits (e value, % identity, alignment)
AT1G23560.1 Domain of unknown function (DUF220) | e value: 1.5e-81 | % identity: 49.35
Query:  NPAISNFLAQLSQKVQNSLKPQLNRLANEGQ-KLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLNAKVDVGLPPD
        N     FL  ++Q +Q  LK +  RL N  +  L S+S       +      L    EKQLQAWR+NPSW D+PP++ V         LN + DVGLPP+
Subjt:  NPAISNFLAQLSQKVQNSLKPQLNRLANEGQ-KLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLNAKVDVGLPPD

Query:  AVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPIFVDESVCFPV
         VYNI T PDNKR FKNIKE ISRKVLIDEG +Q VEV+QAA W+FLWW GT  +H++V++NR + + K+KQ  T FMK FEGCW+VEP+F+DE +C   
Subjt:  AVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPIFVDESVCFPV

Query:  KPKNLTDYHACTKGKGRIGSRVSLEQLIQP-AIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLDSILDIKERW
        KPK+  DYH+C+ G+GRIGS+V+++Q+ QP A++ PPP+SWY+RGIT +TTE +I DL AEA R+R    G  ++++ + +  T + +  D   DIKERW
Subjt:  KPKNLTDYHACTKGKGRIGSRVSLEQLIQP-AIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLDSILDIKERW

Query:  AMHRRNAK
          HRR+ +
Subjt:  AMHRRNAK

AT1G60460.1 unknown protein | e value: 2.2e-80 | % identity: 39.02
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        I+DTG+G  LEEFQ+ +CP E   A+ W                               DG++S++TT F D+E+  Y +NL E +  +     PS  K+
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF
        G KF           ++DVL+A I  FFQKI +L+I NV ++++V+    PG++   VF  N   +  FTAS L+ LK GLEDYV RH + L   CD CF
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF

Query:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV
         +R++LK G G +C E+K K     +E  IVIS+L   T +C  +    TEV  F +F P  +    L AL+ IDWK+YGL L +  +Q  H  L+W++ 
Subjt:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV

Query:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE
        P    I I +H Y        P  QK      L+ K +K ALD+LK K+ GFLLS+++ KI  + PDLA++IAGL+  S DLDFQ +CL++LG Q Q+ E
Subjt:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE

Query:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDED
         + VE  I+ +I++VI +N+ +P+  +E AP LF DG  +  F +DE  ED
Subjt:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDED

AT1G60460.2 unknown protein | e value: 4.8e-80 | % identity: 38.93
Query:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH
        I+DTG+G  LEEFQ+ +CP E   A+ W                               DG++S++TT F D+E+  Y +NL E +  +     PS  K+
Subjt:  ISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFYADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKH

Query:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF
        G KF           ++DVL+A I  FFQKI +L+I NV ++++V+    PG++   VF  N   +  FTAS L+ LK GLEDYV RH + L   CD CF
Subjt:  GVKFR------HGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTLDHLKLGLEDYVSRHGSSL--TCDSCF

Query:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV
         +R++LK G G +C E+K K     +E  IVIS+L   T +C  +    TEV  F +F P  +    L AL+ IDWK+YGL L +  +Q  H  L+W++ 
Subjt:  PNRDNLKSGGGMIC-EEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESATNQRSHALLKWEHV

Query:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE
        P    I I +H Y        P  QK      L+ K +K ALD+LK K+ GFLLS+++ KI  + PDLA++IAGL+  S DLDFQ +CL++LG Q Q+ E
Subjt:  PLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQQSE

Query:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDE
         + VE  I+ +I++VI +N+ +P+  +E AP LF DG  +  F +DE
Subjt:  GEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDE

AT1G70480.1 Domain of unknown function (DUF220) | e value: 3.9e-106 | % identity: 61.51
Query:  QNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLN
        +N+ N V   F+PA+S F+   SQK+Q  LK QL  L ++   + S    +  + KG+S+ S+++D+EKQL  WRENPSWTD+ P +KV +PK +L  L 
Subjt:  QNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLN

Query:  AKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPI
        A+V+VGLPPDAVYNIV DPDN+RVFKNIKEV+SRKVL+D+G RQVVEVEQAALWRFLWWSGTISVHVLVDQ+RADHSMKFKQ+K+GFMK+FEG W+V+P+
Subjt:  AKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPI

Query:  FVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLD
        FVDE +C  +KPK L +Y+ CT GKGRIGS+V+L+QLIQPAIVPPPPISWYLRGIT +TTEMLI DLLAE  RIR  A G   +        + D   + 
Subjt:  FVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLD

Query:  SILDIKERWAMHRRNAK
        +  DIKERW+ HRR ++
Subjt:  SILDIKERWAMHRRNAK

AT1G70480.2 Domain of unknown function (DUF220) | e value: 3.9e-106 | % identity: 61.51
Query:  QNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLN
        +N+ N V   F+PA+S F+   SQK+Q  LK QL  L ++   + S    +  + KG+S+ S+++D+EKQL  WRENPSWTD+ P +KV +PK +L  L 
Subjt:  QNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLN

Query:  AKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPI
        A+V+VGLPPDAVYNIV DPDN+RVFKNIKEV+SRKVL+D+G RQVVEVEQAALWRFLWWSGTISVHVLVDQ+RADHSMKFKQ+K+GFMK+FEG W+V+P+
Subjt:  AKVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPI

Query:  FVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLD
        FVDE +C  +KPK L +Y+ CT GKGRIGS+V+L+QLIQPAIVPPPPISWYLRGIT +TTEMLI DLLAE  RIR  A G   +        + D   + 
Subjt:  FVDESVCFPVKPKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLD

Query:  SILDIKERWAMHRRNAK
        +  DIKERW+ HRR ++
Subjt:  SILDIKERWAMHRRNAK
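
The % identity values listed for the hits above can be related to the unlabeled middle line of each alignment block. In BLAST-style output that line carries a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts identical columns for a single block. The function names are hypothetical, and it assumes the query and middle line have already been trimmed to the same columns (the `Query:` label and the matching leading spaces removed).

```python
def block_identity(query: str, midline: str) -> tuple:
    """Return (identical_columns, total_columns) for one alignment block.

    A column counts as identical when the midline holds a letter;
    '+' (similar) and ' ' (mismatch or gap) do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    identical = sum(1 for m in midline if m.isalpha())
    return identical, len(query)


def percent_identity(blocks) -> float:
    """Pool several (query, midline) blocks into one percentage."""
    identical = total = 0
    for query, midline in blocks:
        i, n = block_identity(query, midline)
        identical += i
        total += n
    return round(100.0 * identical / total, 2)


# Last block of the XP_022132255.1 alignment shown above.
last_block = ("GTCDSNLLDSILDIKERWAMHRRNAKPCRAR",
              "GTCDSNLLDSILDIKERWAMHRRNAKPCR R")
print(block_identity(*last_block))
```

For that hit, the last block has one non-identical column out of 31. If the three preceding 100-column blocks are fully identical, as they appear to be, the pooled figure is 330 of 331, or about 99.7, which agrees with the value listed for the hit.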


Sequences
CDS sequence
ATGACAAAAACAAGTATATCCATCGCCCAGAATGAGAGGAACGATGTTCTAGAAGCATTTAATCCTGCAATATCCAATTTCCTTGCGCAGCTTTCTCAGAAAGTTCAAAA
TTCTCTGAAGCCACAGCTTAATAGATTGGCAAATGAAGGTCAGAAATTGACGTCAATGAGTCCATTCGCAAAAGGGGAGAGGAAGGGATCTTCCACATCATCCTTGAAGC
TTGATGTGGAGAAGCAACTGCAGGCATGGAGAGAGAACCCTTCATGGACTGACAAACCTCCACAAATAAAGGTAACTGTACCGAAAGATACCCTTAGCAGACTCAATGCA
AAGGTTGATGTTGGTTTGCCACCAGATGCTGTCTATAATATCGTGACAGACCCTGATAACAAGAGAGTTTTCAAGAATATTAAAGAGGTGATATCCAGAAAGGTTTTGAT
TGATGAAGGCTCGAGACAAGTGGTTGAGGTGGAGCAAGCAGCTTTGTGGAGATTTCTTTGGTGGTCCGGGACCATTTCAGTTCATGTTCTAGTAGATCAAAACAGAGCGG
ATCACTCAATGAAGTTCAAGCAACTGAAGACTGGATTTATGAAGAAATTCGAAGGCTGCTGGAGAGTAGAGCCTATATTTGTTGATGAAAGCGTCTGCTTTCCAGTAAAA
CCCAAGAATTTGACAGATTATCATGCTTGTACAAAGGGCAAGGGAAGAATAGGGTCAAGGGTGAGCCTCGAACAGCTGATCCAGCCAGCCATTGTTCCACCACCGCCCAT
CTCCTGGTATCTTCGCGGAATTACCACCAGGACCACTGAGATGCTCATTCTTGATCTGCTTGCTGAAGCCAAAAGGATAAGGGAAGATGCCAAAGGTCAAACTTTGAACA
ATGAGCTTGATATTTCTCAGGGAACGTGTGATAGCAACTTATTAGATAGCATTCTTGATATTAAAGAAAGATGGGCAATGCATAGGAGGAATGCGAAGCCATGTCGAGCG
CGGGAGTCTGTTTCTAGTCTTCTCCCAAAACTTCTACCATGGAGTTTAATTCGCTTCAAGACCTCTTCCTTCTTCAATACGGAGATGCATCTGCTCCGAACGACTTTGCA
GACTCTCAGTTCTTCTGCAACGCTCTGCCGCTTCCGATCCTCCCCTATTTCAGATACCGGCCTTGGAAGCTGCCTAGAGGAGTTTCAGGATTTTAAGTGTCCCGTTGAAG
GTATCCTTGCTGAACAGTGGGTAGCTCATGTACAGAGATATTTTGCAGTAGGATTAGACTACCGTACCATTGTTATGAGGTTGAGTATTAATACATTTCTGGTATTTTAT
GCAGATGGAATTGTTTCTTTGAGAACTACTAATTTTTGTGATAATGAGATAAGTTGCTATCAGTTGAATCTGAAAGAAAATGTCACTATCAGGGAGCCAATTTGGCTACC
TTCAAACGTTAAGCATGGTGTGAAATTCAGGCATGGAATCACTGCTGTTGATGTTTTGCTAGCGGAGATCAATTGCTTCTTTCAGAAGATTCGCATTCTAAAGATCCCTA
ATGTTGCTATGGAGGTTGTGGTCGAGTGCCAGGATATCCCAGGGTCACGAAATGATGCAGTCTTCCTGGAAAACTTGTCCTCTAGTTCCTCTTTCACAGCCTCAACCCTG
GATCATCTGAAATTGGGCCTTGAGGATTATGTATCAAGGCATGGATCTAGTTTGACGTGTGACTCTTGCTTCCCTAACAGGGATAATCTGAAGAGTGGGGGTGGAATGAT
TTGTGAAGAAAAGCGTAAAACGACCACACTGGCAGTGGAGGCAGCGATTGTGATAAGTGAATTATCAAATCCGACCACTAATTGCTTGGGAGCACGGTTCTCCGATACAG
AGGTTTTTTGTTTTAAAGATTTTGCACCTTGCTCGATCTCTGAGGCATTTCTAAAGGCATTAAGAGGCATTGACTGGAAACGTTATGGTTTGTCTTTGGAAAGTGCTACC
AATCAAAGAAGCCATGCATTACTAAAGTGGGAACACGTGCCCCTATCTTTTCATATTCATATTGTTGTCCACTGCTACCAGAAGCTTGTGGCCGAGGTCATGCCATTGGT
GCAGAAGACTCAGTTTGATAAAAAACTCATAAGTAAAGCAGTTAAGCTTGCACTGGACGATTTGAAGAATAAATATGCGGGATTTCTTCTCAGTGCCAATACCCTGAAGA
TCAGTAGGTTTGCCCCTGATCTTGCCAAAACGATTGCTGGCCTAGTTTTATACTCCAATGACTTGGATTTCCAAGAAGAATGCCTTGCAATTCTTGGCTTGCAACCTCAA
CAATCAGAAGGGGAAATTGTTGAAGAAAATATTAAGGAAAGGATAATTTCAGTCATTGAGGTGAATGACGGAAGGCCAAAAGGAACAAAAGAAGTTGCCCCTCTCTTGTT
TGGAGATGGTCGCCACCAACTACAGTTTGTGGACGATGAATGTGATGAAGATGGATTTGATCCAATGGAGTTGTAA
mRNA sequence
ATGACAAAAACAAGTATATCCATCGCCCAGAATGAGAGGAACGATGTTCTAGAAGCATTTAATCCTGCAATATCCAATTTCCTTGCGCAGCTTTCTCAGAAAGTTCAAAA
TTCTCTGAAGCCACAGCTTAATAGATTGGCAAATGAAGGTCAGAAATTGACGTCAATGAGTCCATTCGCAAAAGGGGAGAGGAAGGGATCTTCCACATCATCCTTGAAGC
TTGATGTGGAGAAGCAACTGCAGGCATGGAGAGAGAACCCTTCATGGACTGACAAACCTCCACAAATAAAGGTAACTGTACCGAAAGATACCCTTAGCAGACTCAATGCA
AAGGTTGATGTTGGTTTGCCACCAGATGCTGTCTATAATATCGTGACAGACCCTGATAACAAGAGAGTTTTCAAGAATATTAAAGAGGTGATATCCAGAAAGGTTTTGAT
TGATGAAGGCTCGAGACAAGTGGTTGAGGTGGAGCAAGCAGCTTTGTGGAGATTTCTTTGGTGGTCCGGGACCATTTCAGTTCATGTTCTAGTAGATCAAAACAGAGCGG
ATCACTCAATGAAGTTCAAGCAACTGAAGACTGGATTTATGAAGAAATTCGAAGGCTGCTGGAGAGTAGAGCCTATATTTGTTGATGAAAGCGTCTGCTTTCCAGTAAAA
CCCAAGAATTTGACAGATTATCATGCTTGTACAAAGGGCAAGGGAAGAATAGGGTCAAGGGTGAGCCTCGAACAGCTGATCCAGCCAGCCATTGTTCCACCACCGCCCAT
CTCCTGGTATCTTCGCGGAATTACCACCAGGACCACTGAGATGCTCATTCTTGATCTGCTTGCTGAAGCCAAAAGGATAAGGGAAGATGCCAAAGGTCAAACTTTGAACA
ATGAGCTTGATATTTCTCAGGGAACGTGTGATAGCAACTTATTAGATAGCATTCTTGATATTAAAGAAAGATGGGCAATGCATAGGAGGAATGCGAAGCCATGTCGAGCG
CGGGAGTCTGTTTCTAGTCTTCTCCCAAAACTTCTACCATGGAGTTTAATTCGCTTCAAGACCTCTTCCTTCTTCAATACGGAGATGCATCTGCTCCGAACGACTTTGCA
GACTCTCAGTTCTTCTGCAACGCTCTGCCGCTTCCGATCCTCCCCTATTTCAGATACCGGCCTTGGAAGCTGCCTAGAGGAGTTTCAGGATTTTAAGTGTCCCGTTGAAG
GTATCCTTGCTGAACAGTGGGTAGCTCATGTACAGAGATATTTTGCAGTAGGATTAGACTACCGTACCATTGTTATGAGGTTGAGTATTAATACATTTCTGGTATTTTAT
GCAGATGGAATTGTTTCTTTGAGAACTACTAATTTTTGTGATAATGAGATAAGTTGCTATCAGTTGAATCTGAAAGAAAATGTCACTATCAGGGAGCCAATTTGGCTACC
TTCAAACGTTAAGCATGGTGTGAAATTCAGGCATGGAATCACTGCTGTTGATGTTTTGCTAGCGGAGATCAATTGCTTCTTTCAGAAGATTCGCATTCTAAAGATCCCTA
ATGTTGCTATGGAGGTTGTGGTCGAGTGCCAGGATATCCCAGGGTCACGAAATGATGCAGTCTTCCTGGAAAACTTGTCCTCTAGTTCCTCTTTCACAGCCTCAACCCTG
GATCATCTGAAATTGGGCCTTGAGGATTATGTATCAAGGCATGGATCTAGTTTGACGTGTGACTCTTGCTTCCCTAACAGGGATAATCTGAAGAGTGGGGGTGGAATGAT
TTGTGAAGAAAAGCGTAAAACGACCACACTGGCAGTGGAGGCAGCGATTGTGATAAGTGAATTATCAAATCCGACCACTAATTGCTTGGGAGCACGGTTCTCCGATACAG
AGGTTTTTTGTTTTAAAGATTTTGCACCTTGCTCGATCTCTGAGGCATTTCTAAAGGCATTAAGAGGCATTGACTGGAAACGTTATGGTTTGTCTTTGGAAAGTGCTACC
AATCAAAGAAGCCATGCATTACTAAAGTGGGAACACGTGCCCCTATCTTTTCATATTCATATTGTTGTCCACTGCTACCAGAAGCTTGTGGCCGAGGTCATGCCATTGGT
GCAGAAGACTCAGTTTGATAAAAAACTCATAAGTAAAGCAGTTAAGCTTGCACTGGACGATTTGAAGAATAAATATGCGGGATTTCTTCTCAGTGCCAATACCCTGAAGA
TCAGTAGGTTTGCCCCTGATCTTGCCAAAACGATTGCTGGCCTAGTTTTATACTCCAATGACTTGGATTTCCAAGAAGAATGCCTTGCAATTCTTGGCTTGCAACCTCAA
CAATCAGAAGGGGAAATTGTTGAAGAAAATATTAAGGAAAGGATAATTTCAGTCATTGAGGTGAATGACGGAAGGCCAAAAGGAACAAAAGAAGTTGCCCCTCTCTTGTT
TGGAGATGGTCGCCACCAACTACAGTTTGTGGACGATGAATGTGATGAAGATGGATTTGATCCAATGGAGTTGTAA
Protein sequence
MTKTSISIAQNERNDVLEAFNPAISNFLAQLSQKVQNSLKPQLNRLANEGQKLTSMSPFAKGERKGSSTSSLKLDVEKQLQAWRENPSWTDKPPQIKVTVPKDTLSRLNA
KVDVGLPPDAVYNIVTDPDNKRVFKNIKEVISRKVLIDEGSRQVVEVEQAALWRFLWWSGTISVHVLVDQNRADHSMKFKQLKTGFMKKFEGCWRVEPIFVDESVCFPVK
PKNLTDYHACTKGKGRIGSRVSLEQLIQPAIVPPPPISWYLRGITTRTTEMLILDLLAEAKRIREDAKGQTLNNELDISQGTCDSNLLDSILDIKERWAMHRRNAKPCRA
RESVSSLLPKLLPWSLIRFKTSSFFNTEMHLLRTTLQTLSSSATLCRFRSSPISDTGLGSCLEEFQDFKCPVEGILAEQWVAHVQRYFAVGLDYRTIVMRLSINTFLVFY
ADGIVSLRTTNFCDNEISCYQLNLKENVTIREPIWLPSNVKHGVKFRHGITAVDVLLAEINCFFQKIRILKIPNVAMEVVVECQDIPGSRNDAVFLENLSSSSSFTASTL
DHLKLGLEDYVSRHGSSLTCDSCFPNRDNLKSGGGMICEEKRKTTTLAVEAAIVISELSNPTTNCLGARFSDTEVFCFKDFAPCSISEAFLKALRGIDWKRYGLSLESAT
NQRSHALLKWEHVPLSFHIHIVVHCYQKLVAEVMPLVQKTQFDKKLISKAVKLALDDLKNKYAGFLLSANTLKISRFAPDLAKTIAGLVLYSNDLDFQEECLAILGLQPQ
QSEGEIVEENIKERIISVIEVNDGRPKGTKEVAPLLFGDGRHQLQFVDDECDEDGFDPMEL
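
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal illustration that assumes the standard nuclear genetic code and an input containing only the letters A, C, G and T. The `translate` helper is hypothetical and is not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 18 nucleotides of the CDS listed above.
print(translate("ATGACAAAAACAAGTATATCCATCGCCCAG"))
print(translate("GATCCAATGGAGTTGTAA"))
```

The two fragments translate to the start (`MTKTSISIAQ`) and end (`DPMEL`, followed by a stop) of the listed protein sequence. Joining all CDS lines into one string and passing it to `translate` would extend the same check to the full length.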