; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G026010 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G026010
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionEnergy-coupling factor transporter ATP-binding protein EcfA1
Genome locationCmU531Chr02:369409..380979
RNA-Seq ExpressionCmUC02G026010
SyntenyCmUC02G026010
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0003684 - damaged DNA binding (molecular function)
InterPro domainsIPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR017871 - ABC transporter-like, conserved site
IPR015856 - ABC transporter, CbiO/EcfA subunit
IPR011084 - DNA repair metallo-beta-lactamase
IPR003593 - AAA+ ATPase domain
IPR003439 - ABC transporter-like, ATP-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651993.1 hypothetical protein Csa_016958 [Cucumis sativus]0.0e+0080.45Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTW+PSSK+KRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVLQQFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IHQ+INCIWKHPDAPLVYLICNLLGQE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADES KA YKALELI+PDILTQDPSSRFHLLDGFPKLC+TARTLLADAQTNFLSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK+KLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV CS +VEAPTQ NV+PQ Q VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LLS EVSYPSTE+EPVEAVGD VA+LSIHDA  KL  KSS NSK+EVH EGKH+KF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SL S+ T+LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE E  D+ KDV IIL+TH+GK+VNN RIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR---------------RPATPRRSVSAATPANRRSSPLVSLLLLRFLSHSSVELRSTHLRTLSLFQRSLPNTIFKVLSFRRI
        VPVPEPLPSLV +  +R+R               RP   R+S S    A R+   + SLL +   +  S + +S   R++S    + P            
Subjt:  VPVPEPLPSLVPVSPARQR---------------RPATPRRSVSAATPANRRSSPLVSLLLLRFLSHSSVELRSTHLRTLSLFQRSLPNTIFKVLSFRRI

Query:  EFLLHHFCSSMVSKSSSLNLVTSSPPLLSPLFSSPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGL
                 +MV KSSS NLV  SPPLLSPLFS+PPSRIN TENIAIEGRNL+FSITT+QGK VPILRDCSLRIPSG+FWMLLGPNGCGKSTLLKVLAGL
Subjt:  EFLLHHFCSSMVSKSSSLNLVTSSPPLLSPLFSSPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGL

Query:  LNTTTGAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTT
        LNTT+GAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNL NDEVKSRVLEALSA        RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTT
Subjt:  LNTTTGAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTT

Query:  FLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYIKRNYYS
        FLDENDQ GVIKAVRNSL NSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYIKR Y+S
Subjt:  FLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYIKRNYYS

XP_004147360.2 protein artemis isoform X1 [Cucumis sativus]3.0e-30685.3Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTW+PSSK+KRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVLQQFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IHQ+INCIWKHPDAPLVYLICNLLGQE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADES KA YKALELI+PDILTQDPSSRFHLLDGFPKLC+TARTLLADAQTNFLSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK+KLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV CS +VEAPTQ NV+PQ Q VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LLS EVSYPSTE+EPVEAVGD VA+LSIHDA  KL  KSS NSK+EVH EGKH+KF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SL S+ T+LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE E  D+ KDV IIL+TH+GK+VNN RIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

XP_008460868.1 PREDICTED: protein artemis [Cucumis melo]1.5e-30285.3Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVL QFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IHQ+INCIWKHPDAPLVYLICNLLGQE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADESTKA YKALELIDPDILTQD SSRFHLLDGFPKLC+TARTLLADAQ   LSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKKLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV  S +VEAPTQ NVDPQ + VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LL  EVSYPSTE+EPVEAVG  +ADLSIHDA  KL DKSS NSKNEVH +GKHKKF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SLYSD  +LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE ES D  KDV IIL+TH+GK+VNN RIA CSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo]2.5e-28480.49Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP G+PFSVDTWTPSSKQKRHHFLTHAH DHT GI   HSSFPI+STF+TKSIVLQ FP LHDSLFVCIEVGQ+LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI
         FGN+LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HSAIHQVINCIWKHPDAPLVYLIC+ LGQE+ILQQVSQTFGSKI
Subjt:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI

Query:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        F DE TKA YKALELIDPDILTQDPSSRFHLL GFPKLC+TA+ LLA+AQTNF  EPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDL+AS IEV CSP+VE  T  ++DPQLQP KL+ VPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM

Query:  LNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR
        L+I  SSNLPPLTLFGRARL   DA +L  EVSYPSTE+EPVEAVGDKVADLSIHDA G+  DK SK+SKNE++S+GKH+KF N+  LLA+ +AS  SD 
Subjt:  LNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR

Query:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP
          L  SE++ VS+ NNNP E V S++EE HVHEQE R    +SLDD +DV  + +THIGK+V + RIAGCSNSH+LSVGSSKGFND+FRKLYRSMNV VP
Subjt:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP

Query:  EPLPSLVPVSPARQR
        EPLPSLV +  +R+R
Subjt:  EPLPSLVPVSPARQR

XP_038901492.1 uncharacterized protein LOC120088341 [Benincasa hispida]7.3e-31086.11Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTWTP+SKQKRHHFLTHAHRDHTTGI PHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              G FGN+LHTGDCRLTPECLQNLP+KYRGKSGKEPRCKLDLIFLDCTFGRFFQ FPS HS+I QVINCIWKHPDA LVYLICNLLGQEEILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADESTKA YKALELIDPDILTQDPSSRFHL+DGFPKLC+TA+TLLADAQTNF S PLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSS+EELEWALQILAPKWVVSTTPGCRA+DLDYVKKKLS SS+TSNGLIWKLFGI E+SSSDL+ASVIEV CSP+VEA    NVDPQ QPVK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNAS
        +P+P+EMLNI  SSNLPPLTLFGRARL ++DAILL EVSYPSTE+EPVEAVGD VADLSIHDA GKL  KSSK+SKNEVHS+GKH+KF NDA+LA+VNAS
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNAS

Query:  LYSDRTELPISEIRAVSVNNNPLEVVHSDI-EEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        LYSD T LP SEI+ VS+NNNP EVV +D+ EEHVHEQE RAK  ESLDD KDV I LKTH+GKIVNN RIA CSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  LYSDRTELPISEIRAVSVNNNPLEVVHSDI-EEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

TrEMBL top hitse value%identityAlignment
A0A0A0LMF2 DRMBL domain-containing protein1.5e-30685.3Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTW+PSSK+KRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVLQQFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IHQ+INCIWKHPDAPLVYLICNLLGQE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADES KA YKALELI+PDILTQDPSSRFHLLDGFPKLC+TARTLLADAQTNFLSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK+KLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV CS +VEAPTQ NV+PQ Q VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LLS EVSYPSTE+EPVEAVGD VA+LSIHDA  KL  KSS NSK+EVH EGKH+KF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLS-EVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SL S+ T+LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE E  D+ KDV IIL+TH+GK+VNN RIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

A0A1S3CCY1 protein artemis7.5e-30385.3Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVL QFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IHQ+INCIWKHPDAPLVYLICNLLGQE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADESTKA YKALELIDPDILTQD SSRFHLLDGFPKLC+TARTLLADAQ   LSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKKLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV  S +VEAPTQ NVDPQ + VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LL  EVSYPSTE+EPVEAVG  +ADLSIHDA  KL DKSS NSKNEVH +GKHKKF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SLYSD  +LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE ES D  KDV IIL+TH+GK+VNN RIA CSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

A0A5D3BSH5 Protein artemis1.3e-28381.74Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQG+PFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPH SFPIYSTFLTKSIVL QFP LHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS
              GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HS+IH                       QE+ILQQVS
Subjt:  ------GYFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVS

Query:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI
        QTFGSKIFADESTKA YKALELIDPDILTQD SSRFHLLDGFPKLC+TARTLLADAQ   LSEPLVIRPSTQWYVREE+SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKKLS SSLTSNGLIWKLFGIAE+SSSDL+ASVIEV  S +VEAPTQ NVDPQ + VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKL

Query:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA
        +PVP+EMLNI  SSNLPPLTLFGRARL ++DA LL  EVSYPSTE+EPVEAVG  +ADLSIHDA  KL DKSS NSKNEVH +GKHKKF NDAL A+VNA
Subjt:  HPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNA

Query:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN
        SLYSD  +LPISEI+ +S NNN  EV +SD+EEHVHEQE R KE ES D  KDV IIL+TH+GK+VNN RIA CSNSHLLSVGSSKGFNDKFRKLYRSMN
Subjt:  SLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMN

Query:  VPVPEPLPSLVPVSPARQR
        VPVPEPLPSLV +  +R+R
Subjt:  VPVPEPLPSLVPVSPARQR

A0A6J1G997 uncharacterized protein LOC111451874 isoform X22.3e-28380.81Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP G+PFSVDTWTPSSKQKRHHFLTHAH DHT GI   HSSFPI+STF+TKSIVLQ FP LHDSLFVCIEVGQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI
         FGN+LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HSAIHQVINCIWKHPDAPLVYLIC+ LGQE+ILQQVSQTFGSKI
Subjt:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI

Query:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        F DE TKA YKALELIDPDILTQDPSSRFHLL GFPKLC+TA+ LLA+AQTNF  EPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK S SSLTSNGLIWKLFG+AEESSSDL+ASVIEV CSP+VE  T  ++DPQLQP KL+ VPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM

Query:  LNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR
        L+I   SNLPPLTLFGRARL   DA +L  EVSYPSTE+EPVEAVGDKVADLSIHDA G+  DK SK+SKNEV+S+GKH+KF ND  LLA+ +AS  SDR
Subjt:  LNIFPSSNLPPLTLFGRARLGVDDAILL-SEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR

Query:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP
          L  SE+  VS+ NNNP E V S++EE HVHEQE R K  +SLDD +DV  + KTHIGK+V + R+   SNSH+LSVGSSKGFND+FRKLYRSMNV VP
Subjt:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP

Query:  EPLPSLVPVSPARQR
        EPLPSLV +  +R+R
Subjt:  EPLPSLVPVSPARQR

A0A6J1L262 uncharacterized protein LOC111499723 isoform X21.9e-28280.16Show/hide
Query:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP G+PFSVDTWTPSSKQKRHHFLTHAH DHT GI   HSSFPI+STF+TKSIVLQ FP LHDSLFVCIEVGQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-VPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI
         FGN+LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPS HSAIHQVINCIWKHPDAPLVYLIC+ LGQE+ILQQVSQTFGSKI
Subjt:  YFGNVLHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKI

Query:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        F DE TKA YKALELIDPDILTQDPSSRFHLL GFPKLC+TA+ LLA+AQTNF  EPLVIRPSTQWYVREELSE CNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FADESTKASYKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK K SCSSLTS+GLIWKLFG+AEESSSDL+AS IEV CSP+VE  T  ++DPQLQP KL+ VPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREM

Query:  LNIFPSSNLPPLTLFGRARLGVDDA-ILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR
        L+I   SNLPPLTLFGRARL  +DA +LL EVSYPS E+EPVEAVGDKVADLSIHDA G+  DK SK+SKNEV+S+GKH+KF N   LLA+  AS  SDR
Subjt:  LNIFPSSNLPPLTLFGRARLGVDDA-ILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFEND-ALLANVNASLYSDR

Query:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP
          L ISE++ VS+ NNNP E V S++EE H HEQE R K  +SLDD +DV  + +T IGK+V + RIAGCSNSH+LSVGSSKGFN +FRKLYRSMNV VP
Subjt:  TELPISEIRAVSV-NNNPLEVVHSDIEE-HVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVP

Query:  EPLPSLVPVSPARQR
        EPLPSLV +  +R+R
Subjt:  EPLPSLVPVSPARQR

SwissProt top hitse value%identityAlignment
A3DJK3 Energy-coupling factor transporter ATP-binding protein EcfA15.6e-2936.74Show/hide
Query:  VPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVY--------------VKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNLTNDEVK
        V +L++ +  I  G+F  ++G NG GKST  ++L  +L  T G +Y              +++    VFQNPD Q++  +VE DVAFG   + + +DE+ 
Subjt:  VPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVY--------------VKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNLTNDEVK

Query:  SRVLEALSARSVQT--------LSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGK
         RV EAL +  ++         LSGGQKQRVAIAG LA   K ++LDE T+ LD + +  V+K +R+ L   E++T + +TH +EE   A   + M++GK
Subjt:  SRVLEALSARSVQT--------LSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGK

Query:  VVMHGDVASIQSFIQ
        +VM G+   I S ++
Subjt:  VVMHGDVASIQSFIQ

Q03PY5 Energy-coupling factor transporter ATP-binding protein EcfA14.3e-2938.84Show/hide
Query:  LTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAV-------------YVKKPKCFVFQNPDYQVVM
        +TENI I   +L++     Q    P L D S  + +G++  ++G NG GKSTL K L GLL  T G+V              V++    +FQNPD Q V 
Subjt:  LTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAV-------------YVKKPKCFVFQNPDYQVVM

Query:  PTVEADVAFGLGKLNLTNDEVKSRVLEAL--------SARSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTAL
         TVE DVAFGL    ++ DE+  RV  AL        + R   +LSGGQKQRVA+AG +A A K+L+LDE T+ LD   ++ ++  VR  LR  +++T +
Subjt:  PTVEADVAFGLGKLNLTNDEVKSRVLEAL--------SARSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTAL

Query:  WVTHRLEELEYADGAIYMEDGKVV
         +TH ++E   AD  + ++DG++V
Subjt:  WVTHRLEELEYADGAIYMEDGKVV

Q1WSB8 Energy-coupling factor transporter ATP-binding protein EcfA12.1e-3138.89Show/hide
Query:  INLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYV-------------KKPKCFVFQNPDYQV
        ++LT+   IE  NLSF    RQ K   IL++ +L I  GQ+  L+G NG GKSTL +++ GLL   +G++ V             +K    VFQNPD Q 
Subjt:  INLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYV-------------KKPKCFVFQNPDYQV

Query:  VMPTVEADVAFGLGKLNLTNDEVKSRVLEALSARSVQ--------TLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVT
        V   VE+D+AFGL    +  +E+ SRV +AL    ++         LSGGQKQRVAIAG LA   K+++LDE T+ LD   + G+I  +++ L++    T
Subjt:  VMPTVEADVAFGLGKLNLTNDEVKSRVLEALSARSVQ--------TLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVT

Query:  ALWVTHRLEELEYADGAIYMEDGKVVMHGDVASI
         + +TH ++E   AD  I ++DGK++  G   SI
Subjt:  ALWVTHRLEELEYADGAIYMEDGKVVMHGDVASI

Q8H1R4 ABC transporter I family member 102.6e-9067.43Show/hide
Query:  LVTSSPPLLSPLFSSPPS-------RINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK
        L  +SPP L PL S   S       RI  ++N+A+E RNL FS++TRQG  VPILRDCS RIPSGQ WM+LGPNGCGKSTLLK+LAG++N ++G V+V+K
Subjt:  LVTSSPPLLSPLFSSPPS-------RINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK

Query:  PKCFVFQNPDYQVVMPTVEADVAFGLGKL-NLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVI
        PK FVFQNPD+QVVMPTVEADVAFGLGK  ++  +EVKSRV++AL A        R +QTLSGGQKQR+AIAGALAEACKVLLLDELTTFLDE+DQMGVI
Subjt:  PKCFVFQNPDYQVVMPTVEADVAFGLGKL-NLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVI

Query:  KAVRNSLRNSE-DVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYI
        KAV++ +   + DVTALWVTHRLEEL+YADGA+YME+G+VV HGD A+I  FI+ KQS YI
Subjt:  KAVRNSLRNSE-DVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYI

Q8R7Y4 Energy-coupling factor transporter ATP-binding protein EcfA17.1e-3240.54Show/hide
Query:  IEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVK--------------KPKCFVFQNPDYQVVMPTVEA
        I  +N+SF  +  + K  P+L+D +L+   GQF  ++G NG GKSTL K    LL  T G VYVK              +    VFQNPD Q+V   VE 
Subjt:  IEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVK--------------KPKCFVFQNPDYQVVMPTVEA

Query:  DVAFGLGKLNLTNDEVKSRVLEALSARSV--------QTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHR
        DVAFG   L +  +E++ RV  AL A  +          LSGGQKQRVAIAG +A   + ++LDE T  LD   +  VI  ++  L   E +T + +TH 
Subjt:  DVAFGLGKLNLTNDEVKSRVLEALSARSV--------QTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHR

Query:  LEELEYADGAIYMEDGKVVMHG
        +EE+  AD  I M+DGKVV+ G
Subjt:  LEELEYADGAIYMEDGKVVMHG

Arabidopsis top hitse value%identityAlignment
AT1G19025.1 DNA repair metallo-beta-lactamase family protein5.4e-12842.05Show/hide
Query:  MPIEMPQGVPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIVPHS--SFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAH
        M IEMP+G+PF+VDT   +T + ++KRHHFLTHAH+DHT G+ P +   FPIYST LT S++LQ+FP L +S FV +E+GQS++V DPDG F VT FDA+
Subjt:  MPIEMPQGVPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIVPHS--SFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAH

Query:  HCP--------GYFGNVLHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQ
        HCP        G FGN+LHTGDCRLT +CL +LPEKY G+S G +P+C L  IFLDCTFG+    Q+FP+ HSAI Q+INCIW HPDAP+VYL C++LGQ
Subjt:  HCP--------GYFGNVLHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQ

Query:  EEILQQVSQTFGSKIFADESTKAS-YKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSE-----ICNTR
        E++L +VS+TFGSKI+ D++T    +++L +I P+I+++DPSSRFH+  GFPKL E     LA+A++   SEPL+IRPS QWYV ++  +     I   R
Subjt:  EEILQQVSQTFGSKIFADESTKAS-YKALELIDPDILTQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSE-----ICNTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLNASVIEVGCSPMVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRA++L+YVKK    S  + +   WKL  I  E+S  +  +   + + C  M E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLNASVIEVGCSPMVE

Query:  APTQTNVDPQLQPV-KLHPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEG
             +   +L+PV +     +++L++ P ++L P+TLFGRAR    +   L E                                      +  +H++ 
Subjt:  APTQTNVDPQLQPV-KLHPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLSEVSYPSTEDEPVEAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEG

Query:  KHKKFENDALLANVNASLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGS
         + K                     P+ E   V V   PL+    D +E   E+E                 I  T   K                   +
Subjt:  KHKKFENDALLANVNASLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILKTHIGKIVNNHRIAGCSNSHLLSVGS

Query:  SKGFNDKFRKLYRSMNVPVPEPLPSLVPVSPARQR
         K  +   RKLYRSMN PVP PLPSL+ +  AR+R
Subjt:  SKGFNDKFRKLYRSMNVPVPEPLPSLVPVSPARQR

AT2G36910.1 ATP binding cassette subfamily B11.4e-1428.63Show/hide
Query:  IEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVY-------------VKKPKCFVFQNPDYQVVMPTVEAD
        +E +++ FS  +R    + I RD SLR  +G+   L+GP+GCGKS+++ ++      ++G V              ++K    V Q P   +   T+  +
Subjt:  IEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVY-------------VKKPKCFVFQNPDYQVVMPTVEAD

Query:  VAFG--------------LGKLNLTNDEVKSRVLEALSARSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTAL
        +A+G              L   +     +       +  R VQ LSGGQKQR+AIA AL    +++LLDE T+ LD   +  V +A+  +       T++
Subjt:  VAFG--------------LGKLNLTNDEVKSRVLEALSARSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIKAVRNSLRNSEDVTAL

Query:  WVTHRLEELEYADGAIYMEDGKVVMHG
         V HRL  +  A     ++DGKV   G
Subjt:  WVTHRLEELEYADGAIYMEDGKVVMHG

AT2G47800.1 multidrug resistance-associated protein 46.2e-1529.2Show/hide
Query:  TIFKVLSFRRIEFLLHHFCSSMVSKSSS-LNLVTSSPPLLSPLFS--SPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPN
        TIFK+L     +  +  F  SM+S S + ++L      ++S   S  +    +    N A+E R+ SFS      +  P L D + ++  G+   ++G  
Subjt:  TIFKVLSFRRIEFLLHHFCSSMVSKSSS-LNLVTSSPPLLSPLFS--SPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPN

Query:  GCGKSTLLKVLAGLLNTTTGAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGK--------LNLTNDEVKSRVLE-----ALSARSVQTLSGGQKQRVA
        G GKS+LL  + G ++  +G V V     +V Q     +   TV+ ++ FGL          LN+ + E   +++E      +  R +  LSGGQKQR+ 
Subjt:  GCGKSTLLKVLAGLLNTTTGAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGK--------LNLTNDEVKSRVLE-----ALSARSVQTLSGGQKQRVA

Query:  IAGALAEACKVLLLDELTTFLDENDQMGVI-KAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHG
        +A A+ + C V LLD++ + +D +    +  K VR +L+     T L VTH+++ L   D  + M DGK+V  G
Subjt:  IAGALAEACKVLLLDELTTFLDENDQMGVI-KAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHG

AT4G18050.1 P-glycoprotein 94.0e-1428.74Show/hide
Query:  SKSSSLNLVTSSPPLLSPLFSSPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK
        S +S  +++ S+P + S   SS         N  IE R++SF    R    V I RD  L IPSG+   L+G +G GKST++ ++    N  +G + + +
Subjt:  SKSSSLNLVTSSPPLLSPLFSSPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK

Query:  PKCFVFQ-----------NPDYQVVMPTVEADVAFGLGKLNLTNDEVKSRVLEA----------------LSARSVQTLSGGQKQRVAIAGALAEACKVL
         +   F+           + +  +   T+ +++A+G      T +E+ +    A                +  R VQ LSGGQKQR+AIA A+ +  K+L
Subjt:  PKCFVFQ-----------NPDYQVVMPTVEADVAFGLGKLNLTNDEVKSRVLEA----------------LSARSVQTLSGGQKQRVAIAGALAEACKVL

Query:  LLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHG
        LLDE T+ LD   +  V  A+   + N    T + V HRL  ++ AD    +++G +   G
Subjt:  LLDELTTFLDENDQMGVIKAVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHG

AT4G33460.1 ABC transporter family protein1.8e-9167.43Show/hide
Query:  LVTSSPPLLSPLFSSPPS-------RINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK
        L  +SPP L PL S   S       RI  ++N+A+E RNL FS++TRQG  VPILRDCS RIPSGQ WM+LGPNGCGKSTLLK+LAG++N ++G V+V+K
Subjt:  LVTSSPPLLSPLFSSPPS-------RINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLLKVLAGLLNTTTGAVYVKK

Query:  PKCFVFQNPDYQVVMPTVEADVAFGLGKL-NLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVI
        PK FVFQNPD+QVVMPTVEADVAFGLGK  ++  +EVKSRV++AL A        R +QTLSGGQKQR+AIAGALAEACKVLLLDELTTFLDE+DQMGVI
Subjt:  PKCFVFQNPDYQVVMPTVEADVAFGLGKL-NLTNDEVKSRVLEALSA--------RSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVI

Query:  KAVRNSLRNSE-DVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYI
        KAV++ +   + DVTALWVTHRLEEL+YADGA+YME+G+VV HGD A+I  FI+ KQS YI
Subjt:  KAVRNSLRNSE-DVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATCGAAATGCCCCAAGGCGTGCCATTTTCAGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACCCACGCTCACAGGGATCACACCAC
TGGGATTGTCCCCCATTCTTCCTTCCCCATTTATTCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAATTTCCTCTGCTTCATGATTCGTTGTTTGTATGTATCGAGG
TGGGGCAATCGCTGGTCGTCAAAGATCCTGATGGAGCTTTCACCGTTACTGTTTTTGATGCTCATCACTGCCCTGGCTATTTTGGCAATGTTCTGCATACGGGTGATTGT
AGATTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCAAGATGTAAACTGGATCTGATTTTTCTGGACTGCACATTTGGTAG
ATTCTTTCAACAATTCCCCAGCACGCATTCAGCAATACATCAGGTTATTAATTGCATTTGGAAACACCCTGATGCTCCGTTAGTATATCTGATTTGCAATCTTCTAGGAC
AGGAAGAGATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGCTGATGAATCCACGAAAGCAAGTTACAAGGCTCTTGAACTTATAGATCCTGACATCCTC
ACTCAAGATCCATCTTCCCGTTTTCATCTGCTTGATGGATTCCCTAAACTATGTGAAACTGCAAGAACACTGCTTGCAGATGCCCAGACCAATTTTCTGTCTGAACCTCT
TGTAATCCGACCTTCAACCCAGTGGTACGTTCGTGAGGAATTATCAGAGATTTGCAACACAAGAAAACAAATAATTAGTGAAGCAATTAAAGATCAACATGGTATTTGGC
ATGTTTGTTACTCGATGCACTCGTCAAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACTCCTGGTTGTCGAGCCGTGGATTTG
GATTATGTGAAAAAGAAACTCAGTTGTTCTAGTTTAACTTCTAATGGCCTAATTTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAAATGCTTCAGTGAT
TGAAGTGGGTTGTTCTCCTATGGTTGAAGCACCAACTCAAACAAACGTAGACCCTCAACTGCAGCCGGTGAAACTGCATCCTGTTCCTCGAGAAATGTTAAACATTTTTC
CTTCAAGCAACTTGCCACCTCTCACACTATTCGGGCGAGCTAGACTTGGCGTTGATGATGCCATTCTGTTGTCAGAAGTTTCATATCCATCTACAGAGGACGAGCCTGTA
GAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCATATGGTAAACTGGGTGACAAATCATCAAAAAATTCTAAAAACGAGGTTCACTCCGAAGGAAAACA
CAAGAAGTTTGAGAATGATGCATTATTAGCTAATGTAAACGCCTCACTTTACTCCGATCGGACTGAGCTCCCTATTTCTGAAATAAGAGCTGTGTCTGTGAACAATAACC
CACTAGAAGTAGTCCACAGTGACATAGAAGAACATGTCCATGAGCAGGAACGTAGAGCAAAGGAAACAGAATCGTTAGACGATCGTAAGGATGTCGGTATTATTCTCAAA
ACACACATTGGAAAGATTGTTAATAATCACAGAATAGCAGGGTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCGAAGGGATTTAACGACAAGTTTCGAAAGTTGTA
TAGGTCAATGAATGTACCTGTGCCTGAGCCCCTTCCTTCGCTGGTCCCCGTCTCTCCTGCACGCCAACGACGACCAGCCACCCCCCGCCGATCAGTTTCCGCCGCCACGC
CTGCCAACCGTCGTTCGTCGCCGTTGGTGAGTCTTCTTTTGCTCCGTTTTCTCTCTCATTCCTCTGTAGAGCTGCGCAGCACCCACCTCCGAACACTATCACTATTTCAA
CGATCACTCCCAAACACAATCTTCAAGGTGCTTTCATTTCGAAGAATTGAATTTCTTCTCCACCATTTCTGCAGCAGCATGGTTTCGAAGAGTTCCTCTCTCAATCTTGT
TACTTCATCGCCACCTCTTCTTTCTCCTCTCTTCTCCAGCCCCCCCTCCAGGATAAATCTTACTGAAAATATAGCCATTGAGGGACGCAATCTCAGCTTCTCGATCACGA
CAAGGCAGGGAAAAATTGTGCCTATTCTTAGGGATTGCTCACTCCGGATTCCTTCGGGACAGTTTTGGATGCTTCTTGGACCTAATGGCTGTGGAAAATCCACCCTCTTG
AAGGTCTTGGCTGGTCTTCTGAATACAACAACTGGTGCAGTATACGTCAAGAAGCCAAAGTGCTTTGTCTTCCAAAACCCGGACTATCAGGTAGTGATGCCTACCGTGGA
AGCTGATGTTGCATTTGGCCTCGGTAAGTTAAATCTGACCAATGATGAAGTTAAATCAAGGGTTTTGGAAGCACTGAGTGCAAGATCCGTCCAAACGCTGAGTGGTGGTC
AGAAGCAAAGAGTTGCCATCGCTGGTGCTTTGGCTGAAGCATGTAAAGTACTTCTATTAGATGAACTGACCACATTTCTGGATGAGAACGATCAGATGGGGGTGATCAAA
GCAGTTAGGAATTCTTTGCGCAATTCTGAAGATGTTACAGCATTATGGGTGACCCATCGCTTAGAAGAACTTGAATATGCAGATGGCGCTATCTACATGGAAGATGGGAA
AGTCGTCATGCATGGTGATGTGGCTAGTATCCAAAGTTTCATCCAAACAAAACAATCAGATTATATCAAGCGTAATTACTATTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATTTTTGCTTGCTGTGGAAGCTTGAGAGAATTATGAATCCTCTAATTAGGGTTCATACTTGGATTTTCGTCTTCTTCCCCAATCGGAATTTCCCCTCAGAACTTGAAGAT
GCCGATCGAAATGCCCCAAGGCGTGCCATTTTCAGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACCCACGCTCACAGGGATCACACCACTG
GGATTGTCCCCCATTCTTCCTTCCCCATTTATTCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAATTTCCTCTGCTTCATGATTCGTTGTTTGTATGTATCGAGGTG
GGGCAATCGCTGGTCGTCAAAGATCCTGATGGAGCTTTCACCGTTACTGTTTTTGATGCTCATCACTGCCCTGGCTATTTTGGCAATGTTCTGCATACGGGTGATTGTAG
ATTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCAAGATGTAAACTGGATCTGATTTTTCTGGACTGCACATTTGGTAGAT
TCTTTCAACAATTCCCCAGCACGCATTCAGCAATACATCAGGTTATTAATTGCATTTGGAAACACCCTGATGCTCCGTTAGTATATCTGATTTGCAATCTTCTAGGACAG
GAAGAGATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGCTGATGAATCCACGAAAGCAAGTTACAAGGCTCTTGAACTTATAGATCCTGACATCCTCAC
TCAAGATCCATCTTCCCGTTTTCATCTGCTTGATGGATTCCCTAAACTATGTGAAACTGCAAGAACACTGCTTGCAGATGCCCAGACCAATTTTCTGTCTGAACCTCTTG
TAATCCGACCTTCAACCCAGTGGTACGTTCGTGAGGAATTATCAGAGATTTGCAACACAAGAAAACAAATAATTAGTGAAGCAATTAAAGATCAACATGGTATTTGGCAT
GTTTGTTACTCGATGCACTCGTCAAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACTCCTGGTTGTCGAGCCGTGGATTTGGA
TTATGTGAAAAAGAAACTCAGTTGTTCTAGTTTAACTTCTAATGGCCTAATTTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAAATGCTTCAGTGATTG
AAGTGGGTTGTTCTCCTATGGTTGAAGCACCAACTCAAACAAACGTAGACCCTCAACTGCAGCCGGTGAAACTGCATCCTGTTCCTCGAGAAATGTTAAACATTTTTCCT
TCAAGCAACTTGCCACCTCTCACACTATTCGGGCGAGCTAGACTTGGCGTTGATGATGCCATTCTGTTGTCAGAAGTTTCATATCCATCTACAGAGGACGAGCCTGTAGA
AGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCATATGGTAAACTGGGTGACAAATCATCAAAAAATTCTAAAAACGAGGTTCACTCCGAAGGAAAACACA
AGAAGTTTGAGAATGATGCATTATTAGCTAATGTAAACGCCTCACTTTACTCCGATCGGACTGAGCTCCCTATTTCTGAAATAAGAGCTGTGTCTGTGAACAATAACCCA
CTAGAAGTAGTCCACAGTGACATAGAAGAACATGTCCATGAGCAGGAACGTAGAGCAAAGGAAACAGAATCGTTAGACGATCGTAAGGATGTCGGTATTATTCTCAAAAC
ACACATTGGAAAGATTGTTAATAATCACAGAATAGCAGGGTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCGAAGGGATTTAACGACAAGTTTCGAAAGTTGTATA
GGTCAATGAATGTACCTGTGCCTGAGCCCCTTCCTTCGCTGGTCCCCGTCTCTCCTGCACGCCAACGACGACCAGCCACCCCCCGCCGATCAGTTTCCGCCGCCACGCCT
GCCAACCGTCGTTCGTCGCCGTTGGTGAGTCTTCTTTTGCTCCGTTTTCTCTCTCATTCCTCTGTAGAGCTGCGCAGCACCCACCTCCGAACACTATCACTATTTCAACG
ATCACTCCCAAACACAATCTTCAAGGTGCTTTCATTTCGAAGAATTGAATTTCTTCTCCACCATTTCTGCAGCAGCATGGTTTCGAAGAGTTCCTCTCTCAATCTTGTTA
CTTCATCGCCACCTCTTCTTTCTCCTCTCTTCTCCAGCCCCCCCTCCAGGATAAATCTTACTGAAAATATAGCCATTGAGGGACGCAATCTCAGCTTCTCGATCACGACA
AGGCAGGGAAAAATTGTGCCTATTCTTAGGGATTGCTCACTCCGGATTCCTTCGGGACAGTTTTGGATGCTTCTTGGACCTAATGGCTGTGGAAAATCCACCCTCTTGAA
GGTCTTGGCTGGTCTTCTGAATACAACAACTGGTGCAGTATACGTCAAGAAGCCAAAGTGCTTTGTCTTCCAAAACCCGGACTATCAGGTAGTGATGCCTACCGTGGAAG
CTGATGTTGCATTTGGCCTCGGTAAGTTAAATCTGACCAATGATGAAGTTAAATCAAGGGTTTTGGAAGCACTGAGTGCAAGATCCGTCCAAACGCTGAGTGGTGGTCAG
AAGCAAAGAGTTGCCATCGCTGGTGCTTTGGCTGAAGCATGTAAAGTACTTCTATTAGATGAACTGACCACATTTCTGGATGAGAACGATCAGATGGGGGTGATCAAAGC
AGTTAGGAATTCTTTGCGCAATTCTGAAGATGTTACAGCATTATGGGTGACCCATCGCTTAGAAGAACTTGAATATGCAGATGGCGCTATCTACATGGAAGATGGGAAAG
TCGTCATGCATGGTGATGTGGCTAGTATCCAAAGTTTCATCCAAACAAAACAATCAGATTATATCAAGCGTAATTACTATTCGTGACTGTTATTTCTCTTTCAACTGATG
TTTTTCTTGTCACATTATTATAACATTGTAGTATATAATTTTGTATCATCTTTTTACATAATAAAATCATTTTATCAGCTGAAATTTGTTGGC
Protein sequenceShow/hide protein sequence
MPIEMPQGVPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIVPHSSFPIYSTFLTKSIVLQQFPLLHDSLFVCIEVGQSLVVKDPDGAFTVTVFDAHHCPGYFGNVLHTGDC
RLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSTHSAIHQVINCIWKHPDAPLVYLICNLLGQEEILQQVSQTFGSKIFADESTKASYKALELIDPDIL
TQDPSSRFHLLDGFPKLCETARTLLADAQTNFLSEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAVDL
DYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLNASVIEVGCSPMVEAPTQTNVDPQLQPVKLHPVPREMLNIFPSSNLPPLTLFGRARLGVDDAILLSEVSYPSTEDEPV
EAVGDKVADLSIHDAYGKLGDKSSKNSKNEVHSEGKHKKFENDALLANVNASLYSDRTELPISEIRAVSVNNNPLEVVHSDIEEHVHEQERRAKETESLDDRKDVGIILK
THIGKIVNNHRIAGCSNSHLLSVGSSKGFNDKFRKLYRSMNVPVPEPLPSLVPVSPARQRRPATPRRSVSAATPANRRSSPLVSLLLLRFLSHSSVELRSTHLRTLSLFQ
RSLPNTIFKVLSFRRIEFLLHHFCSSMVSKSSSLNLVTSSPPLLSPLFSSPPSRINLTENIAIEGRNLSFSITTRQGKIVPILRDCSLRIPSGQFWMLLGPNGCGKSTLL
KVLAGLLNTTTGAVYVKKPKCFVFQNPDYQVVMPTVEADVAFGLGKLNLTNDEVKSRVLEALSARSVQTLSGGQKQRVAIAGALAEACKVLLLDELTTFLDENDQMGVIK
AVRNSLRNSEDVTALWVTHRLEELEYADGAIYMEDGKVVMHGDVASIQSFIQTKQSDYIKRNYYS