; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg13568 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg13568
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionFormimidoyltransferase-cyclodeaminase-like
Genome locationCarg_Chr02:651768..657675
RNA-Seq ExpressionCarg13568
SyntenyCarg13568
Gene Ontology termsGO:0006820 - anion transport (biological process)
GO:0034220 - ion transmembrane transport (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0005542 - folic acid binding (molecular function)
GO:0008381 - mechanosensitive ion channel activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR012886 - Formiminotransferase, N-terminal subdomain
IPR019191 - Essential protein Yae1, N-terminal
IPR022384 - Formiminotransferase catalytic domain superfamily
IPR037064 - Formiminotransferase, N-terminal subdomain superfamily
IPR037070 - Formiminotransferase, C-terminal subdomain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8075869.1 hypothetical protein FH972_014552 [Carpinus fangiana]2.6e-24766.57Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M KL L CCKVYISESRN+AALE+IEQAAKLFPE +IINKF +E YNRVGYTLVSK+  KPS   CSLRSAVL+MVKAA   IDL+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGL-WAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PLA+ASLD  A IAKSLAAD+G  +QVPTFLYGAAHEEGR L +IRRELGYFKPNS G  W+GG KS+ LPLKPDEGPA+ SK KGV+VIGAT WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGL-WAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA
        DNYNIP+FSTDI A+R +AK++S RGGGLPSVQAMALAHD+ V EVACNL +PSKV G +VQQEVERL+  EG+AVGK        ++  +IV     + 
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA

Query:  LHLGDFKVVRTHGFTSPPTPKIPLLKPH--TLPPNLQQTTTLSDSH---LQMAFN-LTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS
        L L  F++  T    S    K  +L+    +L P L+       ++   L   F     KDK+ S++Q  LLCCK F+SE+RN + L+AIERA   DP++
Subjt:  LHLGDFKVVRTHGFTSPPTPKIPLLKPH--TLPPNLQQTTTLSDSH---LQMAFN-LTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS

Query:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV
        VIVNKF DR+YNRTRYT+VSYVVHD  G+A+YSPL QTVLAM + AF  +NLE HSGAHPRLGVVDDI+FHPLA+ASL EAAWLAKAVA D+   FQVPV
Subjt:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV

Query:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ
        +LY+AAHP+GKA D +RRELG++RPN+ GNQW GW MPE LPEKPDEGP +VSR RGI+MIGARPW A+YN+PILS DV+A R+IARMVSARGGGLPTVQ
Subjt:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ

Query:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD
        T+GL++ +++TEIAC+LLEPNQIGADRVQ +VE+LAA+ GL+VE GYFTD+SPEMV+E Y+ LI+ +
Subjt:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD

KAF9670855.1 hypothetical protein SADUNF_Sadunf13G0112400 [Salix dunnii]6.2e-27361.75Show/hide
Query:  DDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQE
        +DIFDSSLNLEETH KEGY EGY  GL++GKEEA+  GLK+GFE+GEE+GF+RGCVDVW S I ++  +FS+R+++++K+MEEL+E+YP+ DPE+E+V E
Subjt:  DDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQE

Query:  LMEGLRLKFRTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLN
        +M+ LRLKFR I A LG                   V +    N+ ALESIE+AAKLFPEA I+NKF +  YNRVGYTLVS L  KPS+  C+L+ AVL+
Subjt:  LMEGLRLKFRTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLN

Query:  MVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSLP
        M+KAA   ID  SHCGSHPRLGVVDH C +PLA +SLD AA IAKSLA DVG  L+VPTFLYGAA+ EGR L +IRRELGYFKPNS   WAGG KS+SLP
Subjt:  MVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSLP

Query:  LKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLA
        LKPDEGPA  ++AKGV+VIGAT WVDNYN+P+FSTDI AVR IAK+VS RGGGLPSVQAMALAH + VIEVACNL +PS V G+MVQQEVERLA+ EGL 
Subjt:  LKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLA

Query:  VGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTH---GFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFV
        +     RR  L      + +  +   HL    V R H    F    + KI   + H +                       K+K+ + E   L+CC  F+
Subjt:  VGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTH---GFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFV

Query:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL
        SE+RN + L+ IERA   DP+SVIVNKFED+ YNR R+TIVSYVV D+ G+ IYSPLHQTVLA+ E A+  INLE HSGAHPRLGVVDDI FHPLA ASL
Subjt:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL

Query:  HEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTD
         EAAWLAKAVA D+ + FQVPVFLY+AAHP+G+APD +RRELGY+RPN+ G+ WAGW++PE LPE PD GP+ VSR RG+T+IGAR W   YN+PI+ TD
Subjt:  HEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTD

Query:  VAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA
        V+  RRIARMVSARGGGLPTVQ++ L H D++TEIAC+LLEPN+IGADRVQ +VE+LAAQ GL+VE GYFTD+SPEM+V+KY+NLI+
Subjt:  VAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA

KAG7034931.1 Glutamate formimidoyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MDDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ
        MDDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ
Subjt:  MDDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ

Query:  ELMEGLRLKFRTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVL
        ELMEGLRLKFRTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVL
Subjt:  ELMEGLRLKFRTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVL

Query:  NMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSL
        NMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSL
Subjt:  NMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSL

Query:  PLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGL
        PLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGL
Subjt:  PLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGL

Query:  AVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSE
        AVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSE
Subjt:  AVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSE

Query:  SRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHE
        SRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHE
Subjt:  SRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHE

Query:  AAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVA
        AAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVA
Subjt:  AAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVA

Query:  ATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGDH
        ATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGDH
Subjt:  ATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGDH

PQM39096.1 glutamate formimidoyltransferase-like [Prunus yedoensis var. nudiflora]1.2e-22863.36Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M K  L CCKVYISESRN+AALE IE+AAKLF EA I+NKF +E YNRVGYTLVSKL  KPS  PC LR AVL MVKAAF  IDL+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PL  ASLD  A +A  L ADVG  LQVPTFLYGAAHEEGR L +IRRELGYFKP S G  W GG KS+ L LKPD+GP + ++ KGVIVIGAT WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA
        DNYN+P+FSTDI AVR IAK+VS RGGGLPSVQAMALAH E VIEVACNL +P KV G  VQ EVERL+E EG+ VGK                      
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA

Query:  LHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDS----HLQMAFNLTAKD-KRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDSV
         +  DF   +             L++ +     ++QT TL+        +M  +   KD K+ +++Q  LLCCK ++SESRN + L+AIERA   DP+SV
Subjt:  LHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDS----HLQMAFNLTAKD-KRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDSV

Query:  IVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPVF
        IVNKFEDRAYNR RYTIVSYV+HD+ G+AIYSPL QTV+AM E AF  INLE HSGAHPRLGVVDDI+FHPLARASL EAAWLAKAVA D+   FQVPV+
Subjt:  IVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPVF

Query:  LYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQT
        LY+AAHP+GKA D +RRELGY+RPN+ G+QWAGW+MPE L EKPDEGP S+   RGI+MIGA               VAATRRIARMVSARGGGLPTVQT
Subjt:  LYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQT

Query:  IGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD
        +GL+H +++TEIAC+LLEPNQIG +RVQ  VE+LAAQ GL+VE GYFTD+SP+M++EKY+ L + D
Subjt:  IGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD

RXH89850.1 hypothetical protein DVH24_032207 [Malus domestica]1.6e-27362.06Show/hide
Query:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ
        DDIF +  LNLE TH KEG+ EGY+DGLVAGKEEAK  G+K GFEVGEE+GF+RGCVDVW S I +D  RF+ RVQK+V+QMEE++E+YP+ +PE+E V 
Subjt:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ

Query:  ELMEGLRLKFRTICATLGL---------VTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSI
        ++ME LRLKFR +CA+LG+         V   +E + M K  L CCKVYISESRN+AALES+E+AAKLF EA I+NKF +E YNRVGYTLVS L  KPS+
Subjt:  ELMEGLRLKFRTICATLGL---------VTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSI

Query:  KPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-
         P  L+ AVL MVKAAF  IDL+SHCGSHPRLGVVDH C +PL  ASL+  A +A SLAA+VG  LQVPTFLYGAAHEEGR L ++RRELGYFKPNS G 
Subjt:  KPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-

Query:  LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQ
         W GG KSD L LKPD+GP + ++ +GVIVIGAT WVDNYN+P+ STDI AVR IAK+VS RGGGL SVQAMALAH E +IEVACNL +P KV G  VQ 
Subjt:  LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQ

Query:  EVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQK
        EVERLA+ EG+ VGK                       +  DF   R                       +++    +       F +    K+ +++Q 
Subjt:  EVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQK

Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
         LLCCK ++SESRN + L+ IERA   DP+SVIVNKFEDR YNR RYTIVSYV+HD+ G+AIYSPL QTV+AMTE AF  INLE HSGAHPRLGVVDDI+
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAM
        FHPLARASL EAAWLAKAVA D+   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+         MP+ L EKPDEGP  VS  RGITMIGAR W  +
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAM

Query:  YNVPILSTDVAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA
        YN+PI+STDVAAT+RIARMVSARGGGLPTVQT+GLLH +++TEIAC+LL+PNQ+GAD VQ  VE+LAAQ GL+VE GYFTDYSP+M+ EKY+ LI+
Subjt:  YNVPILSTDVAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA

TrEMBL top hitse value%identityAlignment
A0A314UQS5 Glutamate formimidoyltransferase-like6.0e-22963.36Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M K  L CCKVYISESRN+AALE IE+AAKLF EA I+NKF +E YNRVGYTLVSKL  KPS  PC LR AVL MVKAAF  IDL+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PL  ASLD  A +A  L ADVG  LQVPTFLYGAAHEEGR L +IRRELGYFKP S G  W GG KS+ L LKPD+GP + ++ KGVIVIGAT WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA
        DNYN+P+FSTDI AVR IAK+VS RGGGLPSVQAMALAH E VIEVACNL +P KV G  VQ EVERL+E EG+ VGK                      
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA

Query:  LHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDS----HLQMAFNLTAKD-KRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDSV
         +  DF   +             L++ +     ++QT TL+        +M  +   KD K+ +++Q  LLCCK ++SESRN + L+AIERA   DP+SV
Subjt:  LHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDS----HLQMAFNLTAKD-KRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDSV

Query:  IVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPVF
        IVNKFEDRAYNR RYTIVSYV+HD+ G+AIYSPL QTV+AM E AF  INLE HSGAHPRLGVVDDI+FHPLARASL EAAWLAKAVA D+   FQVPV+
Subjt:  IVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPVF

Query:  LYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQT
        LY+AAHP+GKA D +RRELGY+RPN+ G+QWAGW+MPE L EKPDEGP S+   RGI+MIGA               VAATRRIARMVSARGGGLPTVQT
Subjt:  LYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQT

Query:  IGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD
        +GL+H +++TEIAC+LLEPNQIG +RVQ  VE+LAAQ GL+VE GYFTD+SP+M++EKY+ L + D
Subjt:  IGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD

A0A498J302 Uncharacterized protein8.0e-27462.06Show/hide
Query:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ
        DDIF +  LNLE TH KEG+ EGY+DGLVAGKEEAK  G+K GFEVGEE+GF+RGCVDVW S I +D  RF+ RVQK+V+QMEE++E+YP+ +PE+E V 
Subjt:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ

Query:  ELMEGLRLKFRTICATLGL---------VTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSI
        ++ME LRLKFR +CA+LG+         V   +E + M K  L CCKVYISESRN+AALES+E+AAKLF EA I+NKF +E YNRVGYTLVS L  KPS+
Subjt:  ELMEGLRLKFRTICATLGL---------VTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSI

Query:  KPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-
         P  L+ AVL MVKAAF  IDL+SHCGSHPRLGVVDH C +PL  ASL+  A +A SLAA+VG  LQVPTFLYGAAHEEGR L ++RRELGYFKPNS G 
Subjt:  KPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-

Query:  LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQ
         W GG KSD L LKPD+GP + ++ +GVIVIGAT WVDNYN+P+ STDI AVR IAK+VS RGGGL SVQAMALAH E +IEVACNL +P KV G  VQ 
Subjt:  LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQ

Query:  EVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQK
        EVERLA+ EG+ VGK                       +  DF   R                       +++    +       F +    K+ +++Q 
Subjt:  EVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQK

Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
         LLCCK ++SESRN + L+ IERA   DP+SVIVNKFEDR YNR RYTIVSYV+HD+ G+AIYSPL QTV+AMTE AF  INLE HSGAHPRLGVVDDI+
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAM
        FHPLARASL EAAWLAKAVA D+   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+         MP+ L EKPDEGP  VS  RGITMIGAR W  +
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAM

Query:  YNVPILSTDVAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA
        YN+PI+STDVAAT+RIARMVSARGGGLPTVQT+GLLH +++TEIAC+LL+PNQ+GAD VQ  VE+LAAQ GL+VE GYFTDYSP+M+ EKY+ LI+
Subjt:  YNVPILSTDVAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA

A0A5N5GAR3 Formimidoyltransferase-cyclodeaminase-like6.5e-19149.3Show/hide
Query:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ
        DDIF +  LNLE  H KEG+ EGY++GL+AGKEEAK  G+K GFEVGEE+GF++GCVDVW S I +D  RFS RVQK+V+QMEEL+E+YP+ +PE+E V 
Subjt:  DDIF-DSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQ

Query:  ELMEGLRLKFRTICATLGLVTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAV
        ++ME LRLKFR +CA+  LV   +E   M K  L CCKVYISESRN+AAL+SIE+AAKLF EA I+NKF +E YNRVGYTL+SKL  +PS   C LR AV
Subjt:  ELMEGLRLKFRTICATLGLVTCSRE-KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAV

Query:  LNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSD
        L MVKAA S++                                                PTFLYGAAHEEGR L ++ RELGYFKPNS G  W  G    
Subjt:  LNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSD

Query:  SLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESE
                                   VDNYN+P+ STDI +VR IAK VS RG GL SVQAMA AH E VIEVACNL +P KV G              
Subjt:  SLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESE

Query:  GLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFV
            G+D                                                                          K K+ +++Q  LLCCK+F+
Subjt:  GLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFV

Query:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL
        SE RN +              SVI+NKFEDRAYNR  YTIVSYV+HD+ G+AIYSPL QTV+ M E AF  INLE HSGAHPRLGVVDDI+FHP ARASL
Subjt:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL

Query:  HEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTD
         EAAWLAKAVA D+  +FQ     Y+A H +GKA D + RELGY++PN+ G+QWAGW MP+                 GI+MI  RPW A+YN+P+LSTD
Subjt:  HEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTD

Query:  VAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA
        VAAT+ +ARMV+ARGGGLPTVQT+GL+H +++TE AC+LLEPN +G DRVQ  +E+LAAQ GL+VE GYFTD+SP+M++EKY+ LI+
Subjt:  VAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIA

A0A5N6RAR5 Uncharacterized protein1.3e-24766.57Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M KL L CCKVYISESRN+AALE+IEQAAKLFPE +IINKF +E YNRVGYTLVSK+  KPS   CSLRSAVL+MVKAA   IDL+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGL-WAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PLA+ASLD  A IAKSLAAD+G  +QVPTFLYGAAHEEGR L +IRRELGYFKPNS G  W+GG KS+ LPLKPDEGPA+ SK KGV+VIGAT WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGL-WAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA
        DNYNIP+FSTDI A+R +AK++S RGGGLPSVQAMALAHD+ V EVACNL +PSKV G +VQQEVERL+  EG+AVGK        ++  +IV     + 
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAA

Query:  LHLGDFKVVRTHGFTSPPTPKIPLLKPH--TLPPNLQQTTTLSDSH---LQMAFN-LTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS
        L L  F++  T    S    K  +L+    +L P L+       ++   L   F     KDK+ S++Q  LLCCK F+SE+RN + L+AIERA   DP++
Subjt:  LHLGDFKVVRTHGFTSPPTPKIPLLKPH--TLPPNLQQTTTLSDSH---LQMAFN-LTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS

Query:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV
        VIVNKF DR+YNRTRYT+VSYVVHD  G+A+YSPL QTVLAM + AF  +NLE HSGAHPRLGVVDDI+FHPLA+ASL EAAWLAKAVA D+   FQVPV
Subjt:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV

Query:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ
        +LY+AAHP+GKA D +RRELG++RPN+ GNQW GW MPE LPEKPDEGP +VSR RGI+MIGARPW A+YN+PILS DV+A R+IARMVSARGGGLPTVQ
Subjt:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ

Query:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD
        T+GL++ +++TEIAC+LLEPNQIGADRVQ +VE+LAA+ GL+VE GYFTD+SPEMV+E Y+ LI+ +
Subjt:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNLIAGD

A0A7N2RCG2 Uncharacterized protein1.2e-19755.51Show/hide
Query:  REKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKL--PSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPR
        ++K + +  L CCK++ISESRN AAL++IEQAA+L PE  I+NKF +  YNRV YTLVS +   +  S     L+ AVL MV+AAF AI+L+ H G+HPR
Subjt:  REKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKL--PSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPR

Query:  LGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVI
        LGVVD    +PLA ASLD+AA ++K++A D+    QVP FLY AAH  G+ L TIRRELGY++PN  G  WAG    D L  KPDEGP+E S+A+G+ +I
Subjt:  LGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVI

Query:  GATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVT
        GA  WV  YNIPI STD    R IA+ V+ RGGGLP+VQ + L H E   E+AC L +P+++    VQ +                              
Subjt:  GATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVT

Query:  WDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS
                                                                    DK+ ++EQ  LLCCK F+SESRNR  L+AIE+A   +P++
Subjt:  WDTIAALHLGDFKVVRTHGFTSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDS

Query:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV
        VIVNKFEDRAYNR RYT+VSYVV D  GNAIYSPL Q VLAM E AF  INLE HSG HPRLGVVDDI+FHPLARASL EAAWLAKAVA D+A  FQVPV
Subjt:  VIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPV

Query:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ
        FLY+AAHP+GKA D +RRELGY+RPN  GNQWAGW+MP+ L EKPDEGP+ VSR RGITMIGARPW A+YN+PILSTD +  RRIARMVSARGGGLPTVQ
Subjt:  FLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQ

Query:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNL
        T+GL+H +++TEIAC+LLEPNQIGADRVQ +VE+LAA+ G+EVE GYFTD+SPEM++EKY+NL
Subjt:  TIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEMVVEKYLNL

SwissProt top hitse value%identityAlignment
O88618 Formimidoyltransferase-cyclodeaminase1.2e-1630.68Show/hide
Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
        +L+ C    SE  N+ V++AI +A+SQ P  V+++     + NRT YT V       +G           L+    A   I++  H G HPR+G +D   
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG
        F P+   S+ E    AKA  + +A    VPV+LY  A     A    R+ L    P     ++      E LPEK       PD GP+S     G T+ G
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG

Query:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE
        AR +   +N+ +LST   A  RIA  +  +G      G L  VQ IG  L +    +++  LL+
Subjt:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE

P53603 Formimidoyltransferase-cyclodeaminase2.0e-1630.31Show/hide
Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
        +L+ C    SE +N+ V++AI RAV+Q P  V+++     + NRT YT V       +G           L     A+  I++  H G HPR+G +D   
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLY-SAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMI
        F P+   ++ E    A+A  + +A    VPV+LY  AA  +G+      R   Y                E LPEK       PD GP++     G T+ 
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLY-SAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMI

Query:  GARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIGLLHDDE
        GAR +   +N+ +LST   A  RIA  +  +G      G L  VQ IG   D++
Subjt:  GARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIGLLHDDE

Q91XD4 Formimidoyltransferase-cyclodeaminase4.1e-1731.06Show/hide
Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
        +L+ C    SE  N+ V++AI RA+SQ P  V+++     + NRT YT V       +G           L     A   I++  H G HPR+G +D   
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG
        F P+   S+ E    AKA  + +A    VPV+LY  A     A    R+ L    P     ++      E LPEK       PD GP+S     G T+ G
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG

Query:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE
        AR +   +N+ +LST   A  RIA  +  +G      G L  VQ IG  L +    +++  LL+
Subjt:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE

Q99XR4 Glutamate formimidoyltransferase3.1e-1724.41Show/hide
Query:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL
        SE +N++V++ +       P   +++   D ++NR+ +T+V     D +       + +    + + A  NI++  H G HPR+G  D   F P+   + 
Subjt:  SESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASL

Query:  HEAAWLAKAVAKDMATMFQVPVFLY--SAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK---PDEGPNSVSRERGITMIGARPWTAMYNVP
         E   ++K VA+ +     +P+FLY  SA  P        R+ L   R      Q+ G  MPE L E+   PD G   +    G+T +GAR     +NV 
Subjt:  HEAAWLAKAVAKDMATMFQVPVFLY--SAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK---PDEGPNSVSRERGITMIGARPWTAMYNVP

Query:  ILSTDVAATRRIARMVSARGGGLPTVQTIG-LLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEM----VVEKYLNL
        + + ++    +IA+++   GGG    + IG +L D    +++  ++   +    R    ++  A ++G+ V        +P      V E YL +
Subjt:  ILSTDVAATRRIARMVSARGGGLPTVQTIG-LLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTDYSPEM----VVEKYLNL

Q9YH58 Formimidoyltransferase-cyclodeaminase7.7e-1629.17Show/hide
Query:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII
        +L+ C    SE  N+ V+EA+ RA+SQ P   +++     + NRT YT V       +G           L+   +A+  I++  H G HPR+G +D   
Subjt:  ELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNAIYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDII

Query:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG
        F P+   S+ E    A    + ++    VPV+LY  A     A  + RR L    P     ++      E LP+K       PD GP +   + G T+ G
Subjt:  FHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPETLPEK-------PDEGPNSVSRERGITMIG

Query:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE
        AR +   YN+ +L T   A  RIA  +  +G      G L  VQ IG  L ++   +++  LL+
Subjt:  ARPWTAMYNVPILSTDVAATRRIARMVSARG------GGLPTVQTIG-LLHDDETTEIACVLLE

Arabidopsis top hitse value%identityAlignment
AT2G20830.1 transferases;folic acid binding1.9e-8658.63Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M +  L CCKVYISE+RNK ALE+IE+A K FP A+I+NKF +  Y RVGYT+VS L +  S    SL++AV  MVK A   I+L+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PL+  S++  +++A SLA D+G  L+VPT+LYGAA +E   L +IRR+LGYFK N +G  WAGG   + +PLKPD GP E SKAKGV+ +GA  WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGK
         NYN+P+ S D+ AVR IA++ SERGGGL SVQ MAL H EGVIEVACNL  PS+V G  VQ  +ERL   EGL VGK
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGK

AT2G20830.2 transferases;folic acid binding1.0e-11954.19Show/hide
Query:  DDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQE
        +D  D  + LEETH+++G+ EGY++GLV+G+E+A+H+GLK+GFE GE IGF+RGC  +W S + ID  RFS ++ K +     L+++ PL DPE+E    
Subjt:  DDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQE

Query:  LMEGLRLKFRTICATLGLVTCSRE--KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAV
        + + LR+KF  ICA+LG      E  + M +  L CCKVYISE+RNK ALE+IE+A K FP A+I+NKF +  Y RVGYT+VS L +  S    SL++AV
Subjt:  LMEGLRLKFRTICATLGLVTCSRE--KVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAV

Query:  LNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSD
          MVK A   I+L+ HCGSHPRLGVVDH C +PL+  S++  +++A SLA D+G  L+VPT+LYGAA +E   L +IRR+LGYFK N +G  WAGG   +
Subjt:  LNMVKAAFSAIDLDSHCGSHPRLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSD

Query:  SLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESE
         +PLKPD GP E SKAKGV+ +GA  WV NYN+P+ S D+ AVR IA++ SERGGGL SVQ MAL H EGVIEVACNL  PS+V G  VQ  +ERL   E
Subjt:  SLPLKPDEGPAEASKAKGVIVIGATEWVDNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESE

Query:  GLAVGK
        GL VGK
Subjt:  GLAVGK

AT2G20830.3 transferases;folic acid binding1.9e-8658.63Show/hide
Query:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH
        M +  L CCKVYISE+RNK ALE+IE+A K FP A+I+NKF +  Y RVGYT+VS L +  S    SL++AV  MVK A   I+L+ HCGSHPRLGVVDH
Subjt:  MSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHPRLGVVDH

Query:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV
         C +PL+  S++  +++A SLA D+G  L+VPT+LYGAA +E   L +IRR+LGYFK N +G  WAGG   + +PLKPD GP E SKAKGV+ +GA  WV
Subjt:  TCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDG-LWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWV

Query:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGK
         NYN+P+ S D+ AVR IA++ SERGGGL SVQ MAL H EGVIEVACNL  PS+V G  VQ  +ERL   EGL VGK
Subjt:  DNYNIPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGACATCTTCGATTCCTCCCTCAATCTTGAAGAGACCCATTTGAAGGAAGGCTACGCCGAGGGCTACAAAGACGGCCTAGTCGCTGGCAAAGAAGAGGCAAAGCA
CGTAGGCCTTAAAGTTGGTTTCGAGGTCGGCGAGGAAATTGGATTCTTCAGAGGGTGTGTGGACGTCTGGAAATCTGTTATTCTGATCGATTCAGAACGGTTTTCGAGTC
GGGTTCAGAAGAGTGTGAAGCAGATGGAGGAGTTGGTAGAACAATACCCGCTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAGGATTGAGGCTCAAGTTC
AGAACGATTTGCGCCACTCTGGGTCTAGTGACCTGCAGTAGGGAGAAAGTAATGTCAAAATTGGCTCTTGCTTGCTGCAAGGTTTACATATCTGAAAGCAGAAACAAGGC
TGCGCTGGAGTCAATTGAACAAGCTGCCAAGCTCTTCCCTGAAGCATCCATTATCAACAAGTTCACGAATGAGGTTTACAATAGAGTTGGGTATACCCTTGTTTCCAAGC
TCCCATCAAAGCCATCTATAAAGCCATGTTCTCTGAGAAGTGCAGTCCTGAACATGGTTAAGGCTGCATTTTCGGCAATCGACTTGGACTCGCATTGTGGCAGCCACCCA
CGACTCGGAGTTGTAGATCATACATGCTTATATCCTTTGGCCTTTGCTTCCTTGGACGATGCAGCTGCAATTGCAAAATCTCTGGCAGCTGATGTTGGATGTGGTCTACA
AGTCCCAACATTTCTATATGGAGCGGCTCATGAAGAGGGAAGGAAGCTAGCCACGATCCGAAGAGAGCTGGGTTATTTCAAGCCAAATTCTGATGGCTTATGGGCTGGAG
GGCTGAAATCAGATTCATTGCCACTAAAGCCAGATGAGGGTCCAGCTGAAGCAAGTAAAGCAAAAGGGGTCATCGTGATAGGAGCAACAGAGTGGGTTGATAACTACAAC
ATCCCGATCTTCTCTACCGATATCGTCGCTGTTCGTAACATTGCAAAGCAAGTCAGTGAGAGAGGAGGTGGGCTTCCTTCAGTTCAAGCAATGGCCCTTGCTCATGATGA
AGGTGTGATTGAGGTGGCTTGTAATTTGCAACAACCAAGTAAAGTGGAAGGAAAAATGGTTCAGCAGGAGGTTGAACGGCTTGCAGAAAGTGAGGGATTAGCTGTGGGGA
AAGACGGAGGACGCCGAATAAGACTGGTAGCCGCTGCAGACATAGTCACTTGGGACACAATCGCAGCACTCCATTTGGGTGACTTTAAAGTTGTTAGAACTCACGGGTTC
ACATCTCCCCCGACCCCCAAAATCCCCCTTTTAAAACCTCACACTCTTCCCCCTAATCTGCAACAAACAACCACTCTATCAGATTCACACCTACAAATGGCTTTCAATCT
CACCGCCAAGGACAAGAGAATAAGCTTGGAGCAAAAGGAACTTCTGTGCTGCAAATACTTCGTTTCTGAATCCCGCAATCGCTCTGTACTGGAAGCCATCGAGAGAGCTG
TAAGCCAAGACCCAGATTCTGTAATCGTCAACAAATTCGAAGACAGAGCTTACAACAGGACCCGATACACCATCGTCTCCTATGTCGTCCACGACGCTAAAGGAAACGCC
ATTTACAGCCCATTGCATCAAACAGTTCTGGCCATGACGGAGGTTGCTTTCGCTAACATCAACCTCGAGTCCCATTCTGGCGCTCACCCTCGTCTTGGAGTCGTCGACGA
CATCATTTTCCATCCCCTGGCTCGGGCGTCACTTCACGAAGCCGCTTGGCTCGCTAAGGCGGTCGCTAAGGACATGGCCACTATGTTTCAAGTGCCTGTATTTCTGTACT
CTGCGGCTCATCCAAGTGGGAAGGCGCCGGACGATTTGAGACGGGAGCTTGGGTATTTCCGGCCGAATTACAATGGGAATCAATGGGCCGGGTGGTCGATGCCGGAAACT
TTGCCGGAAAAGCCTGATGAAGGTCCAAATTCTGTCTCTCGGGAGAGGGGAATCACAATGATCGGTGCGCGTCCGTGGACGGCGATGTACAATGTTCCGATATTGTCGAC
TGATGTGGCAGCTACTCGGAGAATAGCGCGGATGGTGAGTGCTAGAGGAGGAGGGCTGCCGACGGTGCAAACCATAGGGCTTCTTCACGACGATGAGACGACGGAGATAG
CCTGCGTTTTGCTGGAGCCCAACCAGATCGGAGCCGATCGGGTCCAGAGACGGGTGGAGGTTCTTGCCGCCCAATTTGGGTTGGAAGTTGAGAATGGTTATTTCACCGAT
TACTCGCCGGAGATGGTTGTCGAGAAATACTTGAATTTGATCGCCGGAGATCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGACATCTTCGATTCCTCCCTCAATCTTGAAGAGACCCATTTGAAGGAAGGCTACGCCGAGGGCTACAAAGACGGCCTAGTCGCTGGCAAAGAAGAGGCAAAGCA
CGTAGGCCTTAAAGTTGGTTTCGAGGTCGGCGAGGAAATTGGATTCTTCAGAGGGTGTGTGGACGTCTGGAAATCTGTTATTCTGATCGATTCAGAACGGTTTTCGAGTC
GGGTTCAGAAGAGTGTGAAGCAGATGGAGGAGTTGGTAGAACAATACCCGCTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAGGATTGAGGCTCAAGTTC
AGAACGATTTGCGCCACTCTGGGTCTAGTGACCTGCAGTAGGGAGAAAGTAATGTCAAAATTGGCTCTTGCTTGCTGCAAGGTTTACATATCTGAAAGCAGAAACAAGGC
TGCGCTGGAGTCAATTGAACAAGCTGCCAAGCTCTTCCCTGAAGCATCCATTATCAACAAGTTCACGAATGAGGTTTACAATAGAGTTGGGTATACCCTTGTTTCCAAGC
TCCCATCAAAGCCATCTATAAAGCCATGTTCTCTGAGAAGTGCAGTCCTGAACATGGTTAAGGCTGCATTTTCGGCAATCGACTTGGACTCGCATTGTGGCAGCCACCCA
CGACTCGGAGTTGTAGATCATACATGCTTATATCCTTTGGCCTTTGCTTCCTTGGACGATGCAGCTGCAATTGCAAAATCTCTGGCAGCTGATGTTGGATGTGGTCTACA
AGTCCCAACATTTCTATATGGAGCGGCTCATGAAGAGGGAAGGAAGCTAGCCACGATCCGAAGAGAGCTGGGTTATTTCAAGCCAAATTCTGATGGCTTATGGGCTGGAG
GGCTGAAATCAGATTCATTGCCACTAAAGCCAGATGAGGGTCCAGCTGAAGCAAGTAAAGCAAAAGGGGTCATCGTGATAGGAGCAACAGAGTGGGTTGATAACTACAAC
ATCCCGATCTTCTCTACCGATATCGTCGCTGTTCGTAACATTGCAAAGCAAGTCAGTGAGAGAGGAGGTGGGCTTCCTTCAGTTCAAGCAATGGCCCTTGCTCATGATGA
AGGTGTGATTGAGGTGGCTTGTAATTTGCAACAACCAAGTAAAGTGGAAGGAAAAATGGTTCAGCAGGAGGTTGAACGGCTTGCAGAAAGTGAGGGATTAGCTGTGGGGA
AAGACGGAGGACGCCGAATAAGACTGGTAGCCGCTGCAGACATAGTCACTTGGGACACAATCGCAGCACTCCATTTGGGTGACTTTAAAGTTGTTAGAACTCACGGGTTC
ACATCTCCCCCGACCCCCAAAATCCCCCTTTTAAAACCTCACACTCTTCCCCCTAATCTGCAACAAACAACCACTCTATCAGATTCACACCTACAAATGGCTTTCAATCT
CACCGCCAAGGACAAGAGAATAAGCTTGGAGCAAAAGGAACTTCTGTGCTGCAAATACTTCGTTTCTGAATCCCGCAATCGCTCTGTACTGGAAGCCATCGAGAGAGCTG
TAAGCCAAGACCCAGATTCTGTAATCGTCAACAAATTCGAAGACAGAGCTTACAACAGGACCCGATACACCATCGTCTCCTATGTCGTCCACGACGCTAAAGGAAACGCC
ATTTACAGCCCATTGCATCAAACAGTTCTGGCCATGACGGAGGTTGCTTTCGCTAACATCAACCTCGAGTCCCATTCTGGCGCTCACCCTCGTCTTGGAGTCGTCGACGA
CATCATTTTCCATCCCCTGGCTCGGGCGTCACTTCACGAAGCCGCTTGGCTCGCTAAGGCGGTCGCTAAGGACATGGCCACTATGTTTCAAGTGCCTGTATTTCTGTACT
CTGCGGCTCATCCAAGTGGGAAGGCGCCGGACGATTTGAGACGGGAGCTTGGGTATTTCCGGCCGAATTACAATGGGAATCAATGGGCCGGGTGGTCGATGCCGGAAACT
TTGCCGGAAAAGCCTGATGAAGGTCCAAATTCTGTCTCTCGGGAGAGGGGAATCACAATGATCGGTGCGCGTCCGTGGACGGCGATGTACAATGTTCCGATATTGTCGAC
TGATGTGGCAGCTACTCGGAGAATAGCGCGGATGGTGAGTGCTAGAGGAGGAGGGCTGCCGACGGTGCAAACCATAGGGCTTCTTCACGACGATGAGACGACGGAGATAG
CCTGCGTTTTGCTGGAGCCCAACCAGATCGGAGCCGATCGGGTCCAGAGACGGGTGGAGGTTCTTGCCGCCCAATTTGGGTTGGAAGTTGAGAATGGTTATTTCACCGAT
TACTCGCCGGAGATGGTTGTCGAGAAATACTTGAATTTGATCGCCGGAGATCACTGATTACCAATGATTTCTGTAATGGAGATTCGGATTTCGTAATAAAATGGAAATGG
AAATTGTAAAAATACAGAATAATTGAAGCTCTACAATAATCTGCTTTTTCTTTTTTATTTATTTTTTAATATTAATGTAAAAATA
Protein sequenceShow/hide protein sequence
MDDIFDSSLNLEETHLKEGYAEGYKDGLVAGKEEAKHVGLKVGFEVGEEIGFFRGCVDVWKSVILIDSERFSSRVQKSVKQMEELVEQYPLQDPENEQVQELMEGLRLKF
RTICATLGLVTCSREKVMSKLALACCKVYISESRNKAALESIEQAAKLFPEASIINKFTNEVYNRVGYTLVSKLPSKPSIKPCSLRSAVLNMVKAAFSAIDLDSHCGSHP
RLGVVDHTCLYPLAFASLDDAAAIAKSLAADVGCGLQVPTFLYGAAHEEGRKLATIRRELGYFKPNSDGLWAGGLKSDSLPLKPDEGPAEASKAKGVIVIGATEWVDNYN
IPIFSTDIVAVRNIAKQVSERGGGLPSVQAMALAHDEGVIEVACNLQQPSKVEGKMVQQEVERLAESEGLAVGKDGGRRIRLVAAADIVTWDTIAALHLGDFKVVRTHGF
TSPPTPKIPLLKPHTLPPNLQQTTTLSDSHLQMAFNLTAKDKRISLEQKELLCCKYFVSESRNRSVLEAIERAVSQDPDSVIVNKFEDRAYNRTRYTIVSYVVHDAKGNA
IYSPLHQTVLAMTEVAFANINLESHSGAHPRLGVVDDIIFHPLARASLHEAAWLAKAVAKDMATMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYNGNQWAGWSMPET
LPEKPDEGPNSVSRERGITMIGARPWTAMYNVPILSTDVAATRRIARMVSARGGGLPTVQTIGLLHDDETTEIACVLLEPNQIGADRVQRRVEVLAAQFGLEVENGYFTD
YSPEMVVEKYLNLIAGDH