CuGenDBv2

Tan0010457 (gene) of Snake gourd v1 genome

Gene ID: Tan0010457
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BED-type domain-containing protein
Genome location: LG08:68075215..68080513
RNA-Seq Expression: Tan0010457
Synteny: Tan0010457
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
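
The genome location above is written as `sequence:start..end`. As a rough sketch, the span of the gene on LG08 can be derived from that string. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(location):
    """Split a 'LG08:68075215..68080513' style string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

seq_id, start, end = parse_location("LG08:68075215..68080513")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the genomic span is longer than the CDS listed further down, which would be consistent with the gene containing non-coding sequence such as introns.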


Homology
GenBank top hits (e value, % identity, alignment)
CAN60848.1 hypothetical protein VITISV_023570 [Vitis vinifera] (e value: 1.7e-78; % identity: 33.59)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------
        GA+Q  MG+EPP+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGP KLSIINFMVYS                                 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------EEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNR
                                     +E+ES + +  +W+ V  +V+I+E +YT LRIVDSEVVPTM  +Y LI  +K  + ++  + WV +II +R
Subjt:  -----------------------------EEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNR

Query:  WDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDMVD
        WD+TL HPLHAA  WW MYG  APT+++LAI+VLSQTAS SACERNWSTF L+HTKQRN                     RD+ A  +KVAE DY+D++D
Subjt:  WDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDMVD

Query:  VVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDD
        +  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++++QV++ EV                   +S+      R+ + +  +T+    DST  
Subjt:  VVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDD

Query:  DNDD----DNTSDGGGDTNVGISTGYQADEG----------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLS
        ++        TS  G D + G  T   +D G          QS  P    F+CE DF H TQDEDHGSR  G G+ AIGK + R R       +E+SL +
Subjt:  DNDD----DNTSDGGGDTNVGISTGYQADEG----------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLS

Query:  SMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSY
        S E+MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y
Subjt:  SMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSY

CAN71355.1 hypothetical protein VITISV_034905 [Vitis vinifera] (e value: 2.7e-79; % identity: 32.29)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------
        GA+Q  MG+EPP+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGPTKLSIINFMVYS                                 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------

Query:  --------------------------------------------------------------------------------------------EEIESRVL
                                                                                                    +E+ES + 
Subjt:  --------------------------------------------------------------------------------------------EEIESRVL

Query:  NTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------
        +  +W+ V  +V+I+E +YT LRIVDSEVVPTMP +Y LI  +K  + ++  + WV +II +RWD+TL HPLHAA                         
Subjt:  NTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------

Query:  --------------------------------------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------
                                                      WW MYG  APT+++LAI+VLSQTASSSACERNWSTF L+HTKQRN         
Subjt:  --------------------------------------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------

Query:  ------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICL
                    RDM A  +KVAE DY+D++D+  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++V++V++ EV                
Subjt:  ------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICL

Query:  DGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIA
           +S+      ++ + +  +T+    DST  ++    ++ G G TN G   G   DEG      QS  P    F+CE DF H TQDEDH SR  G G+ 
Subjt:  DGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIA

Query:  AIGKLFHRQRDTGYFYSDENSLLSSMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM-----
        AIGK + R R       +E+SL +S E+MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y   Q Q    +       
Subjt:  AIGKLFHRQRDTGYFYSDENSLLSSMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM-----

Query:  -YPRIGSICW-PYESYRFHVDHYIQHFQYRMTWEQYC
         +P  G +     E Y +HV  Y Q+++  M+W +YC
Subjt:  -YPRIGSICW-PYESYRFHVDHYIQHFQYRMTWEQYC

CAN75936.1 hypothetical protein VITISV_034804 [Vitis vinifera] (e value: 2.3e-78; % identity: 33.33)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVY----------------------------------
        GA+Q  MG+E P+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGPTKLSIINFMVY                                  
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVY----------------------------------

Query:  -------------------------------------------------------------SEEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVV
                                                                      +E+ES + +  +W+ V  +V+I+E +YT LRIVDSEVV
Subjt:  -------------------------------------------------------------SEEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVV

Query:  PTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------------------------------------
        PTMP +Y LI  +K  + ++  + WV +II +RWD+TL HPLHAA                                                       
Subjt:  PTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------------------------------------

Query:  --------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDM
                        WW MYG  APT+++LAI+VLSQTASSS CERNWSTF L+HTKQRN                     RDM A  +KVAE DY+D+
Subjt:  --------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDM

Query:  VDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVR---MSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDED
        +D+  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++V++V++ E+     S D  + +Q       G S+  +   R       + +    
Subjt:  VDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVR---MSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDED

Query:  DSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSME
         +T     D + S G G TN G   G   DEG      QS  P    F+C+ DF H TQDEDHGSR  G G+ AIGK + R R       +E+SL +S E
Subjt:  DSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSME

Query:  AMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM------YPRIGSICW-PYESYRFHVDHYIQHF
        +MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y   Q Q    +        +P  G +     E Y +HV  Y Q++
Subjt:  AMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM------YPRIGSICW-PYESYRFHVDHYIQHF

Query:  QYRMT
        +  M+
Subjt:  QYRMT

RVW84907.1 hypothetical protein CK203_039480 [Vitis vinifera] (e value: 6.0e-79; % identity: 32.04)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSE--------------------------------
        GA+Q  MG+EPP+PYE +NKYL ME++EME YV +Q++KW+TYGCTIMSDGWTGPTKLSIINFMVYS+                                
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSE--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------
               E+E  + +  +WD + +IV+++EP+Y  LR+VDSEVVPTMP +Y L+  +K  + +     W+ +IIQ+RW KTL HPLHAA +         
Subjt:  -------EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------

Query:  ------------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVH
                                                                    WW MYG   PT++KLAI+VLSQTASSSACERNWSTF L+H
Subjt:  ------------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVH

Query:  TKQRN---------------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV--
        TKQRN                     RDM A +++VAE DY+D++D+ A+  ++ D+ LFQWVRP+HLDDE GNPDP IA  A E G+NVE+V++ EV  
Subjt:  TKQRN---------------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV--

Query:  ----RMSDDQ----LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFE
            + +DD     L+ +Q       G+S       R        +  D      DD  D    +GGGD           DE + S   +S F+CE DF 
Subjt:  ----RMSDDQ----LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFE

Query:  HVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQ
        H TQDEDHGSR  G GI AIG+ + R R+      +E  L  S E+MSI      ++N    Y     SY     S+  E+  S++  S Q P       
Subjt:  HVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQ

Query:  SQGPSIEEM----YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC
         +G  +       YP  + ++    E Y +HV  Y Q+++  +TW QYC
Subjt:  SQGPSIEEM----YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC

RVW88488.1 hypothetical protein CK203_043879 [Vitis vinifera] (e value: 9.9e-82; % identity: 35.95)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLS------------IINFMVY---------------SEEIESR
        GA+Q  MG+EPP+PYE +NKYL ME++EME YV +Q++KW+TYGCTIMSDGWTGPTKL+            +  + +Y                 ++ S 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLS------------IINFMVY---------------SEEIESR

Query:  VLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------------------
        V+ TLF   ++  V+++EP+Y  LR+VDSE VPTMP +Y L+  +K  + +     W+ +IIQ+RW KTL HPLHAA +                     
Subjt:  VLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------------------

Query:  ------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN-------
                                                        WW MYG   PT++KLAI+VLSQTASSSACERNWSTF L+HTKQRN       
Subjt:  ------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN-------

Query:  --------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV------RMSDDQ--
                      RDM A +++VAE DY+D++D+ A+  ++ D+ LFQWVRP+HLDD+ GNPDP IA  A E G+NVE+V++ EV      + +DD   
Subjt:  --------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV------RMSDDQ--

Query:  --LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFEHVTQDEDHGSRE
          L+ +Q       G+S       R        +  D      DD  D    +GGGD           DE + S   +S F+CE DF H TQDEDHGSR 
Subjt:  --LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFEHVTQDEDHGSRE

Query:  GGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQSQGPSIEEM---
         G GI AIG+ + R R+      +E  L  S E+MSI      ++N    Y     SY     S+  E+  S++  S Q P        +G  +      
Subjt:  GGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQSQGPSIEEM---

Query:  -YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC
         YP  + ++    E Y +HV  Y  +++  +TW QYC
Subjt:  -YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC

TrEMBL top hits (e value, % identity, alignment)
A0A438HKF1 Uncharacterized protein (e value: 2.9e-79; % identity: 32.04)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSE--------------------------------
        GA+Q  MG+EPP+PYE +NKYL ME++EME YV +Q++KW+TYGCTIMSDGWTGPTKLSIINFMVYS+                                
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSE--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------
               E+E  + +  +WD + +IV+++EP+Y  LR+VDSEVVPTMP +Y L+  +K  + +     W+ +IIQ+RW KTL HPLHAA +         
Subjt:  -------EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------

Query:  ------------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVH
                                                                    WW MYG   PT++KLAI+VLSQTASSSACERNWSTF L+H
Subjt:  ------------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVH

Query:  TKQRN---------------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV--
        TKQRN                     RDM A +++VAE DY+D++D+ A+  ++ D+ LFQWVRP+HLDDE GNPDP IA  A E G+NVE+V++ EV  
Subjt:  TKQRN---------------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV--

Query:  ----RMSDDQ----LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFE
            + +DD     L+ +Q       G+S       R        +  D      DD  D    +GGGD           DE + S   +S F+CE DF 
Subjt:  ----RMSDDQ----LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFE

Query:  HVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQ
        H TQDEDHGSR  G GI AIG+ + R R+      +E  L  S E+MSI      ++N    Y     SY     S+  E+  S++  S Q P       
Subjt:  HVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQ

Query:  SQGPSIEEM----YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC
         +G  +       YP  + ++    E Y +HV  Y Q+++  +TW QYC
Subjt:  SQGPSIEEM----YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC

A0A438HVM1 Uncharacterized protein (e value: 4.8e-82; % identity: 35.95)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLS------------IINFMVY---------------SEEIESR
        GA+Q  MG+EPP+PYE +NKYL ME++EME YV +Q++KW+TYGCTIMSDGWTGPTKL+            +  + +Y                 ++ S 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLS------------IINFMVY---------------SEEIESR

Query:  VLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------------------
        V+ TLF   ++  V+++EP+Y  LR+VDSE VPTMP +Y L+  +K  + +     W+ +IIQ+RW KTL HPLHAA +                     
Subjt:  VLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGH---------------------

Query:  ------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN-------
                                                        WW MYG   PT++KLAI+VLSQTASSSACERNWSTF L+HTKQRN       
Subjt:  ------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN-------

Query:  --------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV------RMSDDQ--
                      RDM A +++VAE DY+D++D+ A+  ++ D+ LFQWVRP+HLDD+ GNPDP IA  A E G+NVE+V++ EV      + +DD   
Subjt:  --------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEV------RMSDDQ--

Query:  --LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFEHVTQDEDHGSRE
          L+ +Q       G+S       R        +  D      DD  D    +GGGD           DE + S   +S F+CE DF H TQDEDHGSR 
Subjt:  --LNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFEHVTQDEDHGSRE

Query:  GGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQSQGPSIEEM---
         G GI AIG+ + R R+      +E  L  S E+MSI      ++N    Y     SY     S+  E+  S++  S Q P        +G  +      
Subjt:  GGEGIAAIGKLFHRQRDTGYFYSDENSLLSSMEAMSI-----SNNNTQQDYGQGHYSYSNYDDSSSREH--SNFQSSFQ-PQSYSSSQSQGPSIEEM---

Query:  -YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC
         YP  + ++    E Y +HV  Y  +++  +TW QYC
Subjt:  -YP-RIGSICWPYESYRFHVDHYIQHFQYRMTWEQYC

A5AII9 BED-type domain-containing protein (e value: 1.3e-79; % identity: 32.29)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------
        GA+Q  MG+EPP+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGPTKLSIINFMVYS                                 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------

Query:  --------------------------------------------------------------------------------------------EEIESRVL
                                                                                                    +E+ES + 
Subjt:  --------------------------------------------------------------------------------------------EEIESRVL

Query:  NTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------
        +  +W+ V  +V+I+E +YT LRIVDSEVVPTMP +Y LI  +K  + ++  + WV +II +RWD+TL HPLHAA                         
Subjt:  NTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------

Query:  --------------------------------------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------
                                                      WW MYG  APT+++LAI+VLSQTASSSACERNWSTF L+HTKQRN         
Subjt:  --------------------------------------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------

Query:  ------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICL
                    RDM A  +KVAE DY+D++D+  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++V++V++ EV                
Subjt:  ------------RDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICL

Query:  DGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIA
           +S+      ++ + +  +T+    DST  ++    ++ G G TN G   G   DEG      QS  P    F+CE DF H TQDEDH SR  G G+ 
Subjt:  DGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIA

Query:  AIGKLFHRQRDTGYFYSDENSLLSSMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM-----
        AIGK + R R       +E+SL +S E+MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y   Q Q    +       
Subjt:  AIGKLFHRQRDTGYFYSDENSLLSSMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM-----

Query:  -YPRIGSICW-PYESYRFHVDHYIQHFQYRMTWEQYC
         +P  G +     E Y +HV  Y Q+++  M+W +YC
Subjt:  -YPRIGSICW-PYESYRFHVDHYIQHFQYRMTWEQYC

A5AQW8 BED-type domain-containing protein (e value: 1.1e-78; % identity: 33.33)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVY----------------------------------
        GA+Q  MG+E P+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGPTKLSIINFMVY                                  
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVY----------------------------------

Query:  -------------------------------------------------------------SEEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVV
                                                                      +E+ES + +  +W+ V  +V+I+E +YT LRIVDSEVV
Subjt:  -------------------------------------------------------------SEEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVV

Query:  PTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------------------------------------
        PTMP +Y LI  +K  + ++  + WV +II +RWD+TL HPLHAA                                                       
Subjt:  PTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAA-------------------------------------------------------

Query:  --------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDM
                        WW MYG  APT+++LAI+VLSQTASSS CERNWSTF L+HTKQRN                     RDM A  +KVAE DY+D+
Subjt:  --------------GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDM

Query:  VDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVR---MSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDED
        +D+  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++V++V++ E+     S D  + +Q       G S+  +   R       + +    
Subjt:  VDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVR---MSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDED

Query:  DSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSME
         +T     D + S G G TN G   G   DEG      QS  P    F+C+ DF H TQDEDHGSR  G G+ AIGK + R R       +E+SL +S E
Subjt:  DSTDDDNDDDNTSDGGGDTNVGISTGYQADEG------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLSSME

Query:  AMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM------YPRIGSICW-PYESYRFHVDHYIQHF
        +MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y   Q Q    +        +P  G +     E Y +HV  Y Q++
Subjt:  AMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEM------YPRIGSICW-PYESYRFHVDHYIQHF

Query:  QYRMT
        +  M+
Subjt:  QYRMT

A5B1X7 Uncharacterized protein (e value: 8.4e-79; % identity: 33.59)
Query:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------
        GA+Q  MG+EPP+PYE ++KYL+ME+++ME YV  Q++KW+TYGCTIMSDGWTGP KLSIINFMVYS                                 
Subjt:  GAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYS---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------EEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNR
                                     +E+ES + +  +W+ V  +V+I+E +YT LRIVDSEVVPTM  +Y LI  +K  + ++  + WV +II +R
Subjt:  -----------------------------EEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNR

Query:  WDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDMVD
        WD+TL HPLHAA  WW MYG  APT+++LAI+VLSQTAS SACERNWSTF L+HTKQRN                     RD+ A  +KVAE DY+D++D
Subjt:  WDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRN---------------------RDMGARDEKVAEADYIDMVD

Query:  VVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDD
        +  +  ++ D+ LFQWVRP+HLDDE GNPDP IA    E G++++QV++ EV                   +S+      R+ + +  +T+    DST  
Subjt:  VVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSDDQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDD

Query:  DNDD----DNTSDGGGDTNVGISTGYQADEG----------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLS
        ++        TS  G D + G  T   +D G          QS  P    F+CE DF H TQDEDHGSR  G G+ AIGK + R R       +E+SL +
Subjt:  DNDD----DNTSDGGGDTNVGISTGYQADEG----------QSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGKLFHRQRDTGYFYSDENSLLS

Query:  SMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSY
        S E+MS+      S+N           YGQ   + S+  D    E  N+ SS Q   Y
Subjt:  SMEAMSI------SNNNTQ------QDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43260.1 hAT transposon superfamily protein (e value: 1.8e-04; % identity: 31.43)
Query:  EGAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSEE
        E A Q   GV PP+ Y+ +   L  E   M+  ++EQ+ +W   GC++ +D W+   + SI+N  +  +E
Subjt:  EGAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSEE

AT1G79740.1 hAT transposon superfamily (e value: 1.3e-07; % identity: 34.91)
Query:  GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRNRDMGARDEKVAEADYID-------MVDVVADPNDDGD-DSLFQWVRPVHLDDE
        G WW  +G SAP +Q++AIR+LSQ  S    ER WSTF  +H ++RN+      E + +  Y++       M+ +  DP    D D + +WV      +E
Subjt:  GHWWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTKQRNRDMGARDEKVAEADYID-------MVDVVADPNDDGD-DSLFQWVRPVHLDDE

Query:  SGNPDP
        + NP P
Subjt:  SGNPDP

AT3G17450.1 hAT dimerisation domain-containing protein (e value: 5.2e-04; % identity: 40.48)
Query:  WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTK
        WW  +G S   +Q++A+R+LS T SS  CE  WS +  V+++
Subjt:  WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHTK

AT5G31412.1 hAT transposon superfamily protein (e value: 3.5e-08; % identity: 32.71)
Query:  VNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQV-----KGRGWVNQIIQNRWDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSAC
        V  F P+   LR+ D E  P +  I+  +  ++ +I+       +    +  II  +    L      + +WW +YG   PT+Q+LAI++LS T SSS+ 
Subjt:  VNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQV-----KGRGWVNQIIQNRWDKTLSHPLHAAGHWWGMYGRSAPTVQKLAIRVLSQTASSSAC

Query:  ERNWSTF
        ERNWS F
Subjt:  ERNWSTF

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related (e value: 2.5e-14; % identity: 27.32)
Query:  EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRIS-----QVKGRGWVNQIIQNRWDKTLSHPLHAAGH-----------
        +I+S      FW +V   + +  P+   LR+VD E  P M  IY  ++  K  I      + +      +II  RWD  L  PLHAAG+           
Subjt:  EIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRIS-----QVKGRGWVNQIIQNRWDKTLSHPLHAAGH-----------

Query:  -----------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHT
                                                                   WW  YG S P +Q  AI+VLS T S++ CERNW  F L+HT
Subjt:  -----------------------------------------------------------WWGMYGRSAPTVQKLAIRVLSQTASSSACERNWSTFGLVHT

Query:  KQRNR
        K+RNR
Subjt:  KQRNR
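
The % identity values reported for each hit can be related to the middle line of an alignment block. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes that the percentage is identities divided by the number of aligned columns. It uses the single-block AT1G43260.1 alignment from this page; the function name `percent_identity` is hypothetical.

```python
# Query and middle line copied from the AT1G43260.1 alignment block.
query   = "EGAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWTGPTKLSIINFMVYSEE"
midline = "E A Q   GV PP+ Y+ +   L  E   M+  ++EQ+ +W   GC++ +D W+   + SI+N  +  +E"

def percent_identity(query, midline):
    """Percent identity = identical columns / aligned columns * 100."""
    # Assumption: only letters on the middle line denote identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(query)

identities = sum(ch.isalpha() for ch in midline)
pct = percent_identity(query, midline)
print(identities, len(query), f"{pct:.2f}")
```

For this block the calculation gives 22 identities over 70 columns, which agrees with the 31.43 listed for AT1G43260.1. Hits whose alignments span several blocks would need the counts summed across all blocks first.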


Sequences
CDS sequence
ATGCCTATGAGCTACTCCGAAAGAAAACAATATCGTGAAGTTATACGTAACTCAAGAAAGACGACCCAAGAAGAGGAAGACATGCGTAGGAGTGGACTTGAGCGTGGTCA
AGGCTCTAATCAGTTCCAACGGTCTAGTAGCATGAAGGAGCCATCGTACTACAAGTCTGAAGGAGCCAAACAAAAAAGTATGGGAGTTGAACCACCTACACCGTACGAAA
CTCAAAATAAATATTTGAATATGGAGTTTCAAGAGATGGAAAAGTATGTGCAGGAGCAGAAAAAGAAGTGGGAAACTTATGGCTGCACGATTATGTCTGATGGATGGACA
GGACCAACAAAACTATCCATCATCAATTTTATGGTCTATTCTGAGGAAATTGAATCTCGCGTATTAAATACCTTATTTTGGGATCATGTGGAGAGCATAGTTAATATTTT
TGAGCCAATGTACACTGCTCTCCGAATAGTCGACTCGGAGGTTGTTCCAACAATGCCTGTTATATATAGCTTGATTGAAAATTTGAAATATAGAATAAGTCAGGTTAAAG
GCAGAGGTTGGGTCAACCAAATAATACAAAATCGATGGGACAAGACACTTTCTCATCCTCTTCATGCTGCAGGTCATTGGTGGGGCATGTATGGTAGAAGTGCCCCTACA
GTCCAAAAACTAGCCATAAGGGTTTTATCTCAAACGGCTTCGTCATCTGCTTGTGAGAGAAACTGGAGTACGTTTGGGTTAGTACATACAAAACAACGTAATCGAGATAT
GGGAGCAAGAGATGAGAAGGTAGCAGAAGCAGATTATATCGATATGGTAGATGTTGTGGCAGATCCAAATGATGATGGAGATGACTCACTTTTTCAATGGGTTCGACCAG
TACATTTGGATGATGAGTCTGGCAATCCAGATCCCGAGATTGCCAACATGGCTGCAGAGACTGGAATTAATGTAGAGCAAGTCATGAATAATGAAGTTAGGATGTCTGAC
GATCAGTTAAATGACTATCAATTCAATATTTGTTTGGATGGTGGTTACTCTGAGGCAGAACTTCTAAAAGTGAGAAAAAAACTTTGTAAAGGTAAGTTGACAAATGATGA
TGAAGATGACAGTACAGACGATGACAATGATGATGATAATACATCTGATGGAGGAGGTGATACAAATGTTGGGATCTCAACTGGCTACCAAGCTGATGAAGGACAATCTA
GTGTACCAGGATTAAGTCCTTTTTCATGTGAGGCTGATTTTGAACATGTCACCCAAGACGAGGATCATGGATCTCGAGAGGGTGGTGAGGGGATTGCTGCTATTGGTAAA
CTATTTCATCGACAAAGGGATACAGGATACTTTTATTCTGATGAGAATTCCTTACTTTCAAGTATGGAAGCGATGAGCATCAGCAACAATAATACACAACAAGACTATGG
TCAAGGTCATTATTCGTACTCAAATTATGATGATTCTTCAAGTCGTGAACATAGCAACTTTCAGTCGTCCTTTCAACCACAGAGTTATTCATCATCTCAAAGTCAAGGAC
CATCGATTGAAGAGATGTATCCACGAATAGGGTCCATATGTTGGCCGTACGAGTCATATCGTTTTCATGTTGACCATTATATTCAACATTTTCAATATAGGATGACTTGG
GAGCAATATTGTAATTTCTATCAGAATTACCAAGATTCTGGGTTAGATGCTCCAAGGTCATCTTTTGGTATTAGAAATGTAATGTTCACATATTATTTAATCTAA
mRNA sequence
ATGCCTATGAGCTACTCCGAAAGAAAACAATATCGTGAAGTTATACGTAACTCAAGAAAGACGACCCAAGAAGAGGAAGACATGCGTAGGAGTGGACTTGAGCGTGGTCA
AGGCTCTAATCAGTTCCAACGGTCTAGTAGCATGAAGGAGCCATCGTACTACAAGTCTGAAGGAGCCAAACAAAAAAGTATGGGAGTTGAACCACCTACACCGTACGAAA
CTCAAAATAAATATTTGAATATGGAGTTTCAAGAGATGGAAAAGTATGTGCAGGAGCAGAAAAAGAAGTGGGAAACTTATGGCTGCACGATTATGTCTGATGGATGGACA
GGACCAACAAAACTATCCATCATCAATTTTATGGTCTATTCTGAGGAAATTGAATCTCGCGTATTAAATACCTTATTTTGGGATCATGTGGAGAGCATAGTTAATATTTT
TGAGCCAATGTACACTGCTCTCCGAATAGTCGACTCGGAGGTTGTTCCAACAATGCCTGTTATATATAGCTTGATTGAAAATTTGAAATATAGAATAAGTCAGGTTAAAG
GCAGAGGTTGGGTCAACCAAATAATACAAAATCGATGGGACAAGACACTTTCTCATCCTCTTCATGCTGCAGGTCATTGGTGGGGCATGTATGGTAGAAGTGCCCCTACA
GTCCAAAAACTAGCCATAAGGGTTTTATCTCAAACGGCTTCGTCATCTGCTTGTGAGAGAAACTGGAGTACGTTTGGGTTAGTACATACAAAACAACGTAATCGAGATAT
GGGAGCAAGAGATGAGAAGGTAGCAGAAGCAGATTATATCGATATGGTAGATGTTGTGGCAGATCCAAATGATGATGGAGATGACTCACTTTTTCAATGGGTTCGACCAG
TACATTTGGATGATGAGTCTGGCAATCCAGATCCCGAGATTGCCAACATGGCTGCAGAGACTGGAATTAATGTAGAGCAAGTCATGAATAATGAAGTTAGGATGTCTGAC
GATCAGTTAAATGACTATCAATTCAATATTTGTTTGGATGGTGGTTACTCTGAGGCAGAACTTCTAAAAGTGAGAAAAAAACTTTGTAAAGGTAAGTTGACAAATGATGA
TGAAGATGACAGTACAGACGATGACAATGATGATGATAATACATCTGATGGAGGAGGTGATACAAATGTTGGGATCTCAACTGGCTACCAAGCTGATGAAGGACAATCTA
GTGTACCAGGATTAAGTCCTTTTTCATGTGAGGCTGATTTTGAACATGTCACCCAAGACGAGGATCATGGATCTCGAGAGGGTGGTGAGGGGATTGCTGCTATTGGTAAA
CTATTTCATCGACAAAGGGATACAGGATACTTTTATTCTGATGAGAATTCCTTACTTTCAAGTATGGAAGCGATGAGCATCAGCAACAATAATACACAACAAGACTATGG
TCAAGGTCATTATTCGTACTCAAATTATGATGATTCTTCAAGTCGTGAACATAGCAACTTTCAGTCGTCCTTTCAACCACAGAGTTATTCATCATCTCAAAGTCAAGGAC
CATCGATTGAAGAGATGTATCCACGAATAGGGTCCATATGTTGGCCGTACGAGTCATATCGTTTTCATGTTGACCATTATATTCAACATTTTCAATATAGGATGACTTGG
GAGCAATATTGTAATTTCTATCAGAATTACCAAGATTCTGGGTTAGATGCTCCAAGGTCATCTTTTGGTATTAGAAATGTAATGTTCACATATTATTTAATCTAA
Protein sequence
MPMSYSERKQYREVIRNSRKTTQEEEDMRRSGLERGQGSNQFQRSSSMKEPSYYKSEGAKQKSMGVEPPTPYETQNKYLNMEFQEMEKYVQEQKKKWETYGCTIMSDGWT
GPTKLSIINFMVYSEEIESRVLNTLFWDHVESIVNIFEPMYTALRIVDSEVVPTMPVIYSLIENLKYRISQVKGRGWVNQIIQNRWDKTLSHPLHAAGHWWGMYGRSAPT
VQKLAIRVLSQTASSSACERNWSTFGLVHTKQRNRDMGARDEKVAEADYIDMVDVVADPNDDGDDSLFQWVRPVHLDDESGNPDPEIANMAAETGINVEQVMNNEVRMSD
DQLNDYQFNICLDGGYSEAELLKVRKKLCKGKLTNDDEDDSTDDDNDDDNTSDGGGDTNVGISTGYQADEGQSSVPGLSPFSCEADFEHVTQDEDHGSREGGEGIAAIGK
LFHRQRDTGYFYSDENSLLSSMEAMSISNNNTQQDYGQGHYSYSNYDDSSSREHSNFQSSFQPQSYSSSQSQGPSIEEMYPRIGSICWPYESYRFHVDHYIQHFQYRMTW
EQYCNFYQNYQDSGLDAPRSSFGIRNVMFTYYLI
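
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code applies (the page does not say which translation table was used), the first ten codons of the CDS can be translated and compared with the start of the protein sequence. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGCCTATGAGCTACTCCGAAAGAAAACAA"
protein_start = translate(cds_start)
print(protein_start)
```

The result, `MPMSYSERKQ`, matches the first ten residues of the protein sequence. Applying `translate` to the full CDS would end in `*`, since the sequence terminates with the stop codon `TAA`.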