; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G05690 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G05690
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionVARLMGL domain-containing protein
Genome locationClcChr01:5347600..5352120
RNA-Seq ExpressionClc01G05690
SyntenyClc01G05690
Gene Ontology termsNA
InterPro domainsIPR032795 - DUF3741-associated sequence motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055152.1 putative dna repair [Cucumis melo var. makuwa]5.0e-6150.53Show/hide
Query:  MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGF
        MGLD  PQ  SSS+   G N  TRSL ESPR SS+RLS+VDCHHRRLSLQI I +KEN  I   E+  KREK+KV +KVALVDITN+N KI +  QE+G 
Subjt:  MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGF

Query:  S--QIKVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVE
        S    KVE+KS KKL+KT V ES  +SKVV  NQ+  M+SKKQKLISM MQ  K R  E EAFDCPT+N L+  LHH TIF    EP             
Subjt:  S--QIKVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVE

Query:  VYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR
                    Y KGK KPA   GGE +AVD TTTTDG ST+F+YIK I +++  NSN ++        PS FHH E   A ++++W +RL     ++ 
Subjt:  VYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR

Query:  GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS
        G+ G   G+   R+R  +   FP AK  +V      ED++   +   + EE EGIVK+VELHILDSLLRET+ALIS
Subjt:  GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS

KAG6605686.1 hypothetical protein SDJN03_03003, partial [Cucurbita argyrosperma subsp. sororia]5.7e-4935.57Show/hide
Query:  EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQ
        +W FGG SS RRA   D  + R+RPSLPSC                                            V APRNSLE  G        +E+N Q
Subjt:  EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQ

Query:  LQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD
        +QMGL+I T               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS+RLSDVD HH R SL I  D
Subjt:  LQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD

Query:  KENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS----MSMQKPKRRA
         EN    +E K+E+E+V +KVALVDITNNN K+ +GK                                  KNQ+V + +K   IS         PKR+ 
Subjt:  KENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS----MSMQKPKRRA

Query:  REGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNS
        R+                        KK+     ++ P+ +             ++  GK +PA  + G R      +T DGG+ E +YIKRILT+    
Subjt:  REGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNS

Query:  NSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI---INK-R
        +     P NP NPSIFHH E +                +D+  G  ++NC      MKGWEL                RAK  V+ED+D+LI   + K +
Subjt:  NSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI---INK-R

Query:  VVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF
         V E EG+V+  + HILDSLLRET A I SL K   F
Subjt:  VVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF

KAG7035594.1 hypothetical protein SDJN02_02391, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-6339.63Show/hide
Query:  EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQ
        +W FGG SS RRA      R R+RPSLPSCM+TLFHFFD  S   TH+  + H PSS         L       V APRNSLE  G        +E+N Q
Subjt:  EWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQ

Query:  LQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD
        +QMGL+I T               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS+RLSDVD HH R SL I  D
Subjt:  LQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD

Query:  KENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS--MSMQKPKRRARE
         EN    +E K+E+E+V +KVALVDITNNN K+ +GK                                  KNQ+V + +K   IS       PKR+ R 
Subjt:  KENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLIS--MSMQKPKRRARE

Query:  GEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNS
                          +T    K+E SPP     R + + +    C    ++  GK +PA  + G R       T DGG+ E +YIKRILT+    + 
Subjt:  GEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNS

Query:  IISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVV
            P NP NPSIFHH E +                +D+  G  ++NC      MKGWEL                RAK  V+ED+D+LI   + K + V
Subjt:  IISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVV

Query:  EETEGIVKVVELHILDSLLRETVALISSLPKCSHF
         E EG+V+  + HILDSLLRET A I SL K   F
Subjt:  EETEGIVKVVELHILDSLLRETVALISSLPKCSHF

XP_011656164.1 uncharacterized protein LOC105435648 [Cucumis sativus]8.1e-12057.09Show/hide
Query:  MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASIS
        MGK E W+FGGR SSSRR TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH  SSF+L HH P L   KASHH VEAPRNSLEL+ G SIS
Subjt:  MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASIS

Query:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGTRSLPESPRVSSARLSDVDCH
        CLRNKE+NLQLQMGLQIKTRNGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GTRSL ESPR S +RLSDVD H
Subjt:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGTRSLPESPRVSSARLSDVDCH

Query:  HRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALVDITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ
        HRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+DITN+  K+    QE+G SQ  KVE+KS KKLKKT  ++S  S  V R NQ+ V++S KQ
Subjt:  HRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALVDITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ

Query:  KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGST
        K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF    +P                         Y KGKAK A   GGE NAVD  TTTDGGS 
Subjt:  KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGST

Query:  EFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKEYFPRAKYEV
        EF+YIK I +++  NSN ++ P       S F+H  A E+++W +R+                  W  +RG K GWE               FP  K+E+
Subjt:  EFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKEYFPRAKYEV

Query:  VEDMDALIINKR--------VVEETEGIVKVVELHILDSLLRE-TVALIS
        VE     +INK         + EE EGIVK+VELHILDSLLRE T +LIS
Subjt:  VEDMDALIINKR--------VVEETEGIVKVVELHILDSLLRE-TVALIS

XP_022958521.1 uncharacterized protein LOC111459727 [Cucurbita moschata]9.8e-4938.27Show/hide
Query:  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNC
        H P LP      V APRNSLE  G        +E+N Q+QMGL+I T               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S    
Subjt:  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNC

Query:  RGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDE
              TRSLP SPRVSS RLSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK                         
Subjt:  RGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDE

Query:  SRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA
                 KNQ+V + +K   IS       PKR+ R                   +T    K+E SPP     R +             ++  GK +PA
Subjt:  SRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA

Query:  GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGE
          + G R       T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +                +D+  G  ++NC      MKGWEL  
Subjt:  GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGE

Query:  EAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF
                      RAK  V++D+D+LI   + K + V E EG+V+  E HILDSLLRET A I SL K   F
Subjt:  EAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF

TrEMBL top hitse value%identityAlignment
A0A0A0KNC2 VARLMGL domain-containing protein3.9e-12057.09Show/hide
Query:  MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASIS
        MGK E W+FGGR SSSRR TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH  SSF+L HH P L   KASHH VEAPRNSLEL+ G SIS
Subjt:  MGKVE-WHFGGR-SSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASIS

Query:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGTRSLPESPRVSSARLSDVDCH
        CLRNKE+NLQLQMGLQIKTRNGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GTRSL ESPR S +RLSDVD H
Subjt:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGTRSLPESPRVSSARLSDVDCH

Query:  HRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALVDITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ
        HRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+DITN+  K+    QE+G SQ  KVE+KS KKLKKT  ++S  S  V R NQ+ V++S KQ
Subjt:  HRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALVDITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ

Query:  KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGST
        K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF    +P                         Y KGKAK A   GGE NAVD  TTTDGGS 
Subjt:  KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGST

Query:  EFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKEYFPRAKYEV
        EF+YIK I +++  NSN ++ P       S F+H  A E+++W +R+                  W  +RG K GWE               FP  K+E+
Subjt:  EFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKEYFPRAKYEV

Query:  VEDMDALIINKR--------VVEETEGIVKVVELHILDSLLRE-TVALIS
        VE     +INK         + EE EGIVK+VELHILDSLLRE T +LIS
Subjt:  VEDMDALIINKR--------VVEETEGIVKVVELHILDSLLRE-TVALIS

A0A5D3BMU7 Putative dna repair2.4e-6150.53Show/hide
Query:  MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGF
        MGLD  PQ  SSS+   G N  TRSL ESPR SS+RLS+VDCHHRRLSLQI I +KEN  I   E+  KREK+KV +KVALVDITN+N KI +  QE+G 
Subjt:  MGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSKKVALVDITNNNRKIEFGKQEVGF

Query:  S--QIKVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVE
        S    KVE+KS KKL+KT V ES  +SKVV  NQ+  M+SKKQKLISM MQ  K R  E EAFDCPT+N L+  LHH TIF    EP             
Subjt:  S--QIKVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVE

Query:  VYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR
                    Y KGK KPA   GGE +AVD TTTTDG ST+F+YIK I +++  NSN ++        PS FHH E   A ++++W +RL     ++ 
Subjt:  VYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSIFHHPE---AAEDQQWGRRLLNCWHVRR

Query:  GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS
        G+ G   G+   R+R  +   FP AK  +V      ED++   +   + EE EGIVK+VELHILDSLLRET+ALIS
Subjt:  GMKGWELGEEAVRERSVKKEYFPRAKYEVV------EDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS

A0A6J1H2A5 uncharacterized protein LOC1114597274.7e-4938.27Show/hide
Query:  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNC
        H P LP      V APRNSLE  G        +E+N Q+QMGL+I T               N DH  AL+SPS KTPNLLARLMGLDILPQ  +S    
Subjt:  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNC

Query:  RGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDE
              TRSLP SPRVSS RLSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK                         
Subjt:  RGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDE

Query:  SRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA
                 KNQ+V + +K   IS       PKR+ R                   +T    K+E SPP     R +             ++  GK +PA
Subjt:  SRRSSKVVRKNQEVMISKKQKLISMS--MQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPA

Query:  GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGE
          + G R       T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +                +D+  G  ++NC      MKGWEL  
Subjt:  GRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA----------------EDQQWGRRLLNCWHVRRGMKGWELGE

Query:  EAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF
                      RAK  V++D+D+LI   + K + V E EG+V+  E HILDSLLRET A I SL K   F
Subjt:  EAVRERSVKKEYFPRAKYEVVEDMDALI---INK-RVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHF

A0A7J8V457 Uncharacterized protein2.9e-3030.32Show/hide
Query:  WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS----FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----
        W  G ++S++R   A   +     +   C+S +F FFDF   +F   H      + SS    F  PH     H   +P A     EAPRNSLE E     
Subjt:  WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS----FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----

Query:  ------GASISCLRNKE-KNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPSSSF---------------NC
               AS++   +KE ++L + MG+QIKT +G  +SK       NND    +  SP  KTP L+ARLMGLD+LP+  S SF               + 
Subjt:  ------GASISCLRNKE-KNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPSSSF---------------NC

Query:  RGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK
         G +F GTRSLPE+PR+SSAR SDVD HH R SLQI  +KEN++  EE                         K+ KE V +KV + DITN+ R  E  +
Subjt:  RGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK

Query:  QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RSSKVVRKNQEVMISKKQKLISMS------------------
        +E+  +Q K + K SK + K A D +                                + S++    Q + + +K KL +++                  
Subjt:  QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RSSKVVRKNQEVMISKKQKLISMS------------------

Query:  --MQKPKR-----RAREGEAFDCPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV---------EVYDLHICDVIRKYSKGK
          ++KP+R     R ++ E F  P++ N  N                N+  S++FP KK+PSPPA + P++QV             L  C      +K +
Subjt:  --MQKPKR-----RAREGEAFDCPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV---------EVYDLHICDVIRKYSKGK

Query:  AK--PAGRDG--GERNAVDKTTTTDGGSTEF-EYIKRILTNHGNSN----SIIS--PPNNPTNPSIFHHPE--AAEDQQWGRRLLNC-----WHVRR---
        A    + RD   G+R   + TTTT G   E+ EYI RIL   G       S+ S   P++P NPSIF++ E  A  + +  +  L C     +H+     
Subjt:  AK--PAGRDG--GERNAVDKTTTTDGGSTEF-EYIKRILTNHGNSN----SIIS--PPNNPTNPSIFHHPE--AAEDQQWGRRLLNC-----WHVRR---

Query:  --------GMKGW-----------ELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVV----EETEGIVKVVELHILDSLLRETVA
                 MK W            +G + +     K   FPRA   V+ED+DALI      +  R V    EE EGIV  +   IL++L+ ET A
Subjt:  --------GMKGW-----------ELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVV----EETEGIVKVVELHILDSLLRETVA

A0A7J9A4K1 Uncharacterized protein3.8e-3030.22Show/hide
Query:  WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS----FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----
        W  G ++S++R   A   +     +   C+S +F FFDF   +F   H      + SS    F  PH     H   +P A     EAPRNSLE E     
Subjt:  WHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF--THIVFDNHHPSS----FNLPH-----HRPPLPKASHHVVEAPRNSLELE-----

Query:  ------GASISCLRNKE-KNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPSSSF---------------NC
               AS++   +KE ++L + MG+QIKT +G  +SK       NND    +  SP  KTP L+ARLMGLD+LP+  S SF               + 
Subjt:  ------GASISCLRNKE-KNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALE-SPSGKTPNLLARLMGLDILPQNPSSSF---------------NC

Query:  RGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK
         G +F GTRSLPE+PR+SSAR SDVD HH R SLQI  +KEN++  EE                         K+ KE V +KV + DITN  R  E  +
Subjt:  RGSNF-GTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEA------------------------KREKEKVSKKVALVDITNNNRKIEFGK

Query:  QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RSSKVVRKNQEVMISKKQKLISMS------------------
        +E+  +Q K + K SK + K A D +                                + S++  + Q + + +K KL +++                  
Subjt:  QEVGFSQIKVEIKSSKKLKKTAVDESR-------------------------------RSSKVVRKNQEVMISKKQKLISMS------------------

Query:  MQKPKR-----RAREGEAFDCPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV---------EVYDLHICDVIRKYSKGKA-
        ++KP+R     R ++ E F  P++ N  N                N+  S++FP KK+PSPPA + P++QV             L  C      +K +A 
Subjt:  MQKPKR-----RAREGEAFDCPTSNNLVN----------------NLHHSTIFPAKKEPSPPAIQAPREQV---------EVYDLHICDVIRKYSKGKA-

Query:  ----KPAGRDGGERNAVDKTTTTDGGSTEF-EYIKRILTNHGNSN----SIIS--PPNNPTNPSIFHHPE--AAEDQQWGRRLLNC-----WHVRR----
             P    G   N V  TTTT G   E+ EYI RIL   G       S+ S   P++P NPSIF++ E     + +  +  L C     +H+      
Subjt:  ----KPAGRDGGERNAVDKTTTTDGGSTEF-EYIKRILTNHGNSN----SIIS--PPNNPTNPSIFHHPE--AAEDQQWGRRLLNC-----WHVRR----

Query:  -------GMKGW-----------ELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVV----EETEGIVKVVELHILDSLLRETVA
                MK W            +G + +     K   FPRA   V+ED+DALI      +  R V    EE EGIV  +   IL++L+ ET A
Subjt:  -------GMKGW-----------ELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVV----EETEGIVKVVELHILDSLLRETVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G25430.1 unknown protein1.9e-1334.32Show/hide
Query:  MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLEL-EGASISCLR
        MG+ EW+ GGRS      T     ++       C++ L+HFF F      H  F + H       HH+P +  P  +   + APRNSL+L E + +S   
Subjt:  MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPL--PKASHHVVEAPRNSLEL-EGASISCLR

Query:  NKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCR---------GSNFGTRSLPESPRVSSARLS
          E+      GL I    G  KS      +    H   L  P  KTPN++ARLMGLD+LP N   + + R         G+  GTRSLP SPR+SS    
Subjt:  NKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCR---------GSNFGTRSLPESPRVSSARLS

Query:  DVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKK
          D  + RLSL++  ++EN N  EE  R + K  K+
Subjt:  DVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKK

AT5G51850.1 unknown protein1.1e-1025.5Show/hide
Query:  GGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNS--LELEGASISCLRNKEKNL-QL
        G  SSSR   TA+            CM+  +H            +FD+HH                 H  +++P  S  L+L   S+     K+K +  +
Subjt:  GGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNS--LELEGASISCLRNKEKNL-QL

Query:  QMGLQIKTRNGSTKSKASEQQLPNNDHIIAL-ESPSGKTPNLLARLMGLDILPQNPSSSFNC--------------RGSNFGTRSLPESPRVSSARLSDV
         +G+++KT  G+  S+       ++     +  SP  KTPNL+ARLMGLD+LP     + +               R S  GTRSLP SPR+SSAR SD 
Subjt:  QMGLQIKTRNGSTKSKASEQQLPNNDHIIAL-ESPSGKTPNLLARLMGLDILPQNPSSSFNC--------------RGSNFGTRSLPESPRVSSARLSDV

Query:  DCHHRRLSLQIISDKE----NINFFEE------------AKREKEK-VSKKVALVDITNN--NR------------------KIEFGKQEVGFSQIKVEI
        D H  RLSLQ+  +KE     +   +E             K+ KE+ V+++V  +DITN+  NR                  +  F ++E   S      
Subjt:  DCHHRRLSLQIISDKE----NINFFEE------------AKREKEK-VSKKVALVDITNN--NR------------------KIEFGKQEVGFSQIKVEI

Query:  KSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKR---------------RAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSP------
         SS    +  + + + +  ++ + Q     K+++L  +++ K                  R R+ E F   + +     LH    F  KK P        
Subjt:  KSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKR---------------RAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSP------

Query:  PAIQAPREQVEVYDLHICD---VIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGR
         A + P +Q+   +  I +    IR  S  K +         +  D   T    ++E +YI RI+    N   I S      + SIF   E   D   G 
Subjt:  PAIQAPREQVEVYDLHICD---VIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGR

Query:  RLLNCWH----------------VRRG-MKGWELGEE---AVRERSVKKEYFPRAKYEVVEDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS
          L C                   RRG  +G EL  E   AV   S K    P  +  +V+    L   K++ EE E I+  +E  I+D+L+RET++ +S
Subjt:  RLLNCWH----------------VRRG-MKGWELGEE---AVRERSVKKEYFPRAKYEVVEDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALIS

AT5G62170.1 unknown protein1.4e-1627.04Show/hide
Query:  WHFGG--RSSSRRATTADHPRQRYRPSL-------PSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLEL--EGASIS
        W  GG  +SSS+       P Q   PSL         CMS +F+ FDF+     H+         F + HH   LPK     V+APRNSLE   E  S S
Subjt:  WHFGG--RSSSRRATTADHPRQRYRPSL-------PSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLEL--EGASIS

Query:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQN------PSSSFN---------CRGSNF--------
          R K+ NL + MG++IKT+  +  S AS    P   +     SPS KTP L+ARLMGLD++P N      PSSS +          R S+         
Subjt:  CLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQN------PSSSFN---------CRGSNF--------

Query:  -------GTRSLPESPRVSSARLS-DVDCH-HRRLSL--------------------------QIISDKENINFFEEAK----REKEKVSKKVAL-VDIT
               GTRSLPE+PR+S  R S DV+C+ H+R SL                          +I  DKEN +  E A+    + KE VS++  +  DIT
Subjt:  -------GTRSLPESPRVSSARLS-DVDCH-HRRLSL--------------------------QIISDKENINFFEEAK----REKEKVSKKVAL-VDIT

Query:  NNN---RKIEFGKQEVGFSQIKVEIKSSK---------KLKKTAVDESRRSSKVV--------------------------RKNQEVMISKKQKLISMSM
        N     R++   K+    + I     SS          K K T++  +  +SK++                          ++ +     KK +     +
Subjt:  NNN---RKIEFGKQEVGFSQIKVEIKSSK---------KLKKTAVDESRRSSKVV--------------------------RKNQEVMISKKQKLISMSM

Query:  QKPKRRAREGEAFDCPTSNNLVNNLHHSTIF------PAKKEP----------SPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKT
         KP +  +E      P  NN  NN +   +        +KK P          S P I+    Q      ++     +  + +A  +        +    
Subjt:  QKPKRRAREGEAFDCPTSNNLVNNLHHSTIF------PAKKEP----------SPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKT

Query:  TTTDGGSTEFEYIKRILTNHG-NSNSIIS-----PPNNPTNPSIFH----------HPEAAEDQ---QWGRRLL---------------------NCWHV
            GG  E EYI R L   G + ++ IS      P++P +PSIF+           P  + +    +  R+LL                      C + 
Subjt:  TTTDGGSTEFEYIKRILTNHG-NSNSIIS-----PPNNPTNPSIFH----------HPEAAEDQ---QWGRRLL---------------------NCWHV

Query:  RRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVVEETEGIVKVVELHILDSLLRET
         R  +  + G E + E S + E FP AK  V+ED+DAL+      I     E+ EGIV  +E  I ++L+ ET
Subjt:  RRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI------INKRVVEETEGIVKVVELHILDSLLRET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCAC
CCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAG
CTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAA
ATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTT
GGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGT
CAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAA
GTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAA
GAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGC
AAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCT
TCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGA
CGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCA
TCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGA
ATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAAT
CAACAAGAGAGTGGTGGAGGAAACAGAAGGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAAT
GCTCTCATTTTCCTAATTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCAC
CCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAG
CTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAA
ATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTT
GGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGT
CAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAA
GTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAA
GAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGC
AAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCT
TCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGA
CGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCA
TCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGA
ATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAAT
CAACAAGAGAGTGGTGGAGGAAACAGAAGGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAAT
GCTCTCATTTTCCTAATTTCTAA
Protein sequenceShow/hide protein sequence
MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQ
IKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEK
VSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEP
SPPAIQAPREQVEVYDLHICDVIRKYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRLLNCWHVRRG
MKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALIINKRVVEETEGIVKVVELHILDSLLRETVALISSLPKCSHFPNF