; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015350 (gene) of Snake gourd v1 genome

Gene IDTan0015350
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationLG06:431921..449937
RNA-Seq ExpressionTan0015350
SyntenyTan0015350
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013928.1 hypothetical protein SDJN02_24097 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-17782.4Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E EV   GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P   LIDS H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRL+KM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

XP_022954221.1 uncharacterized protein LOC111456540 [Cucurbita moschata]5.2e-17882.64Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E EV   GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P   LIDS H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRLVKM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

XP_022992263.1 uncharacterized protein LOC111488637 [Cucurbita maxima]6.7e-17882.64Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E EV   GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P  TL DS H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRLVKM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

XP_023549471.1 uncharacterized protein LOC111807822 [Cucurbita pepo subsp. pepo]1.8e-17882.4Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E E E  GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P   LID+ H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRLVKM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

XP_038898126.1 uncharacterized protein LOC120085908 isoform X1 [Benincasa hispida]1.6e-17483.46Show/hide
Query:  EEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGE-GETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERM
        EEDGA+PKC+AE     EAE DE+DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E E E EGE GE   GD EWRDVEAEGRQWWGGFGAVYD YCERM
Subjt:  EEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGE-GETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERM

Query:  LFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDS-YH-ETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHY
         FF++ SI+    SG ES S RSASKKSASPLRCLSLKRIEEPEDE E + P LTLIDS YH ETAYVAHI LSWEALHCQYTQLNHLISCQPQN TTHY
Subjt:  LFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDS-YH-ETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHY

Query:  NLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKKISNSASLSLR
        NLTAQLFQQFQVLLQRFIENEPFQQGLRP IYART R FPKMLHVPNIQASDPN  QEQESD LILAPDLL+IIEASIFTFHRF+KM+KK SNSASLS +
Subjt:  NLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKKISNSASLSLR

Query:  NQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC
        N TQDA+LLAR+RSSLDKKKTKLKE+RKKS+GWKQKT PQTYEDMQLLFG+VD+KI+SRLVKMSRITKEQLLWCEEK+NKLD+S+GKLRRD SPLLFPC
Subjt:  NQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC

TrEMBL top hitse value%identityAlignment
A0A0A0K6Q7 Uncharacterized protein6.6e-17180.49Show/hide
Query:  PTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGGF
        PT   +    KEEDG +PK  AE     EAE DEDDFIMEEVKRRLKELRRNSFMVLIPEE+EEE E  E E  GEGET     EWRDVEAEGRQWWGGF
Subjt:  PTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGGF

Query:  GAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLI
        GAVYD YCERM FF++ SI+    SG  S S RSASKKSASPLRCLSLKRIEEPEDE E + P LTLIDS H  E AYVAHI LSWEALHCQYTQLNHLI
Subjt:  GAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLI

Query:  SCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDK
        SCQPQN TTHYNLTAQLFQQFQVLLQRFIENEPFQQ LRP IYART R +PKMLHVPNIQASDPN  QEQESD LILAPDLL IIEASIFTFHRF+KM+K
Subjt:  SCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDK

Query:  KISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLR
        K S SASLS RN TQDA+LLAR+RSSLDKKKTKLKE+RKKSKGWKQKT PQTYEDMQLLFGVVD+KI++RL+KMSRITKEQLLWCEEKMNKLD+S+GKLR
Subjt:  KISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLR

Query:  RDPSPLLFPC
        RDPSPLLFPC
Subjt:  RDPSPLLFPC

A0A1S3CGF8 uncharacterized protein LOC1035006151.5e-17080.05Show/hide
Query:  QPTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGG
        +PT+  +    KEEDG +PK  AE     EAE DEDDFIMEEVKRRLKELRRNSFMVLIPEE+EEE E  E E  GEGE      EWRDVEAEGRQWWGG
Subjt:  QPTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGG

Query:  FGAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHL
        FGAVYD YCERM FF++ SI+    SG ES S RSASKKSASPLRCLSLKRIEEPEDE E   P LT IDS H  E AYVAHI LSWEALHCQYTQLNHL
Subjt:  FGAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHL

Query:  ISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMD
        ISCQPQN TTHYNLTAQLFQQFQVLLQRFIENEPFQQ LRP IYART R FPKMLHVPNIQASDPN  QEQESD LILAPDLL I+EASIFTFHRF+KM+
Subjt:  ISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMD

Query:  KKISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKL
        KK SNSASLS RN TQDA+L AR+RSSLDKKKTKLKE+RKKSKGWKQKT PQTYEDMQLLFGVVD+KI++RL+KMSRITKEQLLWCEEKMNKLD+S+GKL
Subjt:  KKISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKL

Query:  RRDPSPLLFPC
        RRDPSPLLFPC
Subjt:  RRDPSPLLFPC

A0A5A7UTV1 DUF1666 domain-containing protein1.5e-17080.05Show/hide
Query:  QPTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGG
        +PT+  +    KEEDG +PK  AE     EAE DEDDFIMEEVKRRLKELRRNSFMVLIPEE+EEE E  E E  GEGE      EWRDVEAEGRQWWGG
Subjt:  QPTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQE-LEVEVEGEGETSCGDLEWRDVEAEGRQWWGG

Query:  FGAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHL
        FGAVYD YCERM FF++ SI+    SG ES S RSASKKSASPLRCLSLKRIEEPEDE E   P LT IDS H  E AYVAHI LSWEALHCQYTQLNHL
Subjt:  FGAVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHL

Query:  ISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMD
        ISCQPQN TTHYNLTAQLFQQFQVLLQRFIENEPFQQ LRP IYART R FPKMLHVPNIQASDPN  QEQESD LILAPDLL I+EASIFTFHRF+KM+
Subjt:  ISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMD

Query:  KKISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKL
        KK SNSASLS RN TQDA+L AR+RSSLDKKKTKLKE+RKKSKGWKQKT PQTYEDMQLLFGVVD+KI++RL+KMSRITKEQLLWCEEKMNKLD+S+GKL
Subjt:  KKISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKL

Query:  RRDPSPLLFPC
        RRDPSPLLFPC
Subjt:  RRDPSPLLFPC

A0A6J1GQB8 uncharacterized protein LOC1114565402.5e-17882.64Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E EV   GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P   LIDS H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRLVKM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

A0A6J1JYQ1 uncharacterized protein LOC1114886373.3e-17882.64Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG
        ER LKEEDG + KC AEA+A  EAE       D++DFIMEEVKRRLKELRRNSFMVLIPEE+EEE+E EV   GEGETSCG+ EWRDVEAEGRQWWGGFG
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEA------DEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFG

Query:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS
        AVYD YCERM+FF++ S Q    SG ESAS RSA KKSASPLRCLSLKRIEEPEDE E L P  TL DS H  ETAYVAHI LSWEALHCQYTQLNHLIS
Subjt:  AVYDQYCERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLIS

Query:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK
        CQPQNPTT YNLTAQLFQQFQVLLQRFIENEPFQQ LRPAIYART R FPKMLHVPNIQASD N  QEQESD LILA DLLVIIEASIFTFHRF+KMDKK
Subjt:  CQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKK

Query:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR
         S SASLSLRNQTQDA+LLARIRSSLDKKK KLKE+RKKS+GWKQKTWPQTYEDMQLLFGVVD+KI+SRLVKM RI+KEQLLWCEEK+ KLD+SDGKLRR
Subjt:  ISNSASLSLRNQTQDASLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRR

Query:  DPSPLLFPC
        DPSPLLFPC
Subjt:  DPSPLLFPC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)1.2e-2628.78Show/hide
Query:  AEADEDDF----IMEEVKRRLKELRRNSFMVLIPEEDEEEQELE-VEVEGEGETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERM----LFFNQTSIQP
        +++D+D+F    ++E++K  L+  R      ++ E +   QEL+ +++E + +      + +D  AE          VY  Y  +M    +  +QT    
Subjt:  AEADEDDF----IMEEVKRRLKELRRNSFMVLIPEEDEEEQELE-VEVEGEGETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERM----LFFNQTSIQP

Query:  LIHSGHESASP-RSASK--KSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQ
         +    +S+ P R+  K  KS+        K+     D  E L   +       ET YV  + LSWE L  QY   + ++    Q  T  YNL A  FQ 
Subjt:  LIHSGHESASP-RSASK--KSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQ

Query:  FQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-EQESDFLILAPDLLVIIEASIFTFHRFVKMDK-------KISNSASLSLRN
        FQVLLQRF+ENEPFQ   R   Y +  R F   L +P ++    ++ +   E +F +    L  II  S+  F  F+  DK       K+S+   +S ++
Subjt:  FQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-EQESDFLILAPDLLVIIEASIFTFHRFVKMDK-------KISNSASLSLRN

Query:  QTQDASLLARIRSSLDKKKTKLKELRKK-----SKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLL
         + D  LL  IR+ L KK+ KLKE+++       K  K ++        +LL   ++++++SR++ MS++T E+L WC+EK+ K+  +  K+  +P   L
Subjt:  QTQDASLLARIRSSLDKKKTKLKELRKK-----SKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLL

Query:  FPC
         PC
Subjt:  FPC

AT1G73850.1 Protein of unknown function (DUF1666)6.8e-3533.16Show/hide
Query:  EEDEEEQELEVEVEGE----GETSCGDLEWRD-------VEAEGRQ---WWGGFGAVYDQYCERMLFFNQTSIQPLIHSGHESAS-------PRSASKKS
        E +EEE+E   ++ GE    G TS    EWR+            R+    W  +  V+ +Y E M F  + S Q L    HE+ S       PRS S++ 
Subjt:  EEDEEEQELEVEVEGE----GETSCGDLEWRD-------VEAEGRQ---WWGGFGAVYDQYCERMLFFNQTSIQPLIHSGHESAS-------PRSASKKS

Query:  ASPLRCLSLKRIEE--PEDEKEHLHPPLTLIDSYHETAYVAHISLSWEALHCQYTQLNHLISCQPQ--NPTTHYNLTAQLFQQFQVLLQRFIENEPFQQG
           L     K+ ++  P       +P + L     E+AYVA I L+WEAL   Y       S   +  N        A  F+ F +LLQR++ENEP++ G
Subjt:  ASPLRCLSLKRIEE--PEDEKEHLHPPLTLIDSYHETAYVAHISLSWEALHCQYTQLNHLISCQPQ--NPTTHYNLTAQLFQQFQVLLQRFIENEPFQQG

Query:  LRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESD----FLILAPDLLVIIEASIFTFHRFVKMDK-----KISNSASLSLRNQTQDASLLARIRSSLD
         RP IYAR     PK+L VP  Q  +  E +E E++      I +   L+I+E  I TF  F++ DK     KI  +     +    D +L+  ++    
Subjt:  LRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESD----FLILAPDLLVIIEASIFTFHRFVKMDK-----KISNSASLSLRNQTQDASLLARIRSSLD

Query:  KKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDG--KLRRDPSPLLFP
        KKKTKLKE+R+  K  ++K      E+M++L G++D+K++SR+++M+ + +E L WCEEKM+K+ +  G   L+RD +PL FP
Subjt:  KKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDG--KLRRDPSPLLFP

AT3G20260.1 Protein of unknown function (DUF1666)1.2e-11658.02Show/hide
Query:  EAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEV--EVEGEGETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERMLFFNQTSIQPLIHSG-
        E E D+DDFI  EVKRRLKELRRNSFMVLIPEE+EEE+E     E + +GE  C   EWRDV AEG QWWGGF AVY++YCERMLFF++ S Q L  +G 
Subjt:  EAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEV--EVEGEGETSCGDLEWRDVEAEGRQWWGGFGAVYDQYCERMLFFNQTSIQPLIHSG-

Query:  -----HESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQF
               + SPRSASKK +SP RCLSLK+ + PE++ EHL  P  + D Y   ETAYVA + L+WEALHCQYTQL+HLISCQP+ PT  YN TAQLFQQF
Subjt:  -----HESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYH--ETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQF

Query:  QVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKKISNSA----SLSLRNQTQDA
         VLLQR+IENEPF+QG R  +YAR   A PK+L  P IQ SD  E  E+++ F++LA DL+ +IE+SI TF+ F+KMDKK  N           N     
Subjt:  QVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKKISNSA----SLSLRNQTQDA

Query:  SLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC
        + L  ++SS+DKK+ K KEL KK+KG ++K+WPQT+E +QLLF  +D+K+ +R+++MS+I+KEQLLWCEEKM KL+ S GKL+R PSP+LFPC
Subjt:  SLLARIRSSLDKKKTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC

AT5G39785.1 Protein of unknown function (DUF1666)5.5e-2927.96Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEADEDDF-----------IMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLE-WRDVEAEGRQ
        E  LK+  G N K           E +E+D            ++E++K  +K+++    +  I EE+EE+ +    +E        DL+ WR  E +  +
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEADEDDF-----------IMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLE-WRDVEAEGRQ

Query:  WWGGFGAVYD---QYCERMLFFNQTSIQPLIHSG-HESASPRSA--------SKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLS
             G V+     Y ERM   +  S Q     G  +S SP+ A        S+ S S +  ++++  +  + E E +   +  I    E  YV  + LS
Subjt:  WWGGFGAVYD---QYCERMLFFNQTSIQPLIHSG-HESASPRSA--------SKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLS

Query:  WEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-------EQESDFLILA
        WE LH QY +   L+       +  YN  A  FQQFQVLLQRF+ENEPF++        R C     +L +P I+     + +       E+ +D +I +
Subjt:  WEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-------EQESDFLILA

Query:  PDLLVIIEASIFTFHRFVKMDKKISNSASLSLRNQTQ----------DASLLARIRSSLDKKKTKLKELRKKS-----KGWKQKTWPQTYEDMQLLFGVV
          L+ I+E +I  F RFV+ DK  S+      R ++Q          D  + A ++S L  K+ +L+++ K       +  K K    T + +   F  V
Subjt:  PDLLVIIEASIFTFHRFVKMDKKISNSASLSLRNQTQ----------DASLLARIRSSLDKKKTKLKELRKKS-----KGWKQKTWPQTYEDMQLLFGVV

Query:  DVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC
        D+K+++R++ MS++T++ L+WC  K+ K++  + +L  DPS  LFPC
Subjt:  DVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC

AT5G39785.2 Protein of unknown function (DUF1666)5.2e-2727.68Show/hide
Query:  ERGLKEEDGANPKCTAEAQAKTEAEADEDDF-----------IMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLE-WRDVEAEGRQ
        E  LK+  G N K           E +E+D            ++E++K  +K+++    +  I EE+EE+ +    +E        DL+ WR  E +  +
Subjt:  ERGLKEEDGANPKCTAEAQAKTEAEADEDDF-----------IMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLE-WRDVEAEGRQ

Query:  WWGGFGAVYD---QYCERMLFFNQTSIQPLIHSG-HESASPRSA--------SKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLS
             G V+     Y ERM   +  S Q     G  +S SP+ A        S+ S S +  ++++  +  + E E +   +  I    E  YV  + LS
Subjt:  WWGGFGAVYD---QYCERMLFFNQTSIQPLIHSG-HESASPRSA--------SKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLS

Query:  WEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-------EQESDFLILA
        WE LH QY +   L+       +  YN  A  FQQFQVLLQRF+ENEPF++        R C     +L +P I+     + +       E+ +D +I +
Subjt:  WEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQFQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQ-------EQESDFLILA

Query:  PDLLVIIEASIFTFHRFVKMDKKISNSASLSLRNQTQ----------DASLLARIRSSLDK-KKTKLKELRKKS-----KGWKQKTWPQTYEDMQLLFGV
          L+ I+E +I  F RFV+ DK  S+      R ++Q          D  + A ++S L    + +L+++ K       +  K K    T + +   F  
Subjt:  PDLLVIIEASIFTFHRFVKMDKKISNSASLSLRNQTQ----------DASLLARIRSSLDK-KKTKLKELRKKS-----KGWKQKTWPQTYEDMQLLFGV

Query:  VDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC
        VD+K+++R++ MS++T++ L+WC  K+ K++  + +L  DPS  LFPC
Subjt:  VDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCAGCCAACATCAGTTTCAATGGAGAGGGGTTTGAAGGAGGAGGATGGTGCAAATCCTAAGTGTACTGCAGAAGCACAAGCAAAGACAGAAGCTGAAGCGGATGA
GGACGATTTCATAATGGAAGAGGTAAAGAGGAGATTAAAGGAGCTAAGGAGGAACAGTTTCATGGTGTTGATTCCTGAAGAAGACGAAGAAGAACAAGAACTAGAAGTCG
AAGTCGAAGGTGAAGGAGAGACAAGCTGTGGGGATCTTGAATGGAGAGATGTGGAAGCTGAAGGTCGACAATGGTGGGGTGGGTTTGGTGCTGTTTATGACCAATACTGT
GAGAGGATGCTTTTTTTTAATCAGACGAGCATTCAACCCCTCATTCATTCTGGCCATGAATCAGCCTCCCCAAGATCTGCATCCAAAAAGAGTGCATCTCCACTTCGCTG
TCTTTCTCTGAAGAGGATCGAAGAACCTGAAGACGAGAAGGAGCATCTTCACCCTCCATTGACTCTGATTGACTCTTATCACGAGACAGCCTATGTTGCTCACATTTCCT
TGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTCAACCACTTAATATCATGCCAACCCCAAAACCCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAA
TTTCAAGTCCTCTTGCAAAGGTTTATCGAAAATGAACCCTTCCAACAAGGTCTCAGGCCTGCAATATATGCCCGAACCTGTCGAGCCTTTCCTAAAATGTTGCACGTTCC
TAACATACAAGCTTCAGATCCAAACGAGACACAGGAACAAGAATCTGATTTCCTCATCCTTGCTCCTGATCTGCTTGTCATTATTGAGGCCTCAATCTTTACTTTCCACC
GCTTTGTGAAGATGGACAAGAAAATCTCAAATTCTGCTTCTTTATCATTACGGAATCAAACCCAGGATGCCAGTCTGCTTGCTCGTATTCGATCTTCTCTTGACAAGAAG
AAGACAAAGCTGAAAGAACTTAGAAAGAAGAGTAAAGGGTGGAAGCAGAAAACGTGGCCTCAAACGTATGAAGACATGCAGTTGCTTTTCGGGGTGGTGGACGTTAAAAT
CATGTCAAGGCTTGTGAAAATGTCGAGGATTACTAAAGAACAGTTGCTTTGGTGCGAGGAGAAAATGAACAAATTAGATTTGTCTGATGGAAAATTGCGGAGAGATCCTT
CTCCTCTTCTTTTCCCGTGTTAA
mRNA sequenceShow/hide mRNA sequence
CTCCACTTCTTCTCCCTCCTCTTCACAGCGTCTCCTCCTGCGTTTCAGCAGCTCAGTGTCGCCGCCATCTCTCTGCCGTTCAGTGTCGCTGCCGCTGCCACCGTCTTTCT
TCCGTTCGTACGCTGCAGCCTCCCTCGATTTTCTACTCTCCTCTTTACTGCCTGACAATCTCATGGTGCAGCCAACATCAGTTTCAATGGAGAGGGGTTTGAAGGAGGAG
GATGGTGCAAATCCTAAGTGTACTGCAGAAGCACAAGCAAAGACAGAAGCTGAAGCGGATGAGGACGATTTCATAATGGAAGAGGTAAAGAGGAGATTAAAGGAGCTAAG
GAGGAACAGTTTCATGGTGTTGATTCCTGAAGAAGACGAAGAAGAACAAGAACTAGAAGTCGAAGTCGAAGGTGAAGGAGAGACAAGCTGTGGGGATCTTGAATGGAGAG
ATGTGGAAGCTGAAGGTCGACAATGGTGGGGTGGGTTTGGTGCTGTTTATGACCAATACTGTGAGAGGATGCTTTTTTTTAATCAGACGAGCATTCAACCCCTCATTCAT
TCTGGCCATGAATCAGCCTCCCCAAGATCTGCATCCAAAAAGAGTGCATCTCCACTTCGCTGTCTTTCTCTGAAGAGGATCGAAGAACCTGAAGACGAGAAGGAGCATCT
TCACCCTCCATTGACTCTGATTGACTCTTATCACGAGACAGCCTATGTTGCTCACATTTCCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTCAACCACTTAA
TATCATGCCAACCCCAAAACCCTACTACTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATCGAAAATGAACCCTTCCAACAA
GGTCTCAGGCCTGCAATATATGCCCGAACCTGTCGAGCCTTTCCTAAAATGTTGCACGTTCCTAACATACAAGCTTCAGATCCAAACGAGACACAGGAACAAGAATCTGA
TTTCCTCATCCTTGCTCCTGATCTGCTTGTCATTATTGAGGCCTCAATCTTTACTTTCCACCGCTTTGTGAAGATGGACAAGAAAATCTCAAATTCTGCTTCTTTATCAT
TACGGAATCAAACCCAGGATGCCAGTCTGCTTGCTCGTATTCGATCTTCTCTTGACAAGAAGAAGACAAAGCTGAAAGAACTTAGAAAGAAGAGTAAAGGGTGGAAGCAG
AAAACGTGGCCTCAAACGTATGAAGACATGCAGTTGCTTTTCGGGGTGGTGGACGTTAAAATCATGTCAAGGCTTGTGAAAATGTCGAGGATTACTAAAGAACAGTTGCT
TTGGTGCGAGGAGAAAATGAACAAATTAGATTTGTCTGATGGAAAATTGCGGAGAGATCCTTCTCCTCTTCTTTTCCCGTGTTAAGTATTTCCACTGCAGAGCACTTCAT
CACAAGTTGTTGCAAGTGTGTTTAATGTAATTGGATTCTGGTTGCAGATAAGTATGATGATGGGCCCTACTACTACTATTACAACATTGAATTTGAATTTCTGGGTTTTC
CCAACTCCTTGTGTTGGTGTTTGTTATGTTAGGGAAAGAGGGGCCATTTCAACTATCTAAGCTATCTGTTGCACACTCACCTCACAATCAAATAGATCAATGTTCATGTT
CATCTCTTTTGTTTAACTTTGTATAGCAAAATACCCTTCAATTACCAAGTCCTAATTAATTTTTTTCTTTCCATTGCTACAAGAGAGGATGAAGAGTCTTGAGTCGTATG
TTGCATGTCAGGTCGTGGTAGAGTTGCTAGCTACCCAAAAGTATACACACTGAGTTGTTTCTTACACCA
Protein sequenceShow/hide protein sequence
MVQPTSVSMERGLKEEDGANPKCTAEAQAKTEAEADEDDFIMEEVKRRLKELRRNSFMVLIPEEDEEEQELEVEVEGEGETSCGDLEWRDVEAEGRQWWGGFGAVYDQYC
ERMLFFNQTSIQPLIHSGHESASPRSASKKSASPLRCLSLKRIEEPEDEKEHLHPPLTLIDSYHETAYVAHISLSWEALHCQYTQLNHLISCQPQNPTTHYNLTAQLFQQ
FQVLLQRFIENEPFQQGLRPAIYARTCRAFPKMLHVPNIQASDPNETQEQESDFLILAPDLLVIIEASIFTFHRFVKMDKKISNSASLSLRNQTQDASLLARIRSSLDKK
KTKLKELRKKSKGWKQKTWPQTYEDMQLLFGVVDVKIMSRLVKMSRITKEQLLWCEEKMNKLDLSDGKLRRDPSPLLFPC