; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015301 (gene) of Snake gourd v1 genome

Gene IDTan0015301
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein BPS1, chloroplastic-like
Genome locationLG10:2132845..2133897
RNA-Seq ExpressionTan0015301
SyntenyTan0015301
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7028606.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-15984.7Show/hide
Query:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE
        MSR QE PHR FFPFGNPFRA  P+GSKLSS+LVFLLATFEDSLAE L KLTPKSENDILSFSWM LAMKLLCETHKDVKTL+EEL LPVSDWDEKWLDE
Subjt:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE

Query:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG
        YLDIS+KLLDICNDFSSEL+QL+QGHLVLRCALHNLASTS NQFVHARSS DGWNQHIS+RTSRV+ HSPILDRLKESLDLPKV KNS+KGKVLMHVMYG
Subjt:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG

Query:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK
        VKV TLFICSV AS+F+GSS  LL IN+P TYRWAQAFTELQKNVN       SSGR TVLRE DAVDESVKKLHSMIQGN+DG  KVEEFQN+VVDL +
Subjt:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK

Query:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        EAEKLTQGVDHLTK+VDEFFH+VLSGRDALLSNLR SETV DQG G + TRQL
Subjt:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]2.6e-16284.94Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAI P+GSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLLCETH DVKTL+E+LGLPVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSEL+QLNQG LVLRCALHNL STSSNQFV A SS DGW++HISSRTSR ES   ILD L+ESLD PKVKNSSKGKVLMHVMYGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        VETLFICSVFAS FSGSSK+L PI VP TYRWAQAFT+LQKNVN+ IKK  SSGR TVLREL+AVDESV  LHSMIQGN+DG +K EEFQN+VVD R+EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMS-TRQL
        EKL+QG+DHLTK+VDEFF IVLSGRDALLSNLRASETVFDQG+GG+S TRQL
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMS-TRQL

XP_022935440.1 protein BPS1, chloroplastic-like [Cucurbita moschata]2.7e-15984.42Show/hide
Query:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE
        MSR QE PHR FFPFGNPFRA  P+GSKLSS+LVFLLATFEDSLAE L KLTPKSENDILSFSWM LA+KLLCETHKDVKTL+EEL LPVSDWDEKWLDE
Subjt:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE

Query:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG
        YLDIS+KLLDICNDFSSEL+QLNQGHLVLRCALHNLASTS NQFVHARSS DGWNQHIS+RTSRV+SHSPILDRLKESLDLPKV KNS+KGKVL+HVMYG
Subjt:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG

Query:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK
        VKV TLFICSV AS+F+GSS  LL IN+P TYRWAQAFTELQKNVN       SSGR TVLRE DAVDESVKKLHSMIQGN+DG +K+EEFQN+VVDL +
Subjt:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK

Query:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        EAEKLTQGVDHLTK+VDEFFH+VLSGRDALLSNLR SETV DQG G + TRQL
Subjt:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

XP_023539967.1 protein BPS1, chloroplastic-like [Cucurbita pepo subsp. pepo]4.6e-15984.42Show/hide
Query:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE
        MSR QE PHR FFPFGNPFRA  P+GSKL+S+LVFLLATFEDSLAE L+KLTPKSENDILSFSWM LAMKLLCETHKDVKTL+EEL LPVSDWDEKWLDE
Subjt:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE

Query:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG
        YLDIS+KLLDICNDFSSEL+QLNQGHLVLRCALHNLASTSSNQFVHARSS DGWNQHIS+RTSRV+SHSPILDRLKESLDLPKV KNS+KGKVLMHVMYG
Subjt:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG

Query:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK
        VKV TLFICSV AS+F+GSS  LL IN+P TYRWAQAFTELQKNVN       SSGR TV RE DAVDESVKKLHSMIQGN+DG +K +EFQN+VVDL +
Subjt:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK

Query:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        EAEKLTQGVDHLTK+VDEFFHIVLSGRDALLSNLR SETV DQG G +  RQL
Subjt:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]5.0e-16686.04Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAI P+GSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAM LLCETH DVKTL+EELG P SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L++SVKLLDICNDFSSEL+QLNQGHL++RCALHNL STSS+QFVHA SS D WNQHISSRTSRVES S ILD L+ESLDLPKVKNSSKGKVLMHV+YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V TLFICS FAS+ SGSSKRLLP NV  T+RWA AFTELQKNVNM IKK  SSGRCT LR++DAVDESVKKLHSMIQGNMDG +KVEEFQ ++VDLR+EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        EKLTQGVDHLTK+VD FFHIVLSGRDALLSNLRASETVFDQG+ G STRQL
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

TrEMBL top hitse value%identityAlignment
A0A1S3AVG0 protein BPS1, chloroplastic-like1.4e-15883.19Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAI P+G+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLL ETH +VKTL+EELG PVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L+ISVKLLDICNDFSSEL+QLNQGHL+LRCALHNL STSSNQ V A SS D WNQHISSRTSRV+S SPILD LKESLDLPKVKNSSKGKVLMH +YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V TLFICSVFASSFSGSS+ LLP NVP ++RWA AFTELQK VNM IKK  SSGR T LR++DAV+E VKKLHSMIQGNMD     EEFQN++V+LR+EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        E LTQGVDHLTK+VDEFFHIVLSGRD LLSNLRASETVF QG+GG+ TRQL
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

A0A5D3D1K6 Protein BPS11.4e-15883.19Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAI P+G+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLL ETH +VKTL+EELG PVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L+ISVKLLDICNDFSSEL+QLNQGHL+LRCALHNL STSSNQ V A SS D WNQHISSRTSRV+S SPILD LKESLDLPKVKNSSKGKVLMH +YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V TLFICSVFASSFSGSS+ LLP NVP ++RWA AFTELQK VNM IKK  SSGR T LR++DAV+E VKKLHSMIQGNMD     EEFQN++V+LR+EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        E LTQGVDHLTK+VDEFFHIVLSGRD LLSNLRASETVF QG+GG+ TRQL
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X11.3e-16284.94Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAI P+GSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLLCETH DVKTL+E+LGLPVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSEL+QLNQG LVLRCALHNL STSSNQFV A SS DGW++HISSRTSR ES   ILD L+ESLD PKVKNSSKGKVLMHVMYGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        VETLFICSVFAS FSGSSK+L PI VP TYRWAQAFT+LQKNVN+ IKK  SSGR TVLREL+AVDESV  LHSMIQGN+DG +K EEFQN+VVD R+EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMS-TRQL
        EKL+QG+DHLTK+VDEFF IVLSGRDALLSNLRASETVFDQG+GG+S TRQL
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMS-TRQL

A0A6J1FAM7 protein BPS1, chloroplastic-like1.3e-15984.42Show/hide
Query:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE
        MSR QE PHR FFPFGNPFRA  P+GSKLSS+LVFLLATFEDSLAE L KLTPKSENDILSFSWM LA+KLLCETHKDVKTL+EEL LPVSDWDEKWLDE
Subjt:  MSRPQE-PHRPFFPFGNPFRA-IPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDE

Query:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG
        YLDIS+KLLDICNDFSSEL+QLNQGHLVLRCALHNLASTS NQFVHARSS DGWNQHIS+RTSRV+SHSPILDRLKESLDLPKV KNS+KGKVL+HVMYG
Subjt:  YLDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKV-KNSSKGKVLMHVMYG

Query:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK
        VKV TLFICSV AS+F+GSS  LL IN+P TYRWAQAFTELQKNVN       SSGR TVLRE DAVDESVKKLHSMIQGN+DG +K+EEFQN+VVDL +
Subjt:  VKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRK

Query:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL
        EAEKLTQGVDHLTK+VDEFFH+VLSGRDALLSNLR SETV DQG G + TRQL
Subjt:  EAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQL

A0A6J1ISZ1 protein BPS1, chloroplastic-like7.1e-15882.29Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAI P+GSKLSSRLVFLLATFEDSLA++L KLTPKS+ND+ SF WM LAMKLLCE H DVKTL+ +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        LDISVKLLDICNDFSS+L+QLNQGHL+LRCALHNLASTSSNQFV AR S D WNQHISSRTSRVE+ SPI+D L+E LDLPKVKNS KGKVLM VMYGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V TLFICSVFA +FSGSSKRLL I+VP TYRWAQAFTELQKNVNM IK N SSGR T LR LDAVDE VKKLHSMIQ N+DG +KVEE QN++VDLR EA
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQ
        EKLTQGVDHLTK+VDEFFH+VLSGRDALLSNLR+SETVF QG+ G+STRQ
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIGGMSTRQ

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 41.4e-6235.65Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  L+ +L LPVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLAS----TSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVM
        L+ SVKLLDIC   SSEL++L+QG L+L+ ALH L S     S  Q   A  S   W + +  R  R+ S S  L  L  +L L KVKNS KGKVLM  +
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLAS----TSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVM

Query:  YGVKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLH-----------------------
        YG++  T+F+CS+F +  SGS K L+ ++VP  + W+QAF +L   V+  + +  + G    ++EL+ V+   K+LH                       
Subjt:  YGVKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLH-----------------------

Query:  ------SMIQ------------------------------------------------------------------GNMDGSLKV-----------EEFQ
              S++Q                                                                  G+ D S  V           EE  
Subjt:  ------SMIQ------------------------------------------------------------------GNMDGSLKV-----------EEFQ

Query:  NVVVDLRKEAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASE
        N +  + K AE L  G+D L+KRV +FF IVL+GRDALL NLR S+
Subjt:  NVVVDLRKEAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASE

Q337C0 UPF0496 protein 46.4e-6335.43Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  L+ +L LPVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLAS----TSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVM
        L+ SVKLLDIC   SSEL++L+QG L+L+ ALH L S     S  Q   A  S   W + +  R +R+ S S  L  L  +L L KVKNS+KGKVLM  +
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLAS----TSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVM

Query:  YGVKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLH-----------------------
        YG++  T+F+CS+F +  SGS K L+ ++VP  + W+QAF +L   V+  + +  S G    ++EL+ V+   ++LH                       
Subjt:  YGVKVETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLH-----------------------

Query:  --------------------------------------------------------SMIQ----------------GNMDGSLKV-----------EEFQ
                                                                +M++                G+ D S  V           EE  
Subjt:  --------------------------------------------------------SMIQ----------------GNMDGSLKV-----------EEFQ

Query:  NVVVDLRKEAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASE
        N +  + K AE L  G+D L+KRV +FF IVL+GRDALL NLR S+
Subjt:  NVVVDLRKEAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASE

Q9LMM6 Protein BPS1, chloroplastic1.8e-8952.99Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +  + S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ AM+ LCETH  +KTL+ +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        LDISVKLLD+CN FSSEL +LNQGHL+L+ ALHNL + S      A+SS D W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  +YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V+TL+I  VFA++FSGSS+ L+ + V     WAQ+F E+Q  +N  IK    S   TVL+EL+AV   VKKL+  IQ    GS+     Q     L+   
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA
         +L+ G+D ++K VD FF I+LSGRD LL NLR+
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA

Arabidopsis top hitse value%identityAlignment
AT1G01550.1 Protein of unknown function (DUF793)1.3e-9052.99Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +  + S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ AM+ LCETH  +KTL+ +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        LDISVKLLD+CN FSSEL +LNQGHL+L+ ALHNL + S      A+SS D W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  +YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V+TL+I  VFA++FSGSS+ L+ + V     WAQ+F E+Q  +N  IK    S   TVL+EL+AV   VKKL+  IQ    GS+     Q     L+   
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA
         +L+ G+D ++K VD FF I+LSGRD LL NLR+
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA

AT1G01550.2 Protein of unknown function (DUF793)1.3e-9052.99Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +  + S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ AM+ LCETH  +KTL+ +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        LDISVKLLD+CN FSSEL +LNQGHL+L+ ALHNL + S      A+SS D W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  +YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V+TL+I  VFA++FSGSS+ L+ + V     WAQ+F E+Q  +N  IK    S   TVL+EL+AV   VKKL+  IQ    GS+     Q     L+   
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA
         +L+ G+D ++K VD FF I+LSGRD LL NLR+
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511)1.1e-10254.65Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +  +GS LS  L+ LL  FE  L ERLKKL PK+++DIL+ SWM+LAM+ LCETHK++ TL+ +L LPVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAI-PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEY

Query:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK
        L+ISV+LLD+CN FSSEL +LNQG L L+C LHNL S S  +++ ARSS D W QH+++   R+E+   +LD L +SL LPKVKNS KGKVLM   YGVK
Subjt:  LDISVKLLDICNDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA
        V+T++ICSVF +++S S+K L  + V     WA+ FT++Q  VN  I+   SSGR T+L+EL++VD SV+KL+ MIQ  +D  ++VE F++ V++L  +A
Subjt:  VETLFICSVFASSFSGSSKRLLPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEA

Query:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIG
        EKL+QG+D L + VD FF + L GRD LL NLR+S+++    +G
Subjt:  EKLTQGVDHLTKRVDEFFHIVLSGRDALLSNLRASETVFDQGIG

AT3G61500.1 unknown protein1.2e-2138.67Show/hide
Query:  LLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEK--WLDEYLDISVKLLDICNDFSSELAQLNQGHLVLRCAL
        LL  FE SL ERLKKL P++ ++IL+  WM LAM+LL +TH D+  L+ +L L      E   W + Y++I+ KLLD+CN F S L  +N G + L+   
Subjt:  LLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEK--WLDEYLDISVKLLDICNDFSSELAQLNQGHLVLRCAL

Query:  HNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPK
        H L   S +      S+ D W ++I++  S+      +L R  ESL+  K
Subjt:  HNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPK

AT4G01360.1 unknown protein4.5e-5639.51Show/hide
Query:  NPFRAI---PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEYLDISVKLLDICND
        NPF+ +      ++LS +L+ LL  FE +L   +++L PK +NDI+S SWM  AM+ LCETHK ++TL+++L +PVSD +E ++  + D S+K  ++CN 
Subjt:  NPFRAI---PQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEYLDISVKLLDICND

Query:  FSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLD-------LPKVKNSSKGKVLMHVMYGVKVETLFI
        F+SE+  L  G+L+L+ A   L + S N           WNQH+ S+   +E+   +L RL ES+D         K K S++GKVL+ V+YGVKV+TL+I
Subjt:  FSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLD-------LPKVKNSSKGKVLMHVMYGVKVETLFI

Query:  CSVFASSFSGSSKRLLPINVPATYR---WAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEAEKL
         SVF +SFSGSSK L  + +P       W QAF ELQ  +N  IK    S   TV+++L+AV+  VKKL++ +Q      L VE  +  V++L +  E +
Subjt:  CSVFASSFSGSSKRLLPINVPATYR---WAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEAEKL

Query:  TQGVDHLTKRVDEFFHIVLSGRDALLSNL
        ++    L+K       +V+S RDALL +L
Subjt:  TQGVDHLTKRVDEFFHIVLSGRDALLSNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCGACCACAAGAGCCGCACCGGCCATTCTTTCCTTTTGGGAATCCTTTCCGTGCAATACCTCAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTGTTAGCCAC
TTTTGAGGATTCCTTGGCAGAGAGGCTGAAAAAACTCACTCCAAAATCAGAGAATGACATACTTAGCTTCTCGTGGATGGAATTAGCAATGAAGCTACTGTGTGAAACTC
ACAAAGATGTAAAAACCCTTCTAGAAGAGCTTGGGCTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCTGGACATCAGTGTGAAATTACTTGATATATGC
AATGATTTTAGCTCTGAGCTCGCACAGTTGAATCAGGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGCATCTACATCTTCCAACCAGTTTGTTCATGCCCGTTC
TTCATTCGATGGATGGAATCAACATATTAGTTCCCGAACCTCCAGAGTTGAGAGCCATTCTCCCATCTTGGACCGTCTTAAGGAATCACTTGATCTTCCAAAGGTTAAGA
ACTCATCCAAAGGCAAGGTTTTGATGCATGTGATGTACGGAGTGAAGGTGGAGACTCTGTTTATTTGCAGTGTTTTTGCTTCTTCCTTCTCAGGTTCTTCCAAAAGGTTG
TTACCCATCAATGTTCCGGCTACATATAGATGGGCACAAGCTTTTACTGAATTACAGAAAAATGTAAATATGGTAATTAAAAAAAATTGTTCCAGTGGAAGATGTACTGT
ATTGAGAGAGCTTGATGCAGTTGATGAGAGTGTTAAAAAATTGCATTCCATGATTCAAGGAAATATGGATGGCAGCTTGAAAGTAGAAGAGTTCCAGAATGTGGTTGTAG
ATTTGAGGAAGGAGGCAGAAAAGCTTACACAAGGTGTTGATCATCTTACTAAACGAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCAAAT
CTTAGAGCAAGTGAGACCGTATTTGATCAAGGAATAGGGGGGATGTCTACAAGGCAACTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCGACCACAAGAGCCGCACCGGCCATTCTTTCCTTTTGGGAATCCTTTCCGTGCAATACCTCAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTGTTAGCCAC
TTTTGAGGATTCCTTGGCAGAGAGGCTGAAAAAACTCACTCCAAAATCAGAGAATGACATACTTAGCTTCTCGTGGATGGAATTAGCAATGAAGCTACTGTGTGAAACTC
ACAAAGATGTAAAAACCCTTCTAGAAGAGCTTGGGCTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCTGGACATCAGTGTGAAATTACTTGATATATGC
AATGATTTTAGCTCTGAGCTCGCACAGTTGAATCAGGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGCATCTACATCTTCCAACCAGTTTGTTCATGCCCGTTC
TTCATTCGATGGATGGAATCAACATATTAGTTCCCGAACCTCCAGAGTTGAGAGCCATTCTCCCATCTTGGACCGTCTTAAGGAATCACTTGATCTTCCAAAGGTTAAGA
ACTCATCCAAAGGCAAGGTTTTGATGCATGTGATGTACGGAGTGAAGGTGGAGACTCTGTTTATTTGCAGTGTTTTTGCTTCTTCCTTCTCAGGTTCTTCCAAAAGGTTG
TTACCCATCAATGTTCCGGCTACATATAGATGGGCACAAGCTTTTACTGAATTACAGAAAAATGTAAATATGGTAATTAAAAAAAATTGTTCCAGTGGAAGATGTACTGT
ATTGAGAGAGCTTGATGCAGTTGATGAGAGTGTTAAAAAATTGCATTCCATGATTCAAGGAAATATGGATGGCAGCTTGAAAGTAGAAGAGTTCCAGAATGTGGTTGTAG
ATTTGAGGAAGGAGGCAGAAAAGCTTACACAAGGTGTTGATCATCTTACTAAACGAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCAAAT
CTTAGAGCAAGTGAGACCGTATTTGATCAAGGAATAGGGGGGATGTCTACAAGGCAACTGTGA
Protein sequenceShow/hide protein sequence
MSRPQEPHRPFFPFGNPFRAIPQGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELAMKLLCETHKDVKTLLEELGLPVSDWDEKWLDEYLDISVKLLDIC
NDFSSELAQLNQGHLVLRCALHNLASTSSNQFVHARSSFDGWNQHISSRTSRVESHSPILDRLKESLDLPKVKNSSKGKVLMHVMYGVKVETLFICSVFASSFSGSSKRL
LPINVPATYRWAQAFTELQKNVNMVIKKNCSSGRCTVLRELDAVDESVKKLHSMIQGNMDGSLKVEEFQNVVVDLRKEAEKLTQGVDHLTKRVDEFFHIVLSGRDALLSN
LRASETVFDQGIGGMSTRQL