; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g19060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g19060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr9:14849911..14857291
RNA-Seq ExpressionMoc09g19060
SyntenyMoc09g19060
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155341.1 uncharacterized protein LOC111022474 [Momordica charantia]8.9e-18373.35Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPR                        APQGVPQ+NPQVALLAEALQVLLDNAN AGGAQ QQPRRAQIQQEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSDEFKVRGA+FML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAVSSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPCWAGK
         Q+ERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGL++LKEPTTYAAAV   +                                            
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAVSSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPCWAGK

Query:  RICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLELESLG
         +  +C +E                  QRIPAT A QGGTHRAR+FALTRG+VEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHA+LELESLG
Subjt:  RICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLELESLG

Query:  FLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSG
        FL SVST SGSVL TSQVVKGGQLSFDGQ L VKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFR P  +    K  K+G
Subjt:  FLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSG

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.0e-21577Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN R                        APQGVPQ+NPQVALLAEALQVLL NAN AGGAQVQQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        AQ+ERKFTELSRFG QY+PTEQLKIDKFIDGLRREIKGL++LKEPTTYAAAV                     +SGVKRKFASFS+SQ +RGHQH  QRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKK+HA PCW GK+IC++CQKEGHF REC +TGSNTQAL Q+ P   A QGGT  ARVFALTRG+VEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFVRHA+LELES GF LSVSTPSGSVLVTSQVVKGGQLSF GQTL+V LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF     +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K+G    V++L
Subjt:  ESKSGRAASVASL

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.0e-2452.1Show/hide
Query:  QLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASLVTVCSPIHSELESLEVELTVDDVSALLARL
        Q+  D Q+LK    Q ++     +    WL   + + DC   E+S+   P K   + ++ S +AASVASLV  CS +HSELE  EVELTVDDVSALLARL
Subjt:  QLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASLVTVCSPIHSELESLEVELTVDDVSALLARL

Query:  SVEPSLRQRIIVAQKEDPSLAKGFSMVGHRDFTLS-------AAPLCFPAPSSLLRRSLVRATSRRF
        SVEPSLRQRIIVAQKEDPSLAKGFSMVGH DFTLS          LC P    L +  L  A +  F
Subjt:  SVEPSLRQRIIVAQKEDPSLAKGFSMVGHRDFTLS-------AAPLCFPAPSSLLRRSLVRATSRRF

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]2.4e-21277Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPR                        APQGVPQ+NPQVALLAEALQVLLDNAN AGGAQVQQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        AQ+ERKFTELSRFGMQYIPTEQLKIDKFIDGLR EIKGL+++KEPTTYAAA+                     SSGVKRKFA FSSSQ +RGHQH VQRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKK+HAGPCW GKRIC+RCQK                      PA  AAQGGT RARVFALTRG+VEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFVRHA+LELESLGFLLSVSTPSGSVLV SQVVKGGQLSFDGQT +VKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFR P  +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K G    V++L
Subjt:  ESKSGRAASVASL

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]1.5e-18271.98Show/hide
Query:  QTMAFRRNTRAHNYEDPNPR----APQGVPQMNP-QVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
        +TMAFRRNTRAHNYEDPNPR    A   VP   P  VA L           N AGGAQVQQPRRAQ  QEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
Subjt:  QTMAFRRNTRAHNYEDPNPR----APQGVPQMNP-QVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV

Query:  RELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQHERKFTELSRFGMQY
        RELEALYVYLGCSD+FKV+GAV                                            NEKRAEFLRLTQGSLTVAQ+ERKFTELSRF MQY
Subjt:  RELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQHERKFTELSRFGMQY

Query:  IPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPC
        IP EQLKIDKFIDGL REIKGL++LKEPTTYAAAV                     SSGVKRKFASFSSSQP+RGHQH VQRQTA PVCPSCKKSH GPC
Subjt:  IPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPC

Query:  WAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLEL
        W GK ICYRCQKEGHFARECP+TG NTQ LGQRIP T AAQGGTHRARVFALTRG+V HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHA+LEL
Subjt:  WAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLEL

Query:  ESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASL
        ESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQTL+VKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFR P ++    K  K+     V++L
Subjt:  ESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASL

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]4.2e-21779.34Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPR                        A QGVPQ+NPQVALLAEALQVLLDNAN AGGAQVQQPR AQI QEE            
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        A++ERKFTELSRFGMQYIPT+QLKIDKFIDGLRREIKGL++LKEPTTYAAAV                     SSGVKRKFASFSSSQP+R HQH VQRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKKSHAGPCW GKRICYRCQKEGHFARECP+TGSNTQALGQRIPAT AAQGGTHRARVFALTRG+VE+AEAVVT TVLVLSMPAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFV HA+LELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTL+VKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFR P  +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K+G    V +L
Subjt:  ESKSGRAASVASL

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase1.5e-21577Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN R                        APQGVPQ+NPQVALLAEALQVLL NAN AGGAQVQQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        AQ+ERKFTELSRFG QY+PTEQLKIDKFIDGLRREIKGL++LKEPTTYAAAV                     +SGVKRKFASFS+SQ +RGHQH  QRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKK+HA PCW GK+IC++CQKEGHF REC +TGSNTQAL Q+ P   A QGGT  ARVFALTRG+VEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFVRHA+LELES GF LSVSTPSGSVLVTSQVVKGGQLSF GQTL+V LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF     +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K+G    V++L
Subjt:  ESKSGRAASVASL

A0A6J1DQB9 Reverse transcriptase4.8e-2552.1Show/hide
Query:  QLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASLVTVCSPIHSELESLEVELTVDDVSALLARL
        Q+  D Q+LK    Q ++     +    WL   + + DC   E+S+   P K   + ++ S +AASVASLV  CS +HSELE  EVELTVDDVSALLARL
Subjt:  QLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASLVTVCSPIHSELESLEVELTVDDVSALLARL

Query:  SVEPSLRQRIIVAQKEDPSLAKGFSMVGHRDFTLS-------AAPLCFPAPSSLLRRSLVRATSRRF
        SVEPSLRQRIIVAQKEDPSLAKGFSMVGH DFTLS          LC P    L +  L  A +  F
Subjt:  SVEPSLRQRIIVAQKEDPSLAKGFSMVGHRDFTLS-------AAPLCFPAPSSLLRRSLVRATSRRF

A0A6J1DQB9 Reverse transcriptase1.2e-21277Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPR                        APQGVPQ+NPQVALLAEALQVLLDNAN AGGAQVQQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        AQ+ERKFTELSRFGMQYIPTEQLKIDKFIDGLR EIKGL+++KEPTTYAAA+                     SSGVKRKFA FSSSQ +RGHQH VQRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKK+HAGPCW GKRIC+RCQK                      PA  AAQGGT RARVFALTRG+VEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFVRHA+LELESLGFLLSVSTPSGSVLV SQVVKGGQLSFDGQT +VKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFR P  +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K G    V++L
Subjt:  ESKSGRAASVASL

A0A6J1DRF5 uncharacterized protein LOC1110224744.3e-18373.35Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPR                        APQGVPQ+NPQVALLAEALQVLLDNAN AGGAQ QQPRRAQIQQEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSDEFKVRGA+FML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAVSSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPCWAGK
         Q+ERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGL++LKEPTTYAAAV   +                                            
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAVSSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPCWAGK

Query:  RICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLELESLG
         +  +C +E                  QRIPAT A QGGTHRAR+FALTRG+VEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHA+LELESLG
Subjt:  RICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLELESLG

Query:  FLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSG
        FL SVST SGSVL TSQVVKGGQLSFDGQ L VKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFR P  +    K  K+G
Subjt:  FLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSG

A0A6J1DTE5 uncharacterized protein LOC1110238217.3e-18371.98Show/hide
Query:  QTMAFRRNTRAHNYEDPNPR----APQGVPQMNP-QVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
        +TMAFRRNTRAHNYEDPNPR    A   VP   P  VA L           N AGGAQVQQPRRAQ  QEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
Subjt:  QTMAFRRNTRAHNYEDPNPR----APQGVPQMNP-QVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTAAEEWV

Query:  RELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQHERKFTELSRFGMQY
        RELEALYVYLGCSD+FKV+GAV                                            NEKRAEFLRLTQGSLTVAQ+ERKFTELSRF MQY
Subjt:  RELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQHERKFTELSRFGMQY

Query:  IPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPC
        IP EQLKIDKFIDGL REIKGL++LKEPTTYAAAV                     SSGVKRKFASFSSSQP+RGHQH VQRQTA PVCPSCKKSH GPC
Subjt:  IPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQTASPVCPSCKKSHAGPC

Query:  WAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLEL
        W GK ICYRCQKEGHFARECP+TG NTQ LGQRIP T AAQGGTHRARVFALTRG+V HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHA+LEL
Subjt:  WAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHANLEL

Query:  ESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASL
        ESLGFLLSVSTPSGSVLVTSQ+VKGGQLSFDGQTL+VKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFR P ++    K  K+     V++L
Subjt:  ESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASL

A0A6J1DWP4 uncharacterized protein LOC1110252152.0e-21779.34Show/hide
Query:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPR                        A QGVPQ+NPQVALLAEALQVLLDNAN AGGAQVQQPR AQI QEE            
Subjt:  MAFRRNTRAHNYEDPNPR------------------------APQGVPQMNPQVALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ
        A++ERKFTELSRFGMQYIPT+QLKIDKFIDGLRREIKGL++LKEPTTYAAAV                     SSGVKRKFASFSSSQP+R HQH VQRQ
Subjt:  AQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAV---------------------SSGVKRKFASFSSSQPTRGHQHLVQRQ

Query:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS
        TA PVCPSCKKSHAGPCW GKRICYRCQKEGHFARECP+TGSNTQALGQRIPAT AAQGGTHRARVFALTRG+VE+AEAVVT TVLVLSMPAYALFDSGS
Subjt:  TASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK
        SHSFIASTFV HA+LELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTL+VKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFR P  +    K
Subjt:  SHSFIASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLK

Query:  ESKSGRAASVASL
          K+G    V +L
Subjt:  ESKSGRAASVASL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGGGTCGATACGAGGAGTTCTTTGGAGGGAAGACTATTGGGGCCTTGGGTACAAATGGTCAAGGGCCAATAAATGGTGAAGTCATCGGGGCCTCGGGTATAAA
TGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTTTGGGTATAAATGGTCAGGGGCCGACTCTACTCGGAAGGATAACCGTGGAAGGTCTTTAGTGTATGGTCCTTGTCAGT
CTCCTCGTCATCACCAGACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTACGAGGATCCGAATCCTAGGGCACCTCAGGGAGTTCCCCAAATGAATCCTCAGGTG
GCATTACTAGCTGAGGCATTGCAAGTATTGCTGGATAATGCGAATGAAGCCGGTGGAGCTCAGGTGCAGCAGCCTCGCCGGGCACAGATTCAACAAGAGGAGGTTCAGTT
TATCAGGGATTTCAAACGCTTTGGACCACCCGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTCCGGGAGTTGGAAGCCCTTTATGTGTATTTGG
GATGCTCCGACGAATTCAAGGTCCGGGGAGCAGTGTTTATGCTTTGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAATGTACCCGTC
ACATGGGCGAGGTTTAAGGACCTACTCTATGAATACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGC
CCAACACGAGAGGAAATTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGGGAGATCAAGG
GGCTCATTATTCTCAAGGAACCAACTACTTATGCAGCGGCGGTCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCACAAGAGGACACCAGCAT
CTTGTGCAAAGGCAGACTGCTTCTCCAGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGACCATGTTGGGCGGGAAAAAGAATTTGTTACAGGTGCCAGAAGGAAGGACA
TTTCGCAAGGGAGTGTCCGATTACCGGCTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACAGTGGCAGCTCAAGGTGGGACCCACAGGGCGCGCGTCTTCGCTC
TCACCAGGGGAAATGTTGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTTATT
GCATCTACCTTTGTTCGACATGCGAACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCAGTATCTACACCGTCAGGTTCTGTGTTGGTCACTAGTCAAGTGGTGAAAGG
AGGCCAGCTCTCTTTCGATGGTCAGACCTTGAAGGTGAAATTAATCCAACTGGACATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACCGGGCTA
ATATTGATTGCTCGAAGAAGGAAGTTAGCTTTCGCAGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTCAAAGTCGGGAAGGGCGGCAAGTGTGGCATCTCTTGTAACG
GTCTGCAGTCCAATACACAGTGAGTTGGAAAGCTTGGAGGTGGAGCTAACGGTGGATGATGTCTCCGCGTTGTTGGCTCGACTCTCAGTGGAACCTAGCCTGAGACAGAG
GATCATTGTTGCCCAAAAGGAAGACCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATAGGGATTTCACTCTCTCGGCCGCCCCCCTGTGTTTTCCGGCCCCATCCT
CCCTCCTCCGGCGTTCTCTTGTGCGTGCGACGTCGCGGCGGTTCTGCACACGGGTTCCACGAGCAGCGACGGCGCAACCTTCCCTTGGCGGCGTTCGAGCAGCGGCGGCC
GGGCGCGACTTCCCTTGGCGGCGTTCAAGCAGGGCGGTCGGGCGCGACCTTGCGGCGTTCGAGCAGCGGCGGCCGGGCGCGACTTTCCCTTGGCGGCGTCCGAGCAGCGG
CGGCCGGCGACTCCCGGCGCTCCAGCGGCAGCGCTTCGTAACGGGTACAGCGACGGCACGGCCCCCTCCTCACAACGGCGCACGGCGAACGGCGGTACAGCAGCGGAATG
GCGGTAGAATTTGTACATTACAGCGGTGTTTAGGACGTTTGGCGGTGCCCCACATCCGTTCGAAGATCGATTTAGGCTACCCACACCTTGGCGAGTTAGATCTAGGTGAC
CCACATATATACGAAGTCGGTTTTGGTTGTCGATATACTTGGAATGGTGTTGGATATTTGTCGATGAACAATGGTGAGTTTTGTGCTTTGATGAACTGTGAGCATTGCTG
TGGTAATGTTTGTTGCCTGGATATGCATGTTTTGTTCGGCGTGTTTAGTGAGCTATTATCCTTGATGGTCGAGGGTCGATACGAGGAGTTCTTTGGAGGGAAGACTATTG
GGGCCTTGGGTACAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAATGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTTTGGGTATAAATGG
TCCGGGGCCGACTCTACTCGGAAGGATAACCGTGGAAGGTTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGGGTCGATACGAGGAGTTCTTTGGAGGGAAGACTATTGGGGCCTTGGGTACAAATGGTCAAGGGCCAATAAATGGTGAAGTCATCGGGGCCTCGGGTATAAA
TGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTTTGGGTATAAATGGTCAGGGGCCGACTCTACTCGGAAGGATAACCGTGGAAGGTCTTTAGTGTATGGTCCTTGTCAGT
CTCCTCGTCATCACCAGACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTACGAGGATCCGAATCCTAGGGCACCTCAGGGAGTTCCCCAAATGAATCCTCAGGTG
GCATTACTAGCTGAGGCATTGCAAGTATTGCTGGATAATGCGAATGAAGCCGGTGGAGCTCAGGTGCAGCAGCCTCGCCGGGCACAGATTCAACAAGAGGAGGTTCAGTT
TATCAGGGATTTCAAACGCTTTGGACCACCCGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTCCGGGAGTTGGAAGCCCTTTATGTGTATTTGG
GATGCTCCGACGAATTCAAGGTCCGGGGAGCAGTGTTTATGCTTTGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAATGTACCCGTC
ACATGGGCGAGGTTTAAGGACCTACTCTATGAATACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGC
CCAACACGAGAGGAAATTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGGGAGATCAAGG
GGCTCATTATTCTCAAGGAACCAACTACTTATGCAGCGGCGGTCAGCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCACAAGAGGACACCAGCAT
CTTGTGCAAAGGCAGACTGCTTCTCCAGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGACCATGTTGGGCGGGAAAAAGAATTTGTTACAGGTGCCAGAAGGAAGGACA
TTTCGCAAGGGAGTGTCCGATTACCGGCTCGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACAGTGGCAGCTCAAGGTGGGACCCACAGGGCGCGCGTCTTCGCTC
TCACCAGGGGAAATGTTGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTTATT
GCATCTACCTTTGTTCGACATGCGAACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCAGTATCTACACCGTCAGGTTCTGTGTTGGTCACTAGTCAAGTGGTGAAAGG
AGGCCAGCTCTCTTTCGATGGTCAGACCTTGAAGGTGAAATTAATCCAACTGGACATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACCGGGCTA
ATATTGATTGCTCGAAGAAGGAAGTTAGCTTTCGCAGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTCAAAGTCGGGAAGGGCGGCAAGTGTGGCATCTCTTGTAACG
GTCTGCAGTCCAATACACAGTGAGTTGGAAAGCTTGGAGGTGGAGCTAACGGTGGATGATGTCTCCGCGTTGTTGGCTCGACTCTCAGTGGAACCTAGCCTGAGACAGAG
GATCATTGTTGCCCAAAAGGAAGACCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATAGGGATTTCACTCTCTCGGCCGCCCCCCTGTGTTTTCCGGCCCCATCCT
CCCTCCTCCGGCGTTCTCTTGTGCGTGCGACGTCGCGGCGGTTCTGCACACGGGTTCCACGAGCAGCGACGGCGCAACCTTCCCTTGGCGGCGTTCGAGCAGCGGCGGCC
GGGCGCGACTTCCCTTGGCGGCGTTCAAGCAGGGCGGTCGGGCGCGACCTTGCGGCGTTCGAGCAGCGGCGGCCGGGCGCGACTTTCCCTTGGCGGCGTCCGAGCAGCGG
CGGCCGGCGACTCCCGGCGCTCCAGCGGCAGCGCTTCGTAACGGGTACAGCGACGGCACGGCCCCCTCCTCACAACGGCGCACGGCGAACGGCGGTACAGCAGCGGAATG
GCGGTAGAATTTGTACATTACAGCGGTGTTTAGGACGTTTGGCGGTGCCCCACATCCGTTCGAAGATCGATTTAGGCTACCCACACCTTGGCGAGTTAGATCTAGGTGAC
CCACATATATACGAAGTCGGTTTTGGTTGTCGATATACTTGGAATGGTGTTGGATATTTGTCGATGAACAATGGTGAGTTTTGTGCTTTGATGAACTGTGAGCATTGCTG
TGGTAATGTTTGTTGCCTGGATATGCATGTTTTGTTCGGCGTGTTTAGTGAGCTATTATCCTTGATGGTCGAGGGTCGATACGAGGAGTTCTTTGGAGGGAAGACTATTG
GGGCCTTGGGTACAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGGTATAAATGGTCGAGGGCCGATGCAGCAGGGTCGGGGCTTTGGGTATAAATGG
TCCGGGGCCGACTCTACTCGGAAGGATAACCGTGGAAGGTTAGGATAG
Protein sequenceShow/hide protein sequence
MVEGRYEEFFGGKTIGALGTNGQGPINGEVIGASGINGRGPMQQGRGFGYKWSGADSTRKDNRGRSLVYGPCQSPRHHQTMAFRRNTRAHNYEDPNPRAPQGVPQMNPQV
ALLAEALQVLLDNANEAGGAQVQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDEFKVRGAVFMLWGEAVNWWESVAAAEDHANVPV
TWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQHERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLIILKEPTTYAAAVSSGVKRKFASFSSSQPTRGHQH
LVQRQTASPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECPITGSNTQALGQRIPATVAAQGGTHRARVFALTRGNVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFI
ASTFVRHANLELESLGFLLSVSTPSGSVLVTSQVVKGGQLSFDGQTLKVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRSPPDKTLPLKESKSGRAASVASLVT
VCSPIHSELESLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHRDFTLSAAPLCFPAPSSLLRRSLVRATSRRFCTRVPRAATAQPSLGGVRAAAA
GRDFPWRRSSRAVGRDLAAFEQRRPGATFPWRRPSSGGRRLPALQRQRFVTGTATARPPPHNGARRTAVQQRNGGRICTLQRCLGRLAVPHIRSKIDLGYPHLGELDLGD
PHIYEVGFGCRYTWNGVGYLSMNNGEFCALMNCEHCCGNVCCLDMHVLFGVFSELLSLMVEGRYEEFFGGKTIGALGTNGQGPIDGEVIGASGINGRGPMQQGRGFGYKW
SGADSTRKDNRGRLG