; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g25850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g25850
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr6:19489024..19494982
RNA-Seq ExpressionMoc06g25850
SyntenyMoc06g25850
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]4.7e-18768.57Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVFN
        MAFRRNT+AHNYEDPN RGE AAD NVPP VPG               +V LLAEALQ+LLDNANGAGGAQ QQP R QI QEEVQFI DFKRFGPPVFN
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVFN

Query:  G-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYE
        G                             VRGAVFML+GEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+TVRNEKRAEFLRLTQ SL VAQYE
Subjt:  G-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYE

Query:  RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSP
        RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSRGHQ   QRQT  P
Subjt:  RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSP

Query:  VCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSF
         CPSCK++HAGPCW GKRICYRCQKEGHFAREC MTGSNTQALGQRIPATATTQ                        GTVLVL                
Subjt:  VCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSF

Query:  IASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKA
                                   SVLVTSQVVKGGQLSFDGQ ++V LIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKG+K 
Subjt:  IASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKA

Query:  GVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
        GV RVVSALKASHLLQRGVWAYLASV+DA KVV SIEAVRVVN+
Subjt:  GVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]5.3e-23178.47Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP
        MAFRRNTRAHNYEDPN RGE AADPNV P VPGGV PPVPQAAPQGV    PQVALLAEALQ+LL NANGAGGAQ QQPRR QI Q+EVQFI DFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP

Query:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV
        PVFNG                             VRGAVFMLRGEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+  RNEKR EFLRLTQGSLTV
Subjt:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ
        AQYERKFTELSRFG QY+PTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     +SGVKRKFASFS+SQ SRGHQ   QRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ

Query:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T  PVCPSCK++HA PCW GK+IC++CQKEGHF REC MTGSNTQAL Q+ P    TQGGT  ARVFALTRGDVEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELES GF L+VSTPSGSVLVTSQVVKGGQLSF GQT+EV LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
        GVKAGV RVVSALKAS+LLQRGVWAYLASVVDARKVV SIE VRVVN+
Subjt:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.3e-1976.25Show/hide
Query:  VVDA-RKVVSSIEAVRVVNDPMHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS
        V DA  +  +S+ ++      MHSELE  EVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS
Subjt:  VVDA-RKVVSSIEAVRVVNDPMHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.3e-22577.92Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP
        MAFRRNTRAHNY+DPNPRGE AADPNVP  VPG VAPPVPQAAPQGV    PQVALLAEALQ+LLDNANGAGGAQ QQPRR QI Q+EVQFI DFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP

Query:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV
        PVFNG                             VRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFP+TVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ
        AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLR EIKGLLV+KEPTTYAAA+                     SSGVKRKFA FSSSQ SRGHQ  VQRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ

Query:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T  PVCPSCK++HAGPCW GKRIC+RCQK                      PA A  QGGT RARVFALTRGDVEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELESLGFLL+VSTPSGSVLV SQVVKGGQLSFDGQT EVKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFRLPSGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
         VK GV RVVSALKA++LLQRG WAYLASVVDARKVV SIEAVRVVN+
Subjt:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]5.9e-20676.82Show/hide
Query:  TMAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVF
        TMAFRRNTRAHNYEDPNPRGE AADPNVP AVPGGVAP VPQAAPQGVPQ               NGAGGAQ QQPRR Q  QEEVQFI DFKRFGPPVF
Subjt:  TMAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVF

Query:  NGVRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEY------YFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKID
        NGV               E   AAE+       W R  + LY Y      +      NEKRAEFLRLTQGSLTVAQYERKFTELSRF MQYIP EQLKID
Subjt:  NGVRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEY------YFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKID

Query:  KFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAGPCWAGKRICYR
        KFIDGL REIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSRGHQ  VQRQT  PVCPSCK+SH GPCW GK ICYR
Subjt:  KFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAGPCWAGKRICYR

Query:  CQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLTV
        CQKEGHFARECPMTG NTQ LGQRIP T   QGGTHRARVFALTRGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGFLL+V
Subjt:  CQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLTV

Query:  STPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKASHLLQRGVWAY
        STPSGSVLVTSQ+VKGGQLSFDGQT+EVKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFRLPS QNFTFKGVKA V RVVSALKASH LQRG WAY
Subjt:  STPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKASHLLQRGVWAY

Query:  LASVVDARKVVSSIEAVRVVND
        LASVVDARKVV SIEAVRVVN+
Subjt:  LASVVDARKVVSSIEAVRVVND

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]5.5e-23683.87Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEV--------QFI
        MAFRRNTRAHNYEDPNPRGE AADPNVPPAVPGGVAPP PQAA QGV    PQVALLAEALQ+LLDNANGAGGAQ QQPR  QI QEEV        +++
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEV--------QFI

Query:  TDFKRFGPPVFNG------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGM
         + +     V+ G      VRGAVFMLRGEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+TVRNEKR EFLRLTQGSLTVA+YERKFTELSRFGM
Subjt:  TDFKRFGPPVFNG------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGM

Query:  QYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAG
        QYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSR HQ  VQRQT  PVCPSCK+SHAG
Subjt:  QYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAG

Query:  PCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADL
        PCW GKRICYRCQKEGHFARECPMTGSNTQALGQRIPATA  QGGTHRARVFALTRGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADL
Subjt:  PCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADL

Query:  ELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKA
        ELESLGFLL+VSTPSGSVLVTSQVVKGGQLSFDGQT+EVKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFRLPSGQNFTFKGVKAGV RVV ALKA
Subjt:  ELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKA

Query:  SHLLQRGVWAYLASVVDARKVVSSIEA
        SHLLQRG WAYLASVVDARKVV SIEA
Subjt:  SHLLQRGVWAYLASVVDARKVVSSIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DNV8 uncharacterized protein LOC1110229252.3e-18768.57Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVFN
        MAFRRNT+AHNYEDPN RGE AAD NVPP VPG               +V LLAEALQ+LLDNANGAGGAQ QQP R QI QEEVQFI DFKRFGPPVFN
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVFN

Query:  G-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYE
        G                             VRGAVFML+GEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+TVRNEKRAEFLRLTQ SL VAQYE
Subjt:  G-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYE

Query:  RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSP
        RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSRGHQ   QRQT  P
Subjt:  RKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSP

Query:  VCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSF
         CPSCK++HAGPCW GKRICYRCQKEGHFAREC MTGSNTQALGQRIPATATTQ                        GTVLVL                
Subjt:  VCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSF

Query:  IASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKA
                                   SVLVTSQVVKGGQLSFDGQ ++V LIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKG+K 
Subjt:  IASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKA

Query:  GVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
        GV RVVSALKASHLLQRGVWAYLASV+DA KVV SIEAVRVVN+
Subjt:  GVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

A0A6J1DQB9 Reverse transcriptase2.6e-23178.47Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP
        MAFRRNTRAHNYEDPN RGE AADPNV P VPGGV PPVPQAAPQGV    PQVALLAEALQ+LL NANGAGGAQ QQPRR QI Q+EVQFI DFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP

Query:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV
        PVFNG                             VRGAVFMLRGEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+  RNEKR EFLRLTQGSLTV
Subjt:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ
        AQYERKFTELSRFG QY+PTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     +SGVKRKFASFS+SQ SRGHQ   QRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ

Query:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T  PVCPSCK++HA PCW GK+IC++CQKEGHF REC MTGSNTQAL Q+ P    TQGGT  ARVFALTRGDVEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELES GF L+VSTPSGSVLVTSQVVKGGQLSF GQT+EV LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF L SGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
        GVKAGV RVVSALKAS+LLQRGVWAYLASVVDARKVV SIE VRVVN+
Subjt:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

A0A6J1DQB9 Reverse transcriptase6.3e-2076.25Show/hide
Query:  VVDA-RKVVSSIEAVRVVNDPMHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS
        V DA  +  +S+ ++      MHSELE  EVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS
Subjt:  VVDA-RKVVSSIEAVRVVNDPMHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLS

A0A6J1DQB9 Reverse transcriptase1.6e-22577.92Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP
        MAFRRNTRAHNY+DPNPRGE AADPNVP  VPG VAPPVPQAAPQGV    PQVALLAEALQ+LLDNANGAGGAQ QQPRR QI Q+EVQFI DFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGP

Query:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV
        PVFNG                             VRGAVFMLRGEAVNWWESVAAAEDH N PVTWARFKDLLYEYYFP+TVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNG-----------------------------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ
        AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLR EIKGLLV+KEPTTYAAA+                     SSGVKRKFA FSSSQ SRGHQ  VQRQ
Subjt:  AQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQ

Query:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T  PVCPSCK++HAGPCW GKRIC+RCQK                      PA A  QGGT RARVFALTRGDVEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK
        SHSFIASTFVRHADLELESLGFLL+VSTPSGSVLV SQVVKGGQLSFDGQT EVKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFRLPSGQNFTFK
Subjt:  SHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFK

Query:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND
         VK GV RVVSALKA++LLQRG WAYLASVVDARKVV SIEAVRVVN+
Subjt:  GVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVND

A0A6J1DTE5 uncharacterized protein LOC1110238212.9e-20676.82Show/hide
Query:  TMAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVF
        TMAFRRNTRAHNYEDPNPRGE AADPNVP AVPGGVAP VPQAAPQGVPQ               NGAGGAQ QQPRR Q  QEEVQFI DFKRFGPPVF
Subjt:  TMAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVF

Query:  NGVRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEY------YFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKID
        NGV               E   AAE+       W R  + LY Y      +      NEKRAEFLRLTQGSLTVAQYERKFTELSRF MQYIP EQLKID
Subjt:  NGVRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEY------YFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKID

Query:  KFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAGPCWAGKRICYR
        KFIDGL REIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSRGHQ  VQRQT  PVCPSCK+SH GPCW GK ICYR
Subjt:  KFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAGPCWAGKRICYR

Query:  CQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLTV
        CQKEGHFARECPMTG NTQ LGQRIP T   QGGTHRARVFALTRGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGFLL+V
Subjt:  CQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLTV

Query:  STPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKASHLLQRGVWAY
        STPSGSVLVTSQ+VKGGQLSFDGQT+EVKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFRLPS QNFTFKGVKA V RVVSALKASH LQRG WAY
Subjt:  STPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKASHLLQRGVWAY

Query:  LASVVDARKVVSSIEAVRVVND
        LASVVDARKVV SIEAVRVVN+
Subjt:  LASVVDARKVVSSIEAVRVVND

A0A6J1DWP4 uncharacterized protein LOC1110252152.7e-23683.87Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEV--------QFI
        MAFRRNTRAHNYEDPNPRGE AADPNVPPAVPGGVAPP PQAA QGV    PQVALLAEALQ+LLDNANGAGGAQ QQPR  QI QEEV        +++
Subjt:  MAFRRNTRAHNYEDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGV----PQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEV--------QFI

Query:  TDFKRFGPPVFNG------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGM
         + +     V+ G      VRGAVFMLRGEAVNWWESVAAAEDHAN PVTWARFKDLLYEYYFP+TVRNEKR EFLRLTQGSLTVA+YERKFTELSRFGM
Subjt:  TDFKRFGPPVFNG------VRGAVFMLRGEAVNWWESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGM

Query:  QYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAG
        QYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAV                     SSGVKRKFASFSSSQPSR HQ  VQRQT  PVCPSCK+SHAG
Subjt:  QYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAV---------------------SSGVKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAG

Query:  PCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADL
        PCW GKRICYRCQKEGHFARECPMTGSNTQALGQRIPATA  QGGTHRARVFALTRGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADL
Subjt:  PCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADL

Query:  ELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKA
        ELESLGFLL+VSTPSGSVLVTSQVVKGGQLSFDGQT+EVKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFRLPSGQNFTFKGVKAGV RVV ALKA
Subjt:  ELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFTFKGVKAGVLRVVSALKA

Query:  SHLLQRGVWAYLASVVDARKVVSSIEA
        SHLLQRG WAYLASVVDARKVV SIEA
Subjt:  SHLLQRGVWAYLASVVDARKVVSSIEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGTGTCAGGGCTTCGGGTATAAATGGTTGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATA
TTGGATAAAGATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCAATGCGAGGAGTCTTTCGAAAGGAGAACTATTGGGGCCTTGGACTGTCCTAGAATCT
AGGTCGTCAGTGTACGGTCTTCGTCAGTCTCCTCGTCATCACCAGCTACGTTTCCGAGTTCTGGATACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTAC
GAGGATCCGAACCCTAGGGGTGAGGAAGCAGCGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGTAGCACCCCCGGTCCCGCAGGCAGCACCTCAGGGAGTT
CCCCAGGTGGCGTTACTAGCTGAGGCATTGCAATTATTGCTAGATAATGCGAATGGAGCCGGTGGGGCTCAGGCGCAGCAGCCTCGCCGGACACAGATTCAACAA
GAGGAGGTCCAGTTTATCACGGATTTCAAACGCTTCGGACCACCCGTTTTCAACGGGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAACTGGTGG
GAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTACTATTTCCCCATGACTGTCAGGAATGAA
AAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACT
GAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAGCCAACTACTTATGCAGCAGCAGTCAGCTCGGGG
GTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCGAGGGGACACCAGCAGCTTGTGCAAAGGCAGACTGTTTCTCCGGTGTGCCCTTCTTGTAAGAGG
AGCCATGCTGGACCGTGTTGGGCGGGAAAGAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTTGCAAGGGAGTGTCCGATGACCGGCTCGAATACCCAAGCT
TTAGGCCAGAGGATCCCTGCGACGGCGACAACTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTTACCAGAGGGGATGTTGAGCATGCCGAGGCGGTGGTC
ACAGGGACTGTTTTAGTACTCAGTATGCCTGCTTATGCTTTATTTGACTCGGGGTCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGGCATGCGGACCTAGAG
CTAGAATCGTTAGGCTTTTTGTTGACGGTATCCACTCCGTCAGGATCTGTGTTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCTTTCGATGGTCAGACC
GTGGAGGTGAAGTTAATTCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATCGATTGTTCGAAGAAGGAA
GTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTCAAGGCCGGGGTCCTGAGGGTGGTGTCGGCATTGAAGGCCAGCCATCTTCTCCAGCGT
GGTGTCTGGGCCTATTTGGCTAGCGTTGTGGATGCAAGGAAGGTTGTGTCGAGCATTGAGGCGGTTCGTGTAGTTAATGATCCAATGCACAGCGAGTTGGAACGC
TTGGAGGTAGAGTTGACGGTGGATGATGTCTCCGCGCTGTTGGCTCGACTCTCGGTGGAACCCAGTCTGAGGCAGAGGATCATTGTTGCCCAAAAGGAAGACCCT
AGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTCTCGGAGGTACCCGTCAAAGTTTTGGCAAGAGAAACCAAGTTGTTGAGGAACCGAACG
ATTCGCTTGGTTAAGGTTTTATGGAGAAACCACCAAGTGGAAGAAGCTACTTGGGAGCGGGAGGACAATATCAAGGCGAGGTACCCTGAACTGCTAGAACAGTCA
ACTTTCGGGGACGAAAGTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGTGTCAGGGCTTCGGGTATAAATGGTTGAGGGCTGATACGTCACTAATTGGGTATCGAGGCCTCGGGTATAAATGGTCGGGGGTCGATACGCCAATA
TTGGATAAAGATGAGCGTCGAGGCCTCGGGTATAAATGGTCGGGGGTCAATGCGAGGAGTCTTTCGAAAGGAGAACTATTGGGGCCTTGGACTGTCCTAGAATCT
AGGTCGTCAGTGTACGGTCTTCGTCAGTCTCCTCGTCATCACCAGCTACGTTTCCGAGTTCTGGATACAATGGCTTTTCGACGGAACACGAGAGCTCACAACTAC
GAGGATCCGAACCCTAGGGGTGAGGAAGCAGCGGATCCAAATGTTCCCCCGGCAGTTCCTGGAGGGGTAGCACCCCCGGTCCCGCAGGCAGCACCTCAGGGAGTT
CCCCAGGTGGCGTTACTAGCTGAGGCATTGCAATTATTGCTAGATAATGCGAATGGAGCCGGTGGGGCTCAGGCGCAGCAGCCTCGCCGGACACAGATTCAACAA
GAGGAGGTCCAGTTTATCACGGATTTCAAACGCTTCGGACCACCCGTTTTCAACGGGGTCCGGGGAGCAGTGTTTATGCTTCGGGGAGAAGCAGTAAACTGGTGG
GAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGCACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTACTATTTCCCCATGACTGTCAGGAATGAA
AAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACT
GAACAATTAAAGATTGACAAGTTCATTGACGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAGCCAACTACTTATGCAGCAGCAGTCAGCTCGGGG
GTCAAGAGGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCGAGGGGACACCAGCAGCTTGTGCAAAGGCAGACTGTTTCTCCGGTGTGCCCTTCTTGTAAGAGG
AGCCATGCTGGACCGTGTTGGGCGGGAAAGAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTTGCAAGGGAGTGTCCGATGACCGGCTCGAATACCCAAGCT
TTAGGCCAGAGGATCCCTGCGACGGCGACAACTCAAGGTGGAACCCATAGGGCGCGCGTCTTCGCTCTTACCAGAGGGGATGTTGAGCATGCCGAGGCGGTGGTC
ACAGGGACTGTTTTAGTACTCAGTATGCCTGCTTATGCTTTATTTGACTCGGGGTCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGGCATGCGGACCTAGAG
CTAGAATCGTTAGGCTTTTTGTTGACGGTATCCACTCCGTCAGGATCTGTGTTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCTTTCGATGGTCAGACC
GTGGAGGTGAAGTTAATTCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCTAACCGGGCTAATATCGATTGTTCGAAGAAGGAA
GTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTACCTTTAAAGGAGTCAAGGCCGGGGTCCTGAGGGTGGTGTCGGCATTGAAGGCCAGCCATCTTCTCCAGCGT
GGTGTCTGGGCCTATTTGGCTAGCGTTGTGGATGCAAGGAAGGTTGTGTCGAGCATTGAGGCGGTTCGTGTAGTTAATGATCCAATGCACAGCGAGTTGGAACGC
TTGGAGGTAGAGTTGACGGTGGATGATGTCTCCGCGCTGTTGGCTCGACTCTCGGTGGAACCCAGTCTGAGGCAGAGGATCATTGTTGCCCAAAAGGAAGACCCT
AGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTCTCGGAGGTACCCGTCAAAGTTTTGGCAAGAGAAACCAAGTTGTTGAGGAACCGAACG
ATTCGCTTGGTTAAGGTTTTATGGAGAAACCACCAAGTGGAAGAAGCTACTTGGGAGCGGGAGGACAATATCAAGGCGAGGTACCCTGAACTGCTAGAACAGTCA
ACTTTCGGGGACGAAAGTTTTTGA
Protein sequenceShow/hide protein sequence
MGECQGFGYKWLRADTSLIGYRGLGYKWSGVDTPILDKDERRGLGYKWSGVNARSLSKGELLGPWTVLESRSSVYGLRQSPRHHQLRFRVLDTMAFRRNTRAHNY
EDPNPRGEEAADPNVPPAVPGGVAPPVPQAAPQGVPQVALLAEALQLLLDNANGAGGAQAQQPRRTQIQQEEVQFITDFKRFGPPVFNGVRGAVFMLRGEAVNWW
ESVAAAEDHANAPVTWARFKDLLYEYYFPMTVRNEKRAEFLRLTQGSLTVAQYERKFTELSRFGMQYIPTEQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVSSG
VKRKFASFSSSQPSRGHQQLVQRQTVSPVCPSCKRSHAGPCWAGKRICYRCQKEGHFARECPMTGSNTQALGQRIPATATTQGGTHRARVFALTRGDVEHAEAVV
TGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLTVSTPSGSVLVTSQVVKGGQLSFDGQTVEVKLIQLDMQDFDVILGMDWLAANRANIDCSKKE
VSFRLPSGQNFTFKGVKAGVLRVVSALKASHLLQRGVWAYLASVVDARKVVSSIEAVRVVNDPMHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDP
SLAKGFSMVGHGDFTLSEVPVKVLARETKLLRNRTIRLVKVLWRNHQVEEATWEREDNIKARYPELLEQSTFGDESF