; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:3130053..3139284
RNA-Seq ExpressionMoc01g04760
SyntenyMoc01g04760
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]1.7e-23377.01Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNT+AHNYEDPN RGEGAADLNVPP VP                    +V LLAEALQVLLDNANGAGGAQ QQP R QI QEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FML+GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL V
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFSS+QPSRGHQ   QRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PP CPSCKK+HAGPCW GKRICYRCQKEGHFARECLMTG NTQALGQRIPATA TQ                        GTVLVL            
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
                                       SVL TSQVVKGGQLSFDGQAL V LIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        G+K GVPRVVS LKASHL QRG WAYL SV+DA KVVPSIEAVRVVNEFTDVFPEDL GLP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.2e-27987.34Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN RGE AAD NV P VP GV PPVPQ APQGVPQVNPQVALLAEALQVLL NANGAGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFG QY+PTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GSNSGVKRKFASFS++Q SRGHQ   QRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKK+HA PCW GK+IC++CQKEGHF RECLMTG NTQAL Q+ P   ATQGGT  AR+FALTRGDVEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFVRHADLELES GF  SVST SGSVL TSQVVKGGQLSF GQ L+V LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF L SGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        GVKAGVPRVVS LKAS+L QRG WAYL SVVDARKVVPSIE VRVVNEFTDVFPEDLPGLP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.2e-1682.09Show/hide
Query:  LPSRAASVASLVAVCSPIHSELERLEVELTVDDVSALLARLLVEPNLRQRIIVAQKGDPGLAKGFSM
        L  +AASVASLVA CS +HSELE  EVELTVDDVSALLARL VEP+LRQRIIVAQK DP LAKGFSM
Subjt:  LPSRAASVASLVAVCSPIHSELERLEVELTVDDVSALLARLLVEPNLRQRIIVAQKGDPGLAKGFSM

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]7.7e-27187Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPRGEGAAD NVP  VP  VAPPVPQ APQGVPQVNPQVALLAEALQVLLDNANGAGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTATEEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LR EIKGLLV+KEPTTYAAA+RCALVMDKCLEEPQSQQVMGS+SGVKRKFA FSS+Q SRGHQ  VQRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKK+HAGPCW GKRIC+RCQK                      PA AA QGGT RAR+FALTRGDVEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFVRHADLELESLGFL SVST SGSVL  SQVVKGGQLSFDGQ  +VKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFP
         VK GVPRVVS LKA++L QRGAWAYL SVVDARKVVPSIEAVRVVNEFTDVFP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFP

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]9.8e-24279.57Show/hide
Query:  QTMAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRF
        +TMAFRRNTRAHNYEDPNPRGEGAAD NVP  VP GVAP VPQ APQGVPQ                   NGAGGAQ QQPRRAQ  QEEVQFIRDFKRF
Subjt:  QTMAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRF

Query:  GPPVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSL
        GPPVFNGVSERPTA EEWVRELEALYVYLGCSD+FKV+GA+                                            NEKRAEFLRLTQGSL
Subjt:  GPPVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSL

Query:  TVTQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQ
        TV QYERKFTELSRF MQYIP EQLKIDKFID L REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGS+SGVKRKFASFSS+QPSRGHQ  VQ
Subjt:  TVTQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQ

Query:  RQTVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDS
        RQT PPVCPSCKKSH GPCW GK ICYRCQKEGHFAREC MTG NTQ LGQRIP T A QGGTHRAR+FALTRGDV HAEAVV GTVLVLSMPAYALFDS
Subjt:  RQTVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDS

Query:  GSSHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFI
         SSHSFIASTFVRHADLELESLGFL SVST SGSVL TSQ+VKGGQLSFDGQ L+VKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFRLPS QNF 
Subjt:  GSSHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFI

Query:  FKGVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        FKGVKA VPRVVS LKASH  QRGAWAYL SVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
Subjt:  FKGVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]7.2e-26988.93Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPRGEGAAD NVPP VP GVAPP PQ A QGVPQVNPQVALLAEALQVLLDNANGAGGAQ QQPR AQI QEE            
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
             VSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         +YERKFTELSRFGMQYIPT+QLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFSS+QPSR HQ  VQRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKKSHAGPCW GKRICYRCQKEGHFAREC MTG NTQALGQRIPATAA QGGTHRAR+FALTRGDVE+AEAVVT TVLVLSMPAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFV HADLELESLGFL SVST SGSVL TSQVVKGGQLSFDGQ L+VKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEA
        GVKAGVPRVV  LKASHL QRGAWAYL SVVDARKVVPSIEA
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DNV8 uncharacterized protein LOC1110229258.1e-23477.01Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNT+AHNYEDPN RGEGAADLNVPP VP                    +V LLAEALQVLLDNANGAGGAQ QQP R QI QEEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FML+GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL V
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFSS+QPSRGHQ   QRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PP CPSCKK+HAGPCW GKRICYRCQKEGHFARECLMTG NTQALGQRIPATA TQ                        GTVLVL            
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
                                       SVL TSQVVKGGQLSFDGQAL V LIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        G+K GVPRVVS LKASHL QRG WAYL SV+DA KVVPSIEAVRVVNEFTDVFPEDL GLP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

A0A6J1DQB9 Reverse transcriptase5.7e-28087.34Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN RGE AAD NV P VP GV PPVPQ APQGVPQVNPQVALLAEALQVLL NANGAGGAQ QQPRRAQI Q+EVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFG QY+PTEQLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GSNSGVKRKFASFS++Q SRGHQ   QRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKK+HA PCW GK+IC++CQKEGHF RECLMTG NTQAL Q+ P   ATQGGT  AR+FALTRGDVEHAEAVVTGT+L+LS+PAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFVRHADLELES GF  SVST SGSVL TSQVVKGGQLSF GQ L+V LIQL+MQDFDVILGMDWLAANRANI+CSKKEVSF L SGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        GVKAGVPRVVS LKAS+L QRG WAYL SVVDARKVVPSIE VRVVNEFTDVFPEDLPGLP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

A0A6J1DQB9 Reverse transcriptase5.6e-1782.09Show/hide
Query:  LPSRAASVASLVAVCSPIHSELERLEVELTVDDVSALLARLLVEPNLRQRIIVAQKGDPGLAKGFSM
        L  +AASVASLVA CS +HSELE  EVELTVDDVSALLARL VEP+LRQRIIVAQK DP LAKGFSM
Subjt:  LPSRAASVASLVAVCSPIHSELERLEVELTVDDVSALLARLLVEPNLRQRIIVAQKGDPGLAKGFSM

A0A6J1DQB9 Reverse transcriptase3.7e-27187Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNPRGEGAAD NVP  VP  VAPPVPQ APQGVPQVNPQVALLAEALQVLLDNANGAGGAQ QQPRRAQI Q+EVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
        PVFNGVSERPTATEEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         QYERKFTELSRFGMQYIPTEQLKIDKFID LR EIKGLLV+KEPTTYAAA+RCALVMDKCLEEPQSQQVMGS+SGVKRKFA FSS+Q SRGHQ  VQRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKK+HAGPCW GKRIC+RCQK                      PA AA QGGT RAR+FALTRGDVEHAEAVVTGT+LV+SMPAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFVRHADLELESLGFL SVST SGSVL  SQVVKGGQLSFDGQ  +VKLIQLDMQDFDVILGMDWLAANRANI+CSKKEVSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFP
         VK GVPRVVS LKA++L QRGAWAYL SVVDARKVVPSIEAVRVVNEFTDVFP
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFP

A0A6J1DTE5 uncharacterized protein LOC1110238214.7e-24279.57Show/hide
Query:  QTMAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRF
        +TMAFRRNTRAHNYEDPNPRGEGAAD NVP  VP GVAP VPQ APQGVPQ                   NGAGGAQ QQPRRAQ  QEEVQFIRDFKRF
Subjt:  QTMAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRF

Query:  GPPVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSL
        GPPVFNGVSERPTA EEWVRELEALYVYLGCSD+FKV+GA+                                            NEKRAEFLRLTQGSL
Subjt:  GPPVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSL

Query:  TVTQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQ
        TV QYERKFTELSRF MQYIP EQLKIDKFID L REIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGS+SGVKRKFASFSS+QPSRGHQ  VQ
Subjt:  TVTQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQ

Query:  RQTVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDS
        RQT PPVCPSCKKSH GPCW GK ICYRCQKEGHFAREC MTG NTQ LGQRIP T A QGGTHRAR+FALTRGDV HAEAVV GTVLVLSMPAYALFDS
Subjt:  RQTVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDS

Query:  GSSHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFI
         SSHSFIASTFVRHADLELESLGFL SVST SGSVL TSQ+VKGGQLSFDGQ L+VKLIQLDMQDFDVILGMDWLAAN+ANIDCSKKE SFRLPS QNF 
Subjt:  GSSHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFI

Query:  FKGVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
        FKGVKA VPRVVS LKASH  QRGAWAYL SVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP
Subjt:  FKGVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLP

A0A6J1DWP4 uncharacterized protein LOC1110252153.5e-26988.93Show/hide
Query:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNPRGEGAAD NVPP VP GVAPP PQ A QGVPQVNPQVALLAEALQVLLDNANGAGGAQ QQPR AQI QEE            
Subjt:  MAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLAPQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGP

Query:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
             VSERPTA EEWVRELEALYVYLGCSD+FKVRGA+FMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTV
Subjt:  PVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ
         +YERKFTELSRFGMQYIPT+QLKIDKFID LRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQV+GS+SGVKRKFASFSS+QPSR HQ  VQRQ
Subjt:  TQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLEEPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQ

Query:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS
        T PPVCPSCKKSHAGPCW GKRICYRCQKEGHFAREC MTG NTQALGQRIPATAA QGGTHRAR+FALTRGDVE+AEAVVT TVLVLSMPAYALFDSGS
Subjt:  TVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGDVEHAEAVVTGTVLVLSMPAYALFDSGS

Query:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK
        SHSFIASTFV HADLELESLGFL SVST SGSVL TSQVVKGGQLSFDGQ L+VKLIQLDMQDFDVILGMDWLAANRANIDCSKK+VSFRLPSGQNF FK
Subjt:  SHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCSKKEVSFRLPSGQNFIFK

Query:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEA
        GVKAGVPRVV  LKASHL QRGAWAYL SVVDARKVVPSIEA
Subjt:  GVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGGGTCGATACGAGGAGTCCTTTGGAGGGAAGACTATTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGATATAAA
TGGTCGAGGGTCGATGCAGCAGGGTCGGGGCTCTGGGTATAAATGGTCGTTAGGGTATGGTCCTTGTCAGTCTCCTCGTCATCACCAGACAATGGCTTTTCGGCGAAATA
CGAGAGCTCACAACTACGAGGATCCGAATCCTAGGGGTGAGGGAGCAGCGGATCTGAATGTTCCCCCGACAGTTCCTAGAGGGGTAGCACCCCCGGTCCCTCAGTTAGCA
CCCCAGGGAGTTCCCCAGGTGAATCCCCAGGTGGCGTTACTAGCTGAGGCCTTGCAAGTATTGCTGGATAATGCGAATGGAGCCGGTGGAGCTCAAGGGCAGCAGCCTCG
TCGGGCACAGATTCAACAAGAGGAGGTTCAGTTTATCAGGGATTTCAAACGCTTTGGACCACCAGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGACCGAGGAATGGG
TCAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGAATTCAAGGTCCGGGGAGCAATGTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTG
GCGGCAGCGGAGGATCACGCCAACGTACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTT
TCTCCGTCTCACTCAAGGGAGCCTAACTGTGACCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACA
AGTTCATTGACGATTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAGCCAACTACTTATGCAGCAGCAGTCAGGTGTGCGTTGGTTATGGACAAATGTCTCGAG
GAGCCTCAGTCTCAGCAGGTGATGGGCTCCAACTCAGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAATCAACCTTCAAGAGGACACCAGCAGCTTGTGCAAAGGCA
GACTGTTCCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCGTGTTGGGCGGGAAAAAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTCGCAAGGGAGT
GTCTGATGACCGGCTTGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACGGCGGCAACTCAAGGTGGGACCCATAGGGCGCGTATCTTCGCTCTTACCAGGGGGGAT
GTTGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCGTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGT
TCGACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTCGTCGGTATCCACATCGTCAGGGTCTGTGTTGGGCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCCT
TTGATGGTCAGGCCTTGGATGTAAAATTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACCGGGCTAATATTGATTGCTCG
AAGAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTATCTTTAAAGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTCGGTATTGAAGGCCAGCCATCTCTTCCA
ACGTGGTGCCTGGGCCTATTTGGTCAGCGTCGTGGATGCAAGGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCCCTGAGGACC
TCCCTGGTTTGCCTTCGAGGGCGGCGAGTGTGGCATCTCTTGTAGCGGTCTGCAGTCCAATACATAGCGAGTTGGAACGCTTGGAGGTGGAGTTGACGGTGGATGATGTC
TCCGCGTTGTTGGCTCGACTCTTAGTGGAACCTAACCTGAGACAGAGGATCATTGTTGCCCAAAAGGGAGACCCTGGCTTGGCCAAAGGCTTTAGTATGTGGAAGAAGCT
ACTTGGGAGCGGGAGGACGACATCAAGGCGAGATACCCTGAACTGTTGGAACATTCAACTTTCGGGGACGAAAGTTTTTGAAGGAGGGAAGTCTGTAACGCCCCGAGTCC
CTCAGCCGCCCCCTTTATTTTCCGGCGACCCAGCCTCCCTCCTCCGGCGTTTTCTTTTGCGAGCTACATCGCAGCTGTCCTCCGCGCACGGTAGGTCCGACGAGTGGTGC
ACGGGTTCCACGAGCAGCGGCGGTGCGACCTCCTTGGCGGCGTTCGAGCAGCGGCGGCCGACGACTCCCGACGCTCCAGCGGCTGCATTTCGTAACAGGTACAACGGGCA
CGACAGCGTTCTCGTAGTGGCGCACGGCGGACTGTTACAGCGGCGGACCGGCGATGTGACTCCCCGACGTTTCGACGGCCTGCAACAGCGGCGGCGCGCCCCGACGATCT
GTAAGAACCAACGCGGCCTCCTCCTCGCGGCGGCGCACGGCGAACGGGCGGTACAGCAGCTGAGCGGCAGCGGGATTTGTACGTTACAGTGGGGTTTAGGACGTTTGGCA
GTGACCCACATCCGTTCGGAGCTCGATTCAAGCTACCCAAACCTTGGCGAGTTAGATCTAGGTGACCCACATCTATACGAAGGTGAGGTCGACGCAAAACTTAAGGGCAC
GGTTGTGAACGACAATGTTGCAAAAGGAGCGTTCGGCTATGCTGACGGGTGCCGCAAAGACTTGAGGGCAGCTGGAGCAGAAGTTGTGTCTGGCACCAGAAGAATGAACG
AGCTGGAGCTGAAACTAGCCAAGGCCGAGGAAATGGCCCTTGGCAACGAGTCTACTATGCTAGATGATATTGAGGCTGTTATAAAAGCGTTTAAGGTTGATAACGTTGTG
GCACTTGACCGACTGAAGACTACTGAAGATCCCCTTGGTCATCTATGTGCATATCTCCTGTCATTCAATGAGCGCCTGGAAGACCCAGACTACCTAGAGTCCCAATTTTG
TGGCTTCGTTTCCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGGGTCGATACGAGGAGTCCTTTGGAGGGAAGACTATTGGGGCCTTGGGTATAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTCGGATATAAA
TGGTCGAGGGTCGATGCAGCAGGGTCGGGGCTCTGGGTATAAATGGTCGTTAGGGTATGGTCCTTGTCAGTCTCCTCGTCATCACCAGACAATGGCTTTTCGGCGAAATA
CGAGAGCTCACAACTACGAGGATCCGAATCCTAGGGGTGAGGGAGCAGCGGATCTGAATGTTCCCCCGACAGTTCCTAGAGGGGTAGCACCCCCGGTCCCTCAGTTAGCA
CCCCAGGGAGTTCCCCAGGTGAATCCCCAGGTGGCGTTACTAGCTGAGGCCTTGCAAGTATTGCTGGATAATGCGAATGGAGCCGGTGGAGCTCAAGGGCAGCAGCCTCG
TCGGGCACAGATTCAACAAGAGGAGGTTCAGTTTATCAGGGATTTCAAACGCTTTGGACCACCAGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGACCGAGGAATGGG
TCAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGAATTCAAGGTCCGGGGAGCAATGTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTG
GCGGCAGCGGAGGATCACGCCAACGTACCCGTCACATGGGCGAGGTTTAAGGACCTACTCTATGAGTACTATTTCCCCGTGACTGTCAGGAATGAAAAACGGGCAGAGTT
TCTCCGTCTCACTCAAGGGAGCCTAACTGTGACCCAATACGAGAGGAAGTTCACTGAGCTGTCCCGTTTTGGAATGCAATATATTCCTACTGAACAATTAAAGATTGACA
AGTTCATTGACGATTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAGCCAACTACTTATGCAGCAGCAGTCAGGTGTGCGTTGGTTATGGACAAATGTCTCGAG
GAGCCTCAGTCTCAGCAGGTGATGGGCTCCAACTCAGGGGTCAAGAGGAAATTTGCATCGTTCTCCTCCAATCAACCTTCAAGAGGACACCAGCAGCTTGTGCAAAGGCA
GACTGTTCCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGCTGGGCCGTGTTGGGCGGGAAAAAGAATATGTTACAGGTGTCAGAAGGAAGGACATTTCGCAAGGGAGT
GTCTGATGACCGGCTTGAATACCCAAGCTTTAGGCCAGAGGATCCCTGCGACGGCGGCAACTCAAGGTGGGACCCATAGGGCGCGTATCTTCGCTCTTACCAGGGGGGAT
GTTGAGCATGCCGAGGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCGTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGT
TCGACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTCGTCGGTATCCACATCGTCAGGGTCTGTGTTGGGCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCCT
TTGATGGTCAGGCCTTGGATGTAAAATTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTAGCTGCCAACCGGGCTAATATTGATTGCTCG
AAGAAGGAAGTTAGCTTTCGCTTGCCCTCCGGACAAAACTTTATCTTTAAAGGAGTTAAGGCCGGGGTCCCAAGGGTGGTGTCGGTATTGAAGGCCAGCCATCTCTTCCA
ACGTGGTGCCTGGGCCTATTTGGTCAGCGTCGTGGATGCAAGGAAGGTTGTGCCAAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTCACTGACGTGTTCCCTGAGGACC
TCCCTGGTTTGCCTTCGAGGGCGGCGAGTGTGGCATCTCTTGTAGCGGTCTGCAGTCCAATACATAGCGAGTTGGAACGCTTGGAGGTGGAGTTGACGGTGGATGATGTC
TCCGCGTTGTTGGCTCGACTCTTAGTGGAACCTAACCTGAGACAGAGGATCATTGTTGCCCAAAAGGGAGACCCTGGCTTGGCCAAAGGCTTTAGTATGTGGAAGAAGCT
ACTTGGGAGCGGGAGGACGACATCAAGGCGAGATACCCTGAACTGTTGGAACATTCAACTTTCGGGGACGAAAGTTTTTGAAGGAGGGAAGTCTGTAACGCCCCGAGTCC
CTCAGCCGCCCCCTTTATTTTCCGGCGACCCAGCCTCCCTCCTCCGGCGTTTTCTTTTGCGAGCTACATCGCAGCTGTCCTCCGCGCACGGTAGGTCCGACGAGTGGTGC
ACGGGTTCCACGAGCAGCGGCGGTGCGACCTCCTTGGCGGCGTTCGAGCAGCGGCGGCCGACGACTCCCGACGCTCCAGCGGCTGCATTTCGTAACAGGTACAACGGGCA
CGACAGCGTTCTCGTAGTGGCGCACGGCGGACTGTTACAGCGGCGGACCGGCGATGTGACTCCCCGACGTTTCGACGGCCTGCAACAGCGGCGGCGCGCCCCGACGATCT
GTAAGAACCAACGCGGCCTCCTCCTCGCGGCGGCGCACGGCGAACGGGCGGTACAGCAGCTGAGCGGCAGCGGGATTTGTACGTTACAGTGGGGTTTAGGACGTTTGGCA
GTGACCCACATCCGTTCGGAGCTCGATTCAAGCTACCCAAACCTTGGCGAGTTAGATCTAGGTGACCCACATCTATACGAAGGTGAGGTCGACGCAAAACTTAAGGGCAC
GGTTGTGAACGACAATGTTGCAAAAGGAGCGTTCGGCTATGCTGACGGGTGCCGCAAAGACTTGAGGGCAGCTGGAGCAGAAGTTGTGTCTGGCACCAGAAGAATGAACG
AGCTGGAGCTGAAACTAGCCAAGGCCGAGGAAATGGCCCTTGGCAACGAGTCTACTATGCTAGATGATATTGAGGCTGTTATAAAAGCGTTTAAGGTTGATAACGTTGTG
GCACTTGACCGACTGAAGACTACTGAAGATCCCCTTGGTCATCTATGTGCATATCTCCTGTCATTCAATGAGCGCCTGGAAGACCCAGACTACCTAGAGTCCCAATTTTG
TGGCTTCGTTTCCTTCTGA
Protein sequenceShow/hide protein sequence
MVEGRYEESFGGKTIGALGINGQGPIDGEVIGASDINGRGSMQQGRGSGYKWSLGYGPCQSPRHHQTMAFRRNTRAHNYEDPNPRGEGAADLNVPPTVPRGVAPPVPQLA
PQGVPQVNPQVALLAEALQVLLDNANGAGGAQGQQPRRAQIQQEEVQFIRDFKRFGPPVFNGVSERPTATEEWVRELEALYVYLGCSDEFKVRGAMFMLRGEAVNWWESV
AAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVTQYERKFTELSRFGMQYIPTEQLKIDKFIDDLRREIKGLLVLKEPTTYAAAVRCALVMDKCLE
EPQSQQVMGSNSGVKRKFASFSSNQPSRGHQQLVQRQTVPPVCPSCKKSHAGPCWAGKRICYRCQKEGHFARECLMTGLNTQALGQRIPATAATQGGTHRARIFALTRGD
VEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLSSVSTSSGSVLGTSQVVKGGQLSFDGQALDVKLIQLDMQDFDVILGMDWLAANRANIDCS
KKEVSFRLPSGQNFIFKGVKAGVPRVVSVLKASHLFQRGAWAYLVSVVDARKVVPSIEAVRVVNEFTDVFPEDLPGLPSRAASVASLVAVCSPIHSELERLEVELTVDDV
SALLARLLVEPNLRQRIIVAQKGDPGLAKGFSMWKKLLGSGRTTSRRDTLNCWNIQLSGTKVFEGGKSVTPRVPQPPPLFSGDPASLLRRFLLRATSQLSSAHGRSDEWC
TGSTSSGGATSLAAFEQRRPTTPDAPAAAFRNRYNGHDSVLVVAHGGLLQRRTGDVTPRRFDGLQQRRRAPTICKNQRGLLLAAAHGERAVQQLSGSGICTLQWGLGRLA
VTHIRSELDSSYPNLGELDLGDPHLYEGEVDAKLKGTVVNDNVAKGAFGYADGCRKDLRAAGAEVVSGTRRMNELELKLAKAEEMALGNESTMLDDIEAVIKAFKVDNVV
ALDRLKTTEDPLGHLCAYLLSFNERLEDPDYLESQFCGFVSF