; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027390 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027390
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSTART domain-containing protein
Genome locationchr8:74706..97073
RNA-Seq ExpressionLag0027390
SyntenyLag0027390
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008289 - lipid binding (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR002913 - START domain
IPR012337 - Ribonuclease H-like superfamily
IPR023393 - START-like domain superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461543.1 PREDICTED: phosphatidylcholine transfer protein-like isoform X1 [Cucumis melo]4.2e-19285.97Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M GIFQATSEDLWRG+IYGSWGTV V++FI+ICHLF SKKNV SLLSR   SS+VADRH S+ S  CPP  RIMEAISD DLKSLLD LDG++NENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         VV+KSND LSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDND+RKQWD+T+LMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFTKECEHP APQQKKYVRVTFFRSGWRIRRVS RNACEISMLHQEDAGLNVEMA+LVFAKGIWSFVCKM+KALRKYALINN PSSSLV+AITLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG+ED + +ISKAN++  ESCGQ+SS ERKLSRASKKL+ NGLLL+GGVICLSRGHSSLGAKVV+AYILTKLSKR DAP GQ+
Subjt:  VPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

XP_022934445.1 uncharacterized protein LOC111441622 [Cucurbita moschata]2.3e-19084.62Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M  +FQATSEDLWRGSIYGSWGTVLVIVFISICHLF  KKN  SLLSRL+T SVVADR ISN+SPSCPPQ RIME ISD DLK+LLD LDG+LNENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         V++KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP+LLRDFYMDNDFRKQWD+T+LMHEQLQMD TSG+EVGRTLKKFPL+TPREY+LSWRLWEGKD+
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFT+ECEHPLAP+QKKYVRVTFFRSGWRIRRV  RNACEISMLHQEDAGLNVEMA+L F+KGIWSF+CKM+KALRKYALINNPPSSSLVTA TLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG ED ND+    +SKANI+ TE SCGQ+SSGE+KLSRASKKL+A  LLLLGGVICLSRGHSSLGAKVV+AYILTKL+KRADAPG Q+
Subjt:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

XP_022983283.1 uncharacterized protein LOC111481908 [Cucurbita maxima]3.6e-19185.38Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M  +FQATSEDLWRGSIYGSWGTVLVIVFISICHLF  KKNV SLLSRL+T SVVAD  ISNISPSCPPQ RIME ISD DLK+LLD LDG+LNENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         V++KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP+LLRDFYMDNDFR+QWD+T+LMHEQLQMD TSG+EVGRTLKKFPL+TPREY+LSWRLWEGKD+
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFTKECEHPLAP+QKKYVRVTFFRSGWRIRRV  RNACEISMLHQEDAGLNVEMA+L FAKGIWSFVCKM+KALRKYALINNPPSSSLVTA TLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG ED ND+    +SKANI+ TE SCGQ+SSGE+KLSRASKKL+A  LLLLGGVICLSRGHSSLGAKVV+AYILTKL+KRADAPG Q+
Subjt:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

XP_023527842.1 uncharacterized protein LOC111790942 [Cucurbita pepo subsp. pepo]5.2e-19085.13Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M  +FQATSEDLWRGSIYGSWGTVLVIVFISICHLF  KKN  SLLSRL+T SVVADR ISN+SPS PPQ RIME ISD DLK+LLD LDG+LNENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         V++KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP+LLRDFYMDNDFRKQWD+T+LMHEQLQMD TSG+EVGRTLKKFPLLTPREY+LSWRLWEGKD+
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFTKECEHPLAP+QKKYVRVTFFRSGWRIRRV  RNACEISMLHQEDAGLNVEMA+L F+KGIWSFVCKM+KALRKYALINNPPSSSLVTA TLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG ED ND+    +SKANI+ TE SCGQ+SSGE+KLSRASKKL+A  LLLLGGVICLSRGHSSLGAKVV+AYILTKL+KRADAPG Q+
Subjt:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

XP_038903674.1 uncharacterized protein LOC120090201 isoform X1 [Benincasa hispida]2.6e-19486.62Show/hide
Query:  MVGI--FQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSC---PPQLRIMEAISDADLKSLLDKLDGK-LN
        MVG+  FQATSEDLWRGSIYGSWGTVLV+VFI+ICHL  SKKNVCSLLSRL+TSS+VADRHIS++SPS    PPQLR+MEAISD DLKSLLD LDG+ +N
Subjt:  MVGI--FQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSC---PPQLRIMEAISDADLKSLLDKLDGK-LN

Query:  ENERWEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRL
        ENE+WE VV+KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP LLRDFYMDND+RKQWD+T+LMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRL
Subjt:  ENERWEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRL

Query:  WEGKDETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTA
        WEGKDETFYCFTKEC+HPLAPQQKKYVRVTFFRSGWRIRRVS RNACEISMLHQEDAGLNVEMA+L FAKGIWSFVCKM+KALRKYALINNPPSSSLVTA
Subjt:  WEGKDETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTA

Query:  ITLIKKVPDGVEDGNDMISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADA--PGGQIEK
        +TLIKKVPDG ED + +ISKAN++ TE SCGQ+SS ERKLSRASKKL+ANGLLL+GGVICLSRGHSSLGAKVV+AYILTKLSKRADA  PGGQ+ K
Subjt:  ITLIKKVPDGVEDGNDMISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADA--PGGQIEK

TrEMBL top hitse value%identityAlignment
A0A0A0LCC1 START domain-containing protein9.5e-19083.25Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNI-------SPS--CPPQLRIMEAISDADLKSLLDKLDG
        M GIFQATSEDLWRG+IYGSWGTV V++F++ICHLFCSKKNV SLLSR  TSS++ADRH S++       SPS  CP   RIMEAISD DLKSLLD LDG
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNI-------SPS--CPPQLRIMEAISDADLKSLLDKLDG

Query:  KLNENERWEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILS
        ++NENE+WE VV+KSND LSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDND+RKQWD+T+LMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILS
Subjt:  KLNENERWEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILS

Query:  WRLWEGKDETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSL
        WRLWEGKDETFYCFTKECEHP APQQKKYVRVTFFRSGWRIRRVS RNACEI+MLHQEDAGLNVEMA+LVFAKGIWSFVCKM+KALRKY+LINN PSSSL
Subjt:  WRLWEGKDETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSL

Query:  VTAITLIKKVPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        V+A+TLIKKVPDG ED + +IS+ N++ TESCGQ+SS ERKLSRASKKL+ NGLLL+GGVICLSRGHSSLGAKVV+AYILTKLSKR DAP GQ+
Subjt:  VTAITLIKKVPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

A0A1S3CEV0 phosphatidylcholine transfer protein-like isoform X12.0e-19285.97Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M GIFQATSEDLWRG+IYGSWGTV V++FI+ICHLF SKKNV SLLSR   SS+VADRH S+ S  CPP  RIMEAISD DLKSLLD LDG++NENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         VV+KSND LSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDND+RKQWD+T+LMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFTKECEHP APQQKKYVRVTFFRSGWRIRRVS RNACEISMLHQEDAGLNVEMA+LVFAKGIWSFVCKM+KALRKYALINN PSSSLV+AITLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG+ED + +ISKAN++  ESCGQ+SS ERKLSRASKKL+ NGLLL+GGVICLSRGHSSLGAKVV+AYILTKLSKR DAP GQ+
Subjt:  VPDGVEDGNDMISKANIIATESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

A0A6J1D880 phosphatidylcholine transfer protein-like3.4e-17179.24Show/hide
Query:  VGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTS---SVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENER
        +GI + TSEDLWRGS YG+WGT+LV++FISICHLF       SLLSR RTS    ++AD H  + SPS PPQ  I EAISD DLKSLLD LDGKLNENE+
Subjt:  VGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTS---SVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENER

Query:  WEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGK
        WE VV+KSN+LLSYSAKCCKPKDGPLKY SVTIFENCCPKLLRDFYMD+DFRKQWDNT+LMHEQLQ+D  SGIEVGRTLKKFPLL PREYILSWRLWEGK
Subjt:  WEHVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGK

Query:  DETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLI
        DETFYCFTKECEHP APQQKKYVRVTFFRSGWRIRRVS RNACEISMLHQEDAGLNVEMA+L FAKGIWSFVCKM+KALR YAL + P SSSLVTA+TLI
Subjt:  DETFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLI

Query:  KKVPDGVEDGNDMISKA-NIIAT------ESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQIE
        KKVPDG+ED N MISKA N + T      ESCGQ S G RK S+ASKK +A GLLLLGG ICLSRGHSSLGAKVV+AYILTKLSKRA+AP G+IE
Subjt:  KKVPDGVEDGNDMISKA-NIIAT------ESCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQIE

A0A6J1F7Q2 uncharacterized protein LOC1114416221.1e-19084.62Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M  +FQATSEDLWRGSIYGSWGTVLVIVFISICHLF  KKN  SLLSRL+T SVVADR ISN+SPSCPPQ RIME ISD DLK+LLD LDG+LNENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         V++KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP+LLRDFYMDNDFRKQWD+T+LMHEQLQMD TSG+EVGRTLKKFPL+TPREY+LSWRLWEGKD+
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFT+ECEHPLAP+QKKYVRVTFFRSGWRIRRV  RNACEISMLHQEDAGLNVEMA+L F+KGIWSF+CKM+KALRKYALINNPPSSSLVTA TLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG ED ND+    +SKANI+ TE SCGQ+SSGE+KLSRASKKL+A  LLLLGGVICLSRGHSSLGAKVV+AYILTKL+KRADAPG Q+
Subjt:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

A0A6J1J7C0 uncharacterized protein LOC1114819081.7e-19185.38Show/hide
Query:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE
        M  +FQATSEDLWRGSIYGSWGTVLVIVFISICHLF  KKNV SLLSRL+T SVVAD  ISNISPSCPPQ RIME ISD DLK+LLD LDG+LNENE+WE
Subjt:  MVGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWE

Query:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE
         V++KSND LSYSAKCCKPKDGPLKYSSVTIFENCCP+LLRDFYMDNDFR+QWD+T+LMHEQLQMD TSG+EVGRTLKKFPL+TPREY+LSWRLWEGKD+
Subjt:  HVVDKSNDLLSYSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDE

Query:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK
        TFYCFTKECEHPLAP+QKKYVRVTFFRSGWRIRRV  RNACEISMLHQEDAGLNVEMA+L FAKGIWSFVCKM+KALRKYALINNPPSSSLVTA TLIKK
Subjt:  TFYCFTKECEHPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKK

Query:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI
        VPDG ED ND+    +SKANI+ TE SCGQ+SSGE+KLSRASKKL+A  LLLLGGVICLSRGHSSLGAKVV+AYILTKL+KRADAPG Q+
Subjt:  VPDGVEDGNDM----ISKANIIATE-SCGQISSGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.1e-3325.65Show/hide
Query:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSKDSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRY----FKDRPFLEASL
        MS   +P SI   +D +   F WGS+  K+K H   W+ +C+ K  GGLG R     N+A+++K+ WR+++E +SL   VL+ +Y     +D  +L    
Subjt:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSKDSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRY----FKDRPFLEASL

Query:  GNNPSLTWRSIMWG-RDLFLKGYRWRVGNGRYIEIDKDPWLPRDFSKIPKLMDDHLEGRKVSNLIDENNQW--KKDWVLEHIHHQDAK--------VILN
          + S TWRSI  G RD+   G  W  G+G+ I    D W+    S  P L  D+ E     + +   + W   + W    I              V+L+
Subjt:  GNNPSLTWRSIMWG-RDLFLKGYRWRVGNGRYIEIDKDPWLPRDFSKIPKLMDDHLEGRKVSNLIDENNQW--KKDWVLEHIHHQDAK--------VILN

Query:  IPLGDPRMRDEIIWSSDKKGKFSVKSAYRLAMEKEGELEASSSDPRSSQ-DCWRSLWRIPTLPRAKICVWKILNDIVPSCSNLLKKGLNIDPLCVLCRRR
        +  G    RD + W   + G+FSV+SAY        E+      PR +    +  LW++    R K  +W + N  V +     ++ L+   +C +C+  
Subjt:  IPLGDPRMRDEIIWSSDKKGKFSVKSAYRLAMEKEGELEASSSDPRSSQ-DCWRSLWRIPTLPRAKICVWKILNDIVPSCSNLLKKGLNIDPLCVLCRRR

Query:  LKTSTHSIWECKMVRDIWPMFIPD--SLDLFSCNRANWSTLDYWDWTRQNFKDVELGKA-VILMWCIWSFRNSKLNSNSTKLVDKATLINQISSRLLEIE
        +++  H + +C     IW   +P       FS +   W   +  D  R   +D+       +++W  W +R   +   +TK  D+   + + +   +E+ 
Subjt:  LKTSTHSIWECKMVRDIWPMFIPD--SLDLFSCNRANWSTLDYWDWTRQNFKDVELGKA-VILMWCIWSFRNSKLNSNSTKLVDKATLINQISSRLLEIE

Query:  RPRR-SYLVSPKALRGENHLSQNHWSPPPPGCWKINTDASWNASEEKGGMGWIVRDSGGS
        R    + LV     R E  +    W  P  G  K+NTD +   +      G ++RD  G+
Subjt:  RPRR-SYLVSPKALRGENHLSQNHWSPPPPGCWKINTDASWNASEEKGGMGWIVRDSGGS

P53809 Phosphatidylcholine transfer protein1.1e-0424.24Show/hide
Query:  GPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPL-LTPREYILSWR---LWEGKDETFYCFTKECEHPLAPQQ
        G  +Y    + E+C P LL D YMD D+RK+WD  +   ++L      G  V     K+P  L+ R+Y+ + +   L     + +    +    P  P++
Subjt:  GPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPL-LTPREYILSWR---LWEGKDETFYCFTKECEHPLAPQQ

Query:  KKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFA--KGIWSFVCKMNKALRKY
           +RV  ++    I     +    + M + ++ G  +    + +A   G+ SF+  M KA + Y
Subjt:  KKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFA--KGIWSFVCKMNKALRKY

P93295 Uncharacterized mitochondrial protein AtMg003109.2e-3346.94Show/hide
Query:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSK-DSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRYFKDRPFLEASLGNN
        MSCFR+   +CK + S   +FWW S E+KRKI W  W  LC SK D GGLGFR++  FNQA+LAK S+RII +P +LL+R+L+ RYF     +E S+G  
Subjt:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSK-DSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRYFKDRPFLEASLGNN

Query:  PSLTWRSIMWGRDLFLKGYRWRVGNGRYIEIDKDPWLPRDFSKIPKL
        PS  WRSI+ GR+L  +G    +G+G + ++  D W+  D + +P L
Subjt:  PSLTWRSIMWGRDLFLKGYRWRVGNGRYIEIDKDPWLPRDFSKIPKL

Q9UKL6 Phosphatidylcholine transfer protein9.0e-0435.21Show/hide
Query:  KDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPL-LTPREYI
        K G  +Y    + E+C P LL D YMD+D+RKQWD  +   ++L     +G  V     K+P  ++ R+Y+
Subjt:  KDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPL-LTPREYI

Arabidopsis top hitse value%identityAlignment
AT1G55960.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.6e-10752.88Show/hide
Query:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNEN-ERWEHVVDKSNDLLSYSAKCCKP
        W T + ++ I + H+F S K      +   +SS  +    S  S     Q RI + +SD DLK L++ L+ + N++ E WEHV+ KSND +SYSAK CKP
Subjt:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNEN-ERWEHVVDKSNDLLSYSAKCCKP

Query:  KD--GPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQ
        KD  GP+KY SVT+FE    +++RDFYMDND+RK WD T++ HEQLQ+D  +GIE+GRT+KKFPLLT REY+L+WRLW+GK++ FYCFTKEC+H + PQQ
Subjt:  KD--GPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQ

Query:  KKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANI
        +KYVRV++FRSGWRIR+V  RNACEI M HQE+AGLNVEMA+L F+KGIWS+VCKM  AL KY   ++     +++A+TL+K+VP  +E G D ++  ++
Subjt:  KKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANI

Query:  IATESCGQIS-----SGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKR
          +   G +S       ++K+ + S KL+A GL+L+GG ICLSRG S+LGAKV LAY+LTKL+KR
Subjt:  IATESCGQIS-----SGERKLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKR

AT3G13062.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein3.3e-11054.79Show/hide
Query:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP
        W TVL +VFI +   F S+K      S   +SS V      ++S S   Q  I    +SD DLK L+ KL  +  + E WE V+ KSN  +SY+AKCCKP
Subjt:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP

Query:  KDG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQK
         DG P+KY S T+FE+C P++LRDFYMDN++RKQWD T++ HEQLQ+D  SGIE+GRT+KKFPLLTPREY+L+W+LWEGKD+ FYCF KEC+H + PQQ+
Subjt:  KDG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQK

Query:  KYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANII
        KYVRV++FRSGWRIR+V  RNACEI M+HQEDAGLNVEMA+L F++GIWS+VCKM  ALRKY   ++ P    ++A++L+KK+P  +E   D I+ ++  
Subjt:  KYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANII

Query:  ATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR
         T     G+ +  ++ L + SKKL+ANG+LL    +GG ICLSRGHS+LGAKV LAY L+K+ KR
Subjt:  ATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR

AT3G13062.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein3.3e-11054.79Show/hide
Query:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP
        W TVL +VFI +   F S+K      S   +SS V      ++S S   Q  I    +SD DLK L+ KL  +  + E WE V+ KSN  +SY+AKCCKP
Subjt:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP

Query:  KDG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQK
         DG P+KY S T+FE+C P++LRDFYMDN++RKQWD T++ HEQLQ+D  SGIE+GRT+KKFPLLTPREY+L+W+LWEGKD+ FYCF KEC+H + PQQ+
Subjt:  KDG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQK

Query:  KYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANII
        KYVRV++FRSGWRIR+V  RNACEI M+HQEDAGLNVEMA+L F++GIWS+VCKM  ALRKY   ++ P    ++A++L+KK+P  +E   D I+ ++  
Subjt:  KYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANII

Query:  ATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR
         T     G+ +  ++ L + SKKL+ANG+LL    +GG ICLSRGHS+LGAKV LAY L+K+ KR
Subjt:  ATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR

AT3G13062.3 Polyketide cyclase/dehydrase and lipid transport superfamily protein5.3e-10853.62Show/hide
Query:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP
        W TVL +VFI +   F S+K      S   +SS V      ++S S   Q  I    +SD DLK L+ KL  +  + E WE V+ KSN  +SY+AKCCKP
Subjt:  WGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEA-ISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLSYSAKCCKP

Query:  K--------DG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECE
                 DG P+KY S T+FE+C P++LRDFYMDN++RKQWD T++ HEQLQ+D  SGIE+GRT+KKFPLLTPREY+L+W+LWEGKD+ FYCF KEC+
Subjt:  K--------DG-PLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECE

Query:  HPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGND
        H + PQQ+KYVRV++FRSGWRIR+V  RNACEI M+HQEDAGLNVEMA+L F++GIWS+VCKM  ALRKY   ++ P    ++A++L+KK+P  +E   D
Subjt:  HPLAPQQKKYVRVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGND

Query:  MISKANIIATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR
         I+ ++   T     G+ +  ++ L + SKKL+ANG+LL    +GG ICLSRGHS+LGAKV LAY L+K+ KR
Subjt:  MISKANIIATES--CGQISSGERKLSRASKKLMANGLLL----LGGVICLSRGHSSLGAKVVLAYILTKLSKR

AT4G29090.1 Ribonuclease H-like superfamily protein1.5e-6228.97Show/hide
Query:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSKDSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRYFKDRPFLEASLGNNP
        M+CF +P ++CK I S+ A FWW + +  + +HWK W+ L   K  GG+GF++I  FN A+L K  WR++  P SL+A+V K RYF     L A LG+ P
Subjt:  MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSKDSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRYFKDRPFLEASLGNNP

Query:  SLTWRSIMWGRDLFLKGYRWRVGNGRYIEIDKDPWL---------------PRDFSKIPKLMDDHLEGRKVSNLIDEN-NQWKKDWVLEHIHHQDAKVIL
        S  W+SI   +++  +G R  VGNG  I I +  WL               P++++ +  ++       KVS+LIDE+  +W+KD +       + K+I 
Subjt:  SLTWRSIMWGRDLFLKGYRWRVGNGRYIEIDKDPWL---------------PRDFSKIPKLMDDHLEGRKVSNLIDEN-NQWKKDWVLEHIHHQDAKVIL

Query:  NIPLGDPRMRDEIIWSSDKKGKFSVKSAYRLAME--KEGELEASSSDPRSSQDCWRSLWRIPTLPRAKICVWKILNDIVPSCSNLLKKGLNIDPLCVLCR
         +  G  R+ D   W     G ++VKS Y +  +   +       S+P S    ++ +W+  T P+ +  +WK L++ +P    L  + L+ +  C+ C 
Subjt:  NIPLGDPRMRDEIIWSSDKKGKFSVKSAYRLAME--KEGELEASSSDPRSSQDCWRSLWRIPTLPRAKICVWKILNDIVPSCSNLLKKGLNIDPLCVLCR

Query:  RRLKTSTHSIWECKMVRDIWPM-FIPDSLDLFSCNRANWSTLD----YWDWTRQNFKDVELGKAVILMWCIWSFRNSKLNSNSTKLVDKATLIN--QISS
           +T  H +++C   R  W +  IP  L         W+       YW +   N        + ++ W +W     +L  N  +LV +    N  ++  
Subjt:  RRLKTSTHSIWECKMVRDIWPM-FIPDSLDLFSCNRANWSTLD----YWDWTRQNFKDVELGKAVILMWCIWSFRNSKLNSNSTKLVDKATLIN--QISS

Query:  RLL-EIERPR-RSYLVSPKALRGENHLSQNHWSPPPPGCWKINTDASWNASEEKGGMGWIVRDSGGSPICAGMKAISVNWPVKLLEAQAIWQALKSIETL
        R   ++E  R R+   S       N  S   W PPP    K NTDA+WN   E+ G+GW++R+  G     G +A+     V   E +A+  A+ S+   
Subjt:  RLL-EIERPR-RSYLVSPKALRGENHLSQNHWSPPPPGCWKINTDASWNASEEKGGMGWIVRDSGGSPICAGMKAISVNWPVKLLEAQAIWQALKSIETL

Query:  PEKPTSIIVNSDCLELILLLNHSDEDLSEIKPIVDAILLLADSIGGISFGHCTREQNCVAHSLARESAGFTRH
          +   +I  SD   LI +LN +DE    +KP +  +  L      + F    RE N +A  +ARES  F  +
Subjt:  PEKPTSIIVNSDCLELILLLNHSDEDLSEIKPIVDAILLLADSIGGISFGHCTREQNCVAHSLARESAGFTRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTGTTTTAGGATCCCAATCAGCATATGCAAGGATATTGACAGTATATGTGCTAAATTTTGGTGGGGATCATCTGAGAGCAAGCGGAAAATACACTGGAAGAATTG
GAATTTCTTATGCACAAGCAAAGACTCAGGAGGCCTTGGGTTTAGAAACATTCTCCTCTTCAATCAAGCTATGCTAGCCAAGTTGAGTTGGAGGATAATCAAAGAGCCTT
CGAGTCTTCTGGCTAGAGTCCTCAAAGGAAGATACTTCAAGGATAGACCTTTCCTTGAAGCCTCCCTAGGGAATAACCCTTCTTTAACCTGGAGAAGCATCATGTGGGGC
AGAGACCTTTTTCTCAAAGGTTATAGGTGGAGGGTTGGAAATGGCAGGTATATCGAAATTGATAAGGACCCTTGGCTCCCAAGAGACTTTTCAAAAATTCCAAAGTTAAT
GGATGATCATCTCGAAGGCAGAAAAGTGAGCAACCTCATTGATGAGAATAATCAGTGGAAGAAGGATTGGGTGCTCGAGCATATTCATCATCAAGATGCAAAAGTCATTC
TAAACATTCCCCTAGGCGATCCTAGAATGAGGGATGAGATCATTTGGAGCTCGGACAAGAAAGGGAAGTTCTCGGTTAAAAGTGCGTACCGCTTAGCTATGGAAAAAGAA
GGAGAGCTGGAAGCTTCTTCTTCAGATCCTAGAAGTTCTCAGGATTGCTGGAGGAGCCTGTGGAGAATTCCTACCTTGCCAAGGGCCAAAATCTGTGTTTGGAAAATTCT
TAACGATATCGTTCCTTCTTGTTCTAATCTTTTGAAAAAAGGCCTAAATATTGACCCTCTGTGTGTTTTATGCAGGAGACGCCTGAAAACCTCTACCCATTCGATATGGG
AGTGTAAGATGGTTAGGGATATATGGCCCATGTTTATTCCAGACTCACTTGATTTGTTTTCTTGTAACAGGGCCAATTGGTCGACGCTGGATTATTGGGACTGGACGAGG
CAAAATTTCAAAGATGTAGAGCTAGGAAAAGCCGTAATTCTCATGTGGTGCATATGGTCTTTCAGAAACTCCAAGCTCAACAGCAACTCTACTAAGCTAGTAGATAAAGC
CACTCTGATCAACCAAATTTCGAGTAGACTTTTAGAAATCGAAAGGCCCAGAAGATCGTACCTGGTTTCTCCGAAGGCCTTAAGAGGCGAGAACCACCTGAGTCAGAACC
ATTGGTCTCCTCCTCCGCCTGGTTGCTGGAAGATCAATACGGACGCATCCTGGAACGCTTCGGAAGAGAAGGGCGGTATGGGTTGGATAGTTCGCGACTCCGGAGGTTCT
CCCATCTGTGCAGGCATGAAAGCCATCTCAGTCAATTGGCCGGTGAAACTCTTGGAAGCTCAAGCAATATGGCAAGCTCTTAAAAGTATCGAAACCTTACCGGAGAAGCC
GACGTCAATTATTGTGAACTCTGATTGTTTGGAGTTGATCTTGCTTTTGAATCACTCCGACGAAGACCTCTCGGAAATTAAGCCTATTGTTGACGCCATCTTGCTTTTGG
CTGATTCTATCGGAGGGATTTCGTTTGGTCATTGTACGAGGGAGCAAAACTGCGTTGCTCATTCCTTAGCGCGCGAAAGTGCTGGTTTTACCCGCCATTCTGGTTCGTGT
AGTCCAGGGCAGAGGCGTCCTTCCACGCTGGAAGAATCCATTCTGGGAGGCTTGGTGGTCTGTCCACTATTGCAGACCGTGCAGTACGGTGATTTTCCTCTGATCTATGA
TTCATGCTTCTGTAACGTGACTTCCCTCCCTGATTGTTTCCTCCGCCCCCTTTCTTTGTTCTCTGACTTATCCCTCCCATTTTTGCCGCAGCAGCATACCCCTGACCCCC
ACCGGTCCACCCCCGTTTCTTTCTTCTCTCCCTATCTCCCTCCCTGCTACAACCTCCCGCCGGCACTTGGAACCAAACAGAGGGCAAGCCTGTTGATTGTATCAGAGATG
GTTGGTATATTCCAGGCTACGTCGGAGGATCTTTGGCGAGGAAGTATTTATGGTAGTTGGGGCACTGTTTTGGTTATCGTTTTCATTTCAATTTGTCATCTATTTTGTTC
CAAAAAAAATGTTTGCTCCCTTCTCTCCCGCTTACGAACCTCGTCTGTCGTTGCTGATCGCCACATCTCCAACATATCTCCTTCATGTCCTCCGCAATTAAGAATAATGG
AGGCTATATCAGATGCAGATTTGAAGTCCTTATTGGATAAGTTGGATGGAAAGCTAAATGAGAATGAGAGATGGGAACATGTTGTAGATAAAAGTAATGATCTTCTTTCA
TATAGTGCTAAATGCTGCAAGCCTAAGGATGGTCCCTTGAAATACTCGAGTGTGACAATATTTGAAAATTGCTGTCCAAAATTGTTGAGAGACTTCTACATGGACAATGA
TTTTAGAAAACAGTGGGACAATACCCTACTTATGCACGAGCAGCTACAAATGGATGGAACTAGCGGAATTGAAGTTGGTCGCACCTTAAAAAAATTTCCATTATTGACAC
CTAGGGAGTATATACTATCATGGAGATTATGGGAGGGGAAAGATGAAACCTTTTACTGCTTTACCAAGGAATGTGAACATCCTTTGGCACCACAACAGAAGAAGTATGTC
CGAGTGACGTTTTTCAGGTCTGGTTGGCGAATCAGGAGAGTATCCGACAGAAATGCGTGTGAGATCAGTATGTTGCACCAAGAAGATGCTGGTTTGAATGTGGAGATGGC
AAGACTTGTGTTTGCAAAGGGCATATGGAGCTTTGTCTGTAAAATGAATAAAGCATTACGCAAATACGCTCTAATCAACAACCCTCCATCAAGTTCACTTGTCACTGCAA
TTACTCTGATTAAGAAGGTTCCAGATGGAGTTGAGGACGGGAATGACATGATCTCTAAAGCAAACATTATAGCAACTGAGTCCTGTGGACAAATTTCTTCAGGAGAGAGA
AAATTGTCGAGGGCATCAAAAAAACTGATGGCCAATGGTTTGCTTCTTCTTGGTGGTGTGATCTGCCTGTCGCGAGGTCACTCTAGTCTTGGAGCCAAAGTTGTTCTGGC
ATATATTCTTACCAAGCTAAGCAAGCGAGCTGATGCACCAGGAGGTCAAATAGAGAAGGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTGTTTTAGGATCCCAATCAGCATATGCAAGGATATTGACAGTATATGTGCTAAATTTTGGTGGGGATCATCTGAGAGCAAGCGGAAAATACACTGGAAGAATTG
GAATTTCTTATGCACAAGCAAAGACTCAGGAGGCCTTGGGTTTAGAAACATTCTCCTCTTCAATCAAGCTATGCTAGCCAAGTTGAGTTGGAGGATAATCAAAGAGCCTT
CGAGTCTTCTGGCTAGAGTCCTCAAAGGAAGATACTTCAAGGATAGACCTTTCCTTGAAGCCTCCCTAGGGAATAACCCTTCTTTAACCTGGAGAAGCATCATGTGGGGC
AGAGACCTTTTTCTCAAAGGTTATAGGTGGAGGGTTGGAAATGGCAGGTATATCGAAATTGATAAGGACCCTTGGCTCCCAAGAGACTTTTCAAAAATTCCAAAGTTAAT
GGATGATCATCTCGAAGGCAGAAAAGTGAGCAACCTCATTGATGAGAATAATCAGTGGAAGAAGGATTGGGTGCTCGAGCATATTCATCATCAAGATGCAAAAGTCATTC
TAAACATTCCCCTAGGCGATCCTAGAATGAGGGATGAGATCATTTGGAGCTCGGACAAGAAAGGGAAGTTCTCGGTTAAAAGTGCGTACCGCTTAGCTATGGAAAAAGAA
GGAGAGCTGGAAGCTTCTTCTTCAGATCCTAGAAGTTCTCAGGATTGCTGGAGGAGCCTGTGGAGAATTCCTACCTTGCCAAGGGCCAAAATCTGTGTTTGGAAAATTCT
TAACGATATCGTTCCTTCTTGTTCTAATCTTTTGAAAAAAGGCCTAAATATTGACCCTCTGTGTGTTTTATGCAGGAGACGCCTGAAAACCTCTACCCATTCGATATGGG
AGTGTAAGATGGTTAGGGATATATGGCCCATGTTTATTCCAGACTCACTTGATTTGTTTTCTTGTAACAGGGCCAATTGGTCGACGCTGGATTATTGGGACTGGACGAGG
CAAAATTTCAAAGATGTAGAGCTAGGAAAAGCCGTAATTCTCATGTGGTGCATATGGTCTTTCAGAAACTCCAAGCTCAACAGCAACTCTACTAAGCTAGTAGATAAAGC
CACTCTGATCAACCAAATTTCGAGTAGACTTTTAGAAATCGAAAGGCCCAGAAGATCGTACCTGGTTTCTCCGAAGGCCTTAAGAGGCGAGAACCACCTGAGTCAGAACC
ATTGGTCTCCTCCTCCGCCTGGTTGCTGGAAGATCAATACGGACGCATCCTGGAACGCTTCGGAAGAGAAGGGCGGTATGGGTTGGATAGTTCGCGACTCCGGAGGTTCT
CCCATCTGTGCAGGCATGAAAGCCATCTCAGTCAATTGGCCGGTGAAACTCTTGGAAGCTCAAGCAATATGGCAAGCTCTTAAAAGTATCGAAACCTTACCGGAGAAGCC
GACGTCAATTATTGTGAACTCTGATTGTTTGGAGTTGATCTTGCTTTTGAATCACTCCGACGAAGACCTCTCGGAAATTAAGCCTATTGTTGACGCCATCTTGCTTTTGG
CTGATTCTATCGGAGGGATTTCGTTTGGTCATTGTACGAGGGAGCAAAACTGCGTTGCTCATTCCTTAGCGCGCGAAAGTGCTGGTTTTACCCGCCATTCTGGTTCGTGT
AGTCCAGGGCAGAGGCGTCCTTCCACGCTGGAAGAATCCATTCTGGGAGGCTTGGTGGTCTGTCCACTATTGCAGACCGTGCAGTACGGTGATTTTCCTCTGATCTATGA
TTCATGCTTCTGTAACGTGACTTCCCTCCCTGATTGTTTCCTCCGCCCCCTTTCTTTGTTCTCTGACTTATCCCTCCCATTTTTGCCGCAGCAGCATACCCCTGACCCCC
ACCGGTCCACCCCCGTTTCTTTCTTCTCTCCCTATCTCCCTCCCTGCTACAACCTCCCGCCGGCACTTGGAACCAAACAGAGGGCAAGCCTGTTGATTGTATCAGAGATG
GTTGGTATATTCCAGGCTACGTCGGAGGATCTTTGGCGAGGAAGTATTTATGGTAGTTGGGGCACTGTTTTGGTTATCGTTTTCATTTCAATTTGTCATCTATTTTGTTC
CAAAAAAAATGTTTGCTCCCTTCTCTCCCGCTTACGAACCTCGTCTGTCGTTGCTGATCGCCACATCTCCAACATATCTCCTTCATGTCCTCCGCAATTAAGAATAATGG
AGGCTATATCAGATGCAGATTTGAAGTCCTTATTGGATAAGTTGGATGGAAAGCTAAATGAGAATGAGAGATGGGAACATGTTGTAGATAAAAGTAATGATCTTCTTTCA
TATAGTGCTAAATGCTGCAAGCCTAAGGATGGTCCCTTGAAATACTCGAGTGTGACAATATTTGAAAATTGCTGTCCAAAATTGTTGAGAGACTTCTACATGGACAATGA
TTTTAGAAAACAGTGGGACAATACCCTACTTATGCACGAGCAGCTACAAATGGATGGAACTAGCGGAATTGAAGTTGGTCGCACCTTAAAAAAATTTCCATTATTGACAC
CTAGGGAGTATATACTATCATGGAGATTATGGGAGGGGAAAGATGAAACCTTTTACTGCTTTACCAAGGAATGTGAACATCCTTTGGCACCACAACAGAAGAAGTATGTC
CGAGTGACGTTTTTCAGGTCTGGTTGGCGAATCAGGAGAGTATCCGACAGAAATGCGTGTGAGATCAGTATGTTGCACCAAGAAGATGCTGGTTTGAATGTGGAGATGGC
AAGACTTGTGTTTGCAAAGGGCATATGGAGCTTTGTCTGTAAAATGAATAAAGCATTACGCAAATACGCTCTAATCAACAACCCTCCATCAAGTTCACTTGTCACTGCAA
TTACTCTGATTAAGAAGGTTCCAGATGGAGTTGAGGACGGGAATGACATGATCTCTAAAGCAAACATTATAGCAACTGAGTCCTGTGGACAAATTTCTTCAGGAGAGAGA
AAATTGTCGAGGGCATCAAAAAAACTGATGGCCAATGGTTTGCTTCTTCTTGGTGGTGTGATCTGCCTGTCGCGAGGTCACTCTAGTCTTGGAGCCAAAGTTGTTCTGGC
ATATATTCTTACCAAGCTAAGCAAGCGAGCTGATGCACCAGGAGGTCAAATAGAGAAGGCGTAA
Protein sequenceShow/hide protein sequence
MSCFRIPISICKDIDSICAKFWWGSSESKRKIHWKNWNFLCTSKDSGGLGFRNILLFNQAMLAKLSWRIIKEPSSLLARVLKGRYFKDRPFLEASLGNNPSLTWRSIMWG
RDLFLKGYRWRVGNGRYIEIDKDPWLPRDFSKIPKLMDDHLEGRKVSNLIDENNQWKKDWVLEHIHHQDAKVILNIPLGDPRMRDEIIWSSDKKGKFSVKSAYRLAMEKE
GELEASSSDPRSSQDCWRSLWRIPTLPRAKICVWKILNDIVPSCSNLLKKGLNIDPLCVLCRRRLKTSTHSIWECKMVRDIWPMFIPDSLDLFSCNRANWSTLDYWDWTR
QNFKDVELGKAVILMWCIWSFRNSKLNSNSTKLVDKATLINQISSRLLEIERPRRSYLVSPKALRGENHLSQNHWSPPPPGCWKINTDASWNASEEKGGMGWIVRDSGGS
PICAGMKAISVNWPVKLLEAQAIWQALKSIETLPEKPTSIIVNSDCLELILLLNHSDEDLSEIKPIVDAILLLADSIGGISFGHCTREQNCVAHSLARESAGFTRHSGSC
SPGQRRPSTLEESILGGLVVCPLLQTVQYGDFPLIYDSCFCNVTSLPDCFLRPLSLFSDLSLPFLPQQHTPDPHRSTPVSFFSPYLPPCYNLPPALGTKQRASLLIVSEM
VGIFQATSEDLWRGSIYGSWGTVLVIVFISICHLFCSKKNVCSLLSRLRTSSVVADRHISNISPSCPPQLRIMEAISDADLKSLLDKLDGKLNENERWEHVVDKSNDLLS
YSAKCCKPKDGPLKYSSVTIFENCCPKLLRDFYMDNDFRKQWDNTLLMHEQLQMDGTSGIEVGRTLKKFPLLTPREYILSWRLWEGKDETFYCFTKECEHPLAPQQKKYV
RVTFFRSGWRIRRVSDRNACEISMLHQEDAGLNVEMARLVFAKGIWSFVCKMNKALRKYALINNPPSSSLVTAITLIKKVPDGVEDGNDMISKANIIATESCGQISSGER
KLSRASKKLMANGLLLLGGVICLSRGHSSLGAKVVLAYILTKLSKRADAPGGQIEKA