; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g11680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g11680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr7:8979237..8991820
RNA-Seq ExpressionMoc07g11680
SyntenyMoc07g11680
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.1e-16353.19Show/hide
Query:  MLPRRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKD
        M PR SMRLRAD DPAPG                                                         GVGGVQAPPPQHLHTPQSEARFIKD
Subjt:  MLPRRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKD

Query:  FMRYGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN---------------------------------
        F RYGPPTFD ESERATA EEWIRELEALYAYLGC+DQFKVKGAVFMLRGEALNWWDSVAAAED+AN                                 
Subjt:  FMRYGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN---------------------------------

Query:  -----VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVG
             VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPT YAEAVRGALVMDKDVSNKASPLPEVGSSS                  
Subjt:  -----VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVG

Query:  LVLELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVT
                                                                                                            
Subjt:  LVLELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVT

Query:  LLLVGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYA
                           GV                                                                      +RKFPSTYA
Subjt:  LLLVGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYA

Query:  DPVLRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL
        D VLRAPQRQAQHQGMPPVCPTC+KRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL
Subjt:  DPVLRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL

Query:  VHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDM
        VHDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSVSTPSGSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  VHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDM

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.8e-12940.32Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP   GEN ADP   PV    G+VPP P AA +      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E +FI+DF  
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
        +GPP F+  SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHAN                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VAQYERKFTELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PT YA AVR ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+  + G+                           +RKF S  A   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q  AQ Q  PPVCP+CKK H   CW G K CF+C +EGHF REC M+ +NTQ L Q+ P   +TQG  Q ARVFALTR +   AE VVTGT+L+  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FVR A LELE  GF LSVSTPSGS+L+ SQ V+ G+LSF  QTL   LIQL+M+DFDVI+GMDWLA N+ANINC ++EVSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
         L S +NFTFK V   VPR VSALKA  LLQ G W YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]7.9e-1669.01Show/hide
Query:  ELDRSEVELAVEDVSAVLAQLSVKPTLRQRIITAQKGDSSLSKGFGMLGQGDFSLSKDKALLYQGRLCVPR
        EL+ SEVEL V+DVSA+LA+LSV+P+LRQRII AQK D SL+KGF M+G GDF+LS + ALL+ GRLCVP+
Subjt:  ELDRSEVELAVEDVSAVLAQLSVKPTLRQRIITAQKGDSSLSKGFGMLGQGDFSLSKDKALLYQGRLCVPR

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.5e-12847.02Show/hide
Query:  YLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN--------------------------------------VAQYERKFTELSRFALELIPTEALKI
        YL C++QFKVKG VFMLRGEALNWWDSVA AEDHAN                                      VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN--------------------------------------VAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVLELIQCNLRKARKIQEAFTLHLQKLVNAQ
        KRFVKGLRKGIRGPVDLQRP  YAEAVRG L+MD DVSN   PL EVGSSS                                                 
Subjt:  KRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVLELIQCNLRKARKIQEAFTLHLQKLVNAQ

Query:  EPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLLVGIIRALVDILWPALEGVPQDPTMAILQ
                                                                                                GV          
Subjt:  EPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLLVGIIRALVDILWPALEGVPQDPTMAILQ

Query:  SIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPVLRAPQRQAQHQGMPPVCPTCKKRHTGQC
                                                                    +RK    YAD   RAPQR AQ QG+PPVCP+C+KR  GQC
Subjt:  SIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPVLRAPQRQAQHQGMPPVCPTCKKRHTGQC

Query:  WTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQATLEL
        WTG++GCFRCGREGHFAREC M+AANTQRLGQR  P VSTQG                       GT LVH+VPAYVLFD GSSHTFIS+AFVRQATLEL
Subjt:  WTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQATLEL

Query:  EPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSFQLPSSRNFTFKRVTGRVPRTVSALKARC
        EPLGFLLSVSTPSGS+LIASQ VRAGELSFDNQTL ARLIQLDMRDFDVI+GMDWLATNQANINC +REVSFQLPS R+FTFK V+G VPR VSALKAR 
Subjt:  EPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSFQLPSSRNFTFKRVTGRVPRTVSALKARC

Query:  LLQNGAWGYLANVVDVSKTPP
        LL NGAW YLA+VVD+S TPP
Subjt:  LLQNGAWGYLANVVDVSKTPP

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]5.3e-12138.72Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP P GE  ADP  P +     + PP P AA +      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E +FI+DF R
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
        +GPP F+  SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDH N                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VAQYERKFTELSRF ++ IPTE LKI +F+ GLR  I+G + ++ PT YA A+R ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+    G+                           +RKF    +   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q   Q Q  PPVCP+CKK H G CW G + CFRC                     Q+ P   + QG  QRARVFALTR +   AE VVTGT+LV  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FVR A LELE LGFLLSVSTPSGS+L+ SQ V+ G+LSFD QT   +LIQLDM+DFDVI+GMDWLA N+ANINC ++EVSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
        +LPS +NFTFKRV   VPR VSALKA  LLQ GAW YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]3.0e-12439.65Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP P GE  ADP  PP     G+ PP P AA++      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E         
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
                 SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHAN                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VA+YERKFTELSRF ++ IPT+ LKI +F+ GLR+ I+G + L+ PT YA AVR ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+    G+                           +RKF S  +   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q   Q Q  PPVCP+CKK H G CW G + C+RC +EGHFARECPM+ +NTQ LGQRIP   + QG   RARVFALTR +   AE VVT TVLV  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FV  A LELE LGFLLSVSTPSGS+L+ SQ V+ G+LSFD QTL  +LIQLDM+DFDVI+GMDWLA N+ANI+C +++VSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
        +LPS +NFTFK V   VPR V ALKA  LLQ GAW YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase8.8e-13040.32Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP   GEN ADP   PV    G+VPP P AA +      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E +FI+DF  
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
        +GPP F+  SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHAN                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VAQYERKFTELSRF  + +PTE LKI +F+ GLR+ I+G + L+ PT YA AVR ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+  + G+                           +RKF S  A   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q  AQ Q  PPVCP+CKK H   CW G K CF+C +EGHF REC M+ +NTQ L Q+ P   +TQG  Q ARVFALTR +   AE VVTGT+L+  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FVR A LELE  GF LSVSTPSGS+L+ SQ V+ G+LSF  QTL   LIQL+M+DFDVI+GMDWLA N+ANINC ++EVSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
         L S +NFTFK V   VPR VSALKA  LLQ G W YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

A0A6J1DQB9 Reverse transcriptase3.8e-1669.01Show/hide
Query:  ELDRSEVELAVEDVSAVLAQLSVKPTLRQRIITAQKGDSSLSKGFGMLGQGDFSLSKDKALLYQGRLCVPR
        EL+ SEVEL V+DVSA+LA+LSV+P+LRQRII AQK D SL+KGF M+G GDF+LS + ALL+ GRLCVP+
Subjt:  ELDRSEVELAVEDVSAVLAQLSVKPTLRQRIITAQKGDSSLSKGFGMLGQGDFSLSKDKALLYQGRLCVPR

A0A6J1DQB9 Reverse transcriptase7.5e-12947.02Show/hide
Query:  YLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN--------------------------------------VAQYERKFTELSRFALELIPTEALKI
        YL C++QFKVKG VFMLRGEALNWWDSVA AEDHAN                                      VAQYERKFTELS FA ELIPTEA+KI
Subjt:  YLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN--------------------------------------VAQYERKFTELSRFALELIPTEALKI

Query:  KRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVLELIQCNLRKARKIQEAFTLHLQKLVNAQ
        KRFVKGLRKGIRGPVDLQRP  YAEAVRG L+MD DVSN   PL EVGSSS                                                 
Subjt:  KRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVLELIQCNLRKARKIQEAFTLHLQKLVNAQ

Query:  EPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLLVGIIRALVDILWPALEGVPQDPTMAILQ
                                                                                                GV          
Subjt:  EPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLLVGIIRALVDILWPALEGVPQDPTMAILQ

Query:  SIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPVLRAPQRQAQHQGMPPVCPTCKKRHTGQC
                                                                    +RK    YAD   RAPQR AQ QG+PPVCP+C+KR  GQC
Subjt:  SIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPVLRAPQRQAQHQGMPPVCPTCKKRHTGQC

Query:  WTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQATLEL
        WTG++GCFRCGREGHFAREC M+AANTQRLGQR  P VSTQG                       GT LVH+VPAYVLFD GSSHTFIS+AFVRQATLEL
Subjt:  WTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQATLEL

Query:  EPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSFQLPSSRNFTFKRVTGRVPRTVSALKARC
        EPLGFLLSVSTPSGS+LIASQ VRAGELSFDNQTL ARLIQLDMRDFDVI+GMDWLATNQANINC +REVSFQLPS R+FTFK V+G VPR VSALKAR 
Subjt:  EPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSFQLPSSRNFTFKRVTGRVPRTVSALKARC

Query:  LLQNGAWGYLANVVDVSKTPP
        LL NGAW YLA+VVD+S TPP
Subjt:  LLQNGAWGYLANVVDVSKTPP

A0A6J1DTA8 uncharacterized protein LOC1110241142.6e-12138.72Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP P GE  ADP  P +     + PP P AA +      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E +FI+DF R
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
        +GPP F+  SER TA EEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDH N                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VAQYERKFTELSRF ++ IPTE LKI +F+ GLR  I+G + ++ PT YA A+R ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+    G+                           +RKF    +   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q   Q Q  PPVCP+CKK H G CW G + CFRC                     Q+ P   + QG  QRARVFALTR +   AE VVTGT+LV  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FVR A LELE LGFLLSVSTPSGS+L+ SQ V+ G+LSFD QT   +LIQLDM+DFDVI+GMDWLA N+ANINC ++EVSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
        +LPS +NFTFKRV   VPR VSALKA  LLQ GAW YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

A0A6J1DUM2 uncharacterized protein LOC1110232475.5e-16453.19Show/hide
Query:  MLPRRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKD
        M PR SMRLRAD DPAPG                                                         GVGGVQAPPPQHLHTPQSEARFIKD
Subjt:  MLPRRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKD

Query:  FMRYGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN---------------------------------
        F RYGPPTFD ESERATA EEWIRELEALYAYLGC+DQFKVKGAVFMLRGEALNWWDSVAAAED+AN                                 
Subjt:  FMRYGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN---------------------------------

Query:  -----VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVG
             VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPT YAEAVRGALVMDKDVSNKASPLPEVGSSS                  
Subjt:  -----VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVG

Query:  LVLELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVT
                                                                                                            
Subjt:  LVLELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVT

Query:  LLLVGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYA
                           GV                                                                      +RKFPSTYA
Subjt:  LLLVGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYA

Query:  DPVLRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL
        D VLRAPQRQAQHQGMPPVCPTC+KRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL
Subjt:  DPVLRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVL

Query:  VHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDM
        VHDVPAYVLFDSGSSHTFISS FVRQATLELEPLGFLLSVSTPSGSILIASQKVRA ELSFDNQTLRARLIQLDM
Subjt:  VHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDM

A0A6J1DWP4 uncharacterized protein LOC1110252151.5e-12439.65Show/hide
Query:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR
        RR+ R     DP P GE  ADP  PP     G+ PP P AA++      VPQVNPQ+ALL EALQ ++ NA G GG Q   P+    PQ E         
Subjt:  RRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMR

Query:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------
                 SER TAAEEW+RELEALY YLGC D FKV+GAVFMLRGEA+NWW+SVAAAEDHAN                                    
Subjt:  YGPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHAN------------------------------------

Query:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL
          VA+YERKFTELSRF ++ IPT+ LKI +F+ GLR+ I+G + L+ PT YA AVR ALVMDK                                     
Subjt:  --VAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPTNYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVL

Query:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL
            C                                                                                               
Subjt:  ELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKLSILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLL

Query:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV
                                                     +++ + ++V+    G+                           +RKF S  +   
Subjt:  VGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKEQVGSRGRNNFANIMQPRRWERKFPSTYADPV

Query:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD
         R  Q   Q Q  PPVCP+CKK H G CW G + C+RC +EGHFARECPM+ +NTQ LGQRIP   + QG   RARVFALTR +   AE VVT TVLV  
Subjt:  LRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFALTRKEAADAETVVTGTVLVHD

Query:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF
        +PAY LFDSGSSH+FI+S FV  A LELE LGFLLSVSTPSGS+L+ SQ V+ G+LSFD QTL  +LIQLDM+DFDVI+GMDWLA N+ANI+C +++VSF
Subjt:  VPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQANINCLRREVSF

Query:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP
        +LPS +NFTFK V   VPR V ALKA  LLQ GAW YLA+VVD  K  P
Subjt:  QLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGTTCTTCCAGACAATGCTTCCCCGTCGTAGTATGAGATTGCGAGCTGACACCGACCCCGCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCAGTTAG
GAACCAGGCAGGAATAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCTGCAGTTCCTCAGGTGAACCCCCAATTGGCATTGCTTGTGGAGGCCTTGC
AAGCAGTGATCAGTAACGCCGCAGGGGTGGGCGGGGTCCAAGCTCCACCACCCCAACACCTTCATACACCCCAGAGCGAGGCTCGCTTCATCAAGGATTTCATGCGCTAC
GGACCCCCAACCTTTGACTGTGAGAGTGAAAGAGCGACTGCAGCGGAAGAGTGGATCAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGATGACCAATTCAAGGT
GAAGGGCGCGGTTTTTATGTTGAGGGGCGAGGCCCTGAACTGGTGGGACTCAGTAGCAGCGGCAGAGGATCATGCGAATGTGGCACAGTATGAAAGGAAGTTCACGGAAC
TCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTAAAGATCAAGAGGTTTGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACC
AACTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTCAGTTCAACTTAGACTTCCATT
AATGCAACTTGAGTTTGTAGTAAATGAAGTTGGACTTGTTCTTGAATTAATTCAATGCAACTTGAGAAAGGCAAGGAAGATTCAAGAGGCTTTCACACTGCACCTTCAAA
AGCTTGTTAATGCACAAGAACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGAGAAAAGTTG
TCTATTTTGAGAGATGGCACGGAGGACAAAAAAAGTGTGCAGATTCGTGAACAGGATCTAGAAGTGACCATCGTAGATTCTAAGTCCGATCTAAGAGTGATTCATCTTGT
AACCTTACTCTTGGTTGGAATCATTCGAGCATTGGTCGACATCCTTTGGCCGGCTCTAGAGGGAGTACCGCAAGATCCTACTATGGCCATTCTTCAATCAATTCAAGGTA
TGGTGGAAATGATGAGGGAAGATAGGCAGGAAAGAAGGGTGCAACAACGAAGAGAAGAACGAGTCTTACAGGAAGATGAAGGTATGTTTGACTTCGATGTACAAAAAGAA
CAAGTTGGGAGTAGAGGAAGGAATAACTTTGCCAACATTATGCAACCGAGGAGATGGGAAAGAAAGTTCCCTTCGACTTATGCCGACCCGGTATTGAGAGCACCCCAGCG
CCAGGCTCAACACCAGGGCATGCCGCCAGTATGCCCCACCTGCAAGAAAAGACATACGGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGGGGC
ATTTTGCAAGGGAATGTCCCATGTCGGCCGCAAATACACAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCA
CTTACTCGCAAGGAAGCAGCGGATGCCGAAACGGTGGTCACAGGTACTGTTTTAGTCCATGACGTGCCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCAT
CTCTTCTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATTGCTAGCCAAAAGGTGAGGG
CAGGTGAGTTGTCTTTTGATAATCAGACTCTAAGGGCAAGACTGATCCAGCTGGACATGCGAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCC
AACATTAATTGCTTGAGAAGAGAAGTCTCCTTCCAACTACCTTCGAGTCGGAACTTTACTTTTAAAAGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGC
AAGATGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCCAACGTCGTCGACGTTAGTAAGACTCCACCTGTACCTCCTGGACCAGTGTTAGATGAGTTGGACCGTTCGG
AGGTAGAGTTAGCGGTGGAAGATGTTTCGGCAGTGCTGGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATCACTGCACAAAAGGGAGACTCCAGTCTGAGC
AAGGGTTTCGGGATGCTGGGCCAAGGAGATTTTTCCCTTTCGAAAGATAAGGCCCTGCTCTATCAGGGGAGACTGTGCGTACCAAGAGTAGATGAAACATTGTGCTATGA
AGAAGTACCCATTGGGATCGTAGCAAGAGAGACCAAAGTGCTGCGGAACCGGGTGATTGATTTGGTGAAGGTCTTGTGGAGGAACCACCAAATAGAAGAGGCTACCTGGG
AGAGAGAAGATGAAATCAGGGCCCATTATCCTGAATTGTTCGAGCAACGAAATTTCGAGGACGAAAGACCAATACGCTGGGAAGTCATCGGTACCTTGGGAATAAACGGC
AAGGACCGGTGCACAGTTCAGGCCTTGGGAATAAATGGCAAGGCCGAACGTCAAGTTTCTGTAGAGGATTATTGTTTGCTATCTCTCACTGCTTATTTGATTGCCTTGCT
ATCGTTTTATTCTTTATGCCGTGACTGGTCGACCGCGGTGATGACGTGGAGAGGGCTTGACCGAGGAAGGATCACTAGCTTAGTTAGTGAGGATAGAGGGAAGGGTCGTT
GGGGAGTTGCTTATCGCGTGTTTAGAGTTAAGGCTAGCCAGGTGAATTATTTAGCGTTGCCTAGGATGATATATAAAATCCTGGGGCGTTACAGATGGTATCAGAGCGGA
ACCTCTCCCAGTAAGATGTGGTTCGGGGACGAACCAAGGCAGAAGCTGGAAGCATGCTGGAAGGACTCGTGTTCGTCTGTAGGGGCAGTCTATGCTCTTCTAAGTGGCTT
AGAACTTGACGGTGGCGTAGCCAGGTCGCAGGGGAATGCCGGGGCAGAGAGGTGCCGAAATCGGGTTTCGAATCCTGGGCCTGGGGCGTTACAAGAGATTGTGGCATTCA
ACAATAGATCTGTCGAGTCTCCGACGAGGTATGATGACAACTTCTCCTCAATGTTTCGTTTCAGCAGCAACGACGCATGGAACCCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGTTCTTCCAGACAATGCTTCCCCGTCGTAGTATGAGATTGCGAGCTGACACCGACCCCGCTCCCGGAGGTGAGAACGGAGCGGATCCACCGCCCCCTCCAGTTAG
GAACCAGGCAGGAATAGTCCCTCCATTTCCTCCAGCAGCAGCTCGAGAGCGGGCAGATCCTGCAGTTCCTCAGGTGAACCCCCAATTGGCATTGCTTGTGGAGGCCTTGC
AAGCAGTGATCAGTAACGCCGCAGGGGTGGGCGGGGTCCAAGCTCCACCACCCCAACACCTTCATACACCCCAGAGCGAGGCTCGCTTCATCAAGGATTTCATGCGCTAC
GGACCCCCAACCTTTGACTGTGAGAGTGAAAGAGCGACTGCAGCGGAAGAGTGGATCAGAGAGTTGGAAGCCCTTTACGCGTATCTTGGTTGCGATGACCAATTCAAGGT
GAAGGGCGCGGTTTTTATGTTGAGGGGCGAGGCCCTGAACTGGTGGGACTCAGTAGCAGCGGCAGAGGATCATGCGAATGTGGCACAGTATGAAAGGAAGTTCACGGAAC
TCTCCCGTTTCGCTCTAGAGTTGATTCCCACTGAGGCATTAAAGATCAAGAGGTTTGTTAAAGGCTTGCGCAAGGGAATCAGAGGCCCGGTGGACCTCCAGCGACCCACC
AACTATGCTGAAGCGGTTAGGGGCGCCTTGGTTATGGATAAGGATGTTTCCAACAAGGCCTCACCCCTGCCAGAGGTCGGATCATCTTCAGTTCAACTTAGACTTCCATT
AATGCAACTTGAGTTTGTAGTAAATGAAGTTGGACTTGTTCTTGAATTAATTCAATGCAACTTGAGAAAGGCAAGGAAGATTCAAGAGGCTTTCACACTGCACCTTCAAA
AGCTTGTTAATGCACAAGAACCAACAAAGAGTTTTGAGCCCGAATTTATTCATAATGTTACTTCAATGAGTCAAGAAGAGAATGGAGCAAAGATGGCACGAGAAAAGTTG
TCTATTTTGAGAGATGGCACGGAGGACAAAAAAAGTGTGCAGATTCGTGAACAGGATCTAGAAGTGACCATCGTAGATTCTAAGTCCGATCTAAGAGTGATTCATCTTGT
AACCTTACTCTTGGTTGGAATCATTCGAGCATTGGTCGACATCCTTTGGCCGGCTCTAGAGGGAGTACCGCAAGATCCTACTATGGCCATTCTTCAATCAATTCAAGGTA
TGGTGGAAATGATGAGGGAAGATAGGCAGGAAAGAAGGGTGCAACAACGAAGAGAAGAACGAGTCTTACAGGAAGATGAAGGTATGTTTGACTTCGATGTACAAAAAGAA
CAAGTTGGGAGTAGAGGAAGGAATAACTTTGCCAACATTATGCAACCGAGGAGATGGGAAAGAAAGTTCCCTTCGACTTATGCCGACCCGGTATTGAGAGCACCCCAGCG
CCAGGCTCAACACCAGGGCATGCCGCCAGTATGCCCCACCTGCAAGAAAAGACATACGGGGCAGTGCTGGACGGGAAGTAAGGGTTGTTTCAGGTGCGGAAGAGAGGGGC
ATTTTGCAAGGGAATGTCCCATGTCGGCCGCAAATACACAGAGGTTGGGTCAGAGGATTCCACCACCAGTTTCGACGCAGGGAAATAACCAAAGGGCTCGTGTCTTCGCA
CTTACTCGCAAGGAAGCAGCGGATGCCGAAACGGTGGTCACAGGTACTGTTTTAGTCCATGACGTGCCTGCGTATGTATTGTTTGATTCGGGGTCGAGCCACACCTTCAT
CTCTTCTGCGTTTGTTCGTCAGGCAACCCTCGAATTAGAGCCGTTAGGGTTTTTGTTGTCGGTTTCTACACCATCAGGGTCGATTTTGATTGCTAGCCAAAAGGTGAGGG
CAGGTGAGTTGTCTTTTGATAATCAGACTCTAAGGGCAAGACTGATCCAGCTGGACATGCGAGATTTTGACGTTATTGTGGGCATGGATTGGCTAGCTACCAACCAAGCC
AACATTAATTGCTTGAGAAGAGAAGTCTCCTTCCAACTACCTTCGAGTCGGAACTTTACTTTTAAAAGGGTTACGGGTAGAGTCCCAAGGACAGTATCAGCGTTGAAGGC
AAGATGCCTGTTGCAGAATGGAGCTTGGGGATATTTGGCCAACGTCGTCGACGTTAGTAAGACTCCACCTGTACCTCCTGGACCAGTGTTAGATGAGTTGGACCGTTCGG
AGGTAGAGTTAGCGGTGGAAGATGTTTCGGCAGTGCTGGCTCAACTCTCGGTCAAACCCACCCTAAGACAACGAATCATCACTGCACAAAAGGGAGACTCCAGTCTGAGC
AAGGGTTTCGGGATGCTGGGCCAAGGAGATTTTTCCCTTTCGAAAGATAAGGCCCTGCTCTATCAGGGGAGACTGTGCGTACCAAGAGTAGATGAAACATTGTGCTATGA
AGAAGTACCCATTGGGATCGTAGCAAGAGAGACCAAAGTGCTGCGGAACCGGGTGATTGATTTGGTGAAGGTCTTGTGGAGGAACCACCAAATAGAAGAGGCTACCTGGG
AGAGAGAAGATGAAATCAGGGCCCATTATCCTGAATTGTTCGAGCAACGAAATTTCGAGGACGAAAGACCAATACGCTGGGAAGTCATCGGTACCTTGGGAATAAACGGC
AAGGACCGGTGCACAGTTCAGGCCTTGGGAATAAATGGCAAGGCCGAACGTCAAGTTTCTGTAGAGGATTATTGTTTGCTATCTCTCACTGCTTATTTGATTGCCTTGCT
ATCGTTTTATTCTTTATGCCGTGACTGGTCGACCGCGGTGATGACGTGGAGAGGGCTTGACCGAGGAAGGATCACTAGCTTAGTTAGTGAGGATAGAGGGAAGGGTCGTT
GGGGAGTTGCTTATCGCGTGTTTAGAGTTAAGGCTAGCCAGGTGAATTATTTAGCGTTGCCTAGGATGATATATAAAATCCTGGGGCGTTACAGATGGTATCAGAGCGGA
ACCTCTCCCAGTAAGATGTGGTTCGGGGACGAACCAAGGCAGAAGCTGGAAGCATGCTGGAAGGACTCGTGTTCGTCTGTAGGGGCAGTCTATGCTCTTCTAAGTGGCTT
AGAACTTGACGGTGGCGTAGCCAGGTCGCAGGGGAATGCCGGGGCAGAGAGGTGCCGAAATCGGGTTTCGAATCCTGGGCCTGGGGCGTTACAAGAGATTGTGGCATTCA
ACAATAGATCTGTCGAGTCTCCGACGAGGTATGATGACAACTTCTCCTCAATGTTTCGTTTCAGCAGCAACGACGCATGGAACCCGTAA
Protein sequenceShow/hide protein sequence
MPFFQTMLPRRSMRLRADTDPAPGGENGADPPPPPVRNQAGIVPPFPPAAARERADPAVPQVNPQLALLVEALQAVISNAAGVGGVQAPPPQHLHTPQSEARFIKDFMRY
GPPTFDCESERATAAEEWIRELEALYAYLGCDDQFKVKGAVFMLRGEALNWWDSVAAAEDHANVAQYERKFTELSRFALELIPTEALKIKRFVKGLRKGIRGPVDLQRPT
NYAEAVRGALVMDKDVSNKASPLPEVGSSSVQLRLPLMQLEFVVNEVGLVLELIQCNLRKARKIQEAFTLHLQKLVNAQEPTKSFEPEFIHNVTSMSQEENGAKMAREKL
SILRDGTEDKKSVQIREQDLEVTIVDSKSDLRVIHLVTLLLVGIIRALVDILWPALEGVPQDPTMAILQSIQGMVEMMREDRQERRVQQRREERVLQEDEGMFDFDVQKE
QVGSRGRNNFANIMQPRRWERKFPSTYADPVLRAPQRQAQHQGMPPVCPTCKKRHTGQCWTGSKGCFRCGREGHFARECPMSAANTQRLGQRIPPPVSTQGNNQRARVFA
LTRKEAADAETVVTGTVLVHDVPAYVLFDSGSSHTFISSAFVRQATLELEPLGFLLSVSTPSGSILIASQKVRAGELSFDNQTLRARLIQLDMRDFDVIVGMDWLATNQA
NINCLRREVSFQLPSSRNFTFKRVTGRVPRTVSALKARCLLQNGAWGYLANVVDVSKTPPVPPGPVLDELDRSEVELAVEDVSAVLAQLSVKPTLRQRIITAQKGDSSLS
KGFGMLGQGDFSLSKDKALLYQGRLCVPRVDETLCYEEVPIGIVARETKVLRNRVIDLVKVLWRNHQIEEATWEREDEIRAHYPELFEQRNFEDERPIRWEVIGTLGING
KDRCTVQALGINGKAERQVSVEDYCLLSLTAYLIALLSFYSLCRDWSTAVMTWRGLDRGRITSLVSEDRGKGRWGVAYRVFRVKASQVNYLALPRMIYKILGRYRWYQSG
TSPSKMWFGDEPRQKLEACWKDSCSSVGAVYALLSGLELDGGVARSQGNAGAERCRNRVSNPGPGALQEIVAFNNRSVESPTRYDDNFSSMFRFSSNDAWNP